Voice Recognition Books

MagicBeanDip.com

Page 1 of 34 - Go to page: 1 2 3 4 5 6 12

Enterprise Integration Patterns: Designing, Building, and Deploying Messaging Solutions (Addison-Wesley Signature Series)

Gregor Hohpe, Bobby Woolf

Enterprise Integration Patterns: Designing, Building, and Deploying Messaging Solutions (Addison-Wesley Signature Series) Gregor Hohpe, Bobby Woolf Amazon Price: $47.99
List Price: $59.99
Usually ships in 24 hours
By: Addison-Wesley Professional
Amazon Marketplace: 49 new & used starting at $43.94

Buy at Amazon.com

Browse similar items by category:
Subjects -> Business & Investing -> Industries & Professions -> MIS
Subjects -> Business & Investing -> Management & Leadership -> Management
Subjects -> Computers & Internet -> Business & Culture -> Manager's Guides to Computing

Customer Reviews:
Total reviews: 31 Average rating: 4.5 of 5

Excellent book for Software Architect and Software Engineer 5 out of 5 stars.
1 of 1 people found this review helpful.

Many books have been written about SOA, but most of them are just about the theory of SOA. It's important for Software Architects and Software Engineers to understand the theory, but just knowing the theory is not enough to develop system utilizing SOA principles.

This book fits nicely to bridge the gap between theory and practice. It contains not only the theory behind the patterns that can be used to design a loosely coupled, scalable system, but also the code in Java and C# on how to implement the pattern to build the system.

If you are serious on building a loosely couple system and strongly believe on the powerful of messaging system to accomplish this task, then you have to read this book from the beginning to the end, it will help you to design the system without reinventing the wheel.

Great book for messaging pattern understanding 5 out of 5 stars.
0 of 0 people found this review helpful.

This is a fantastic book if you are looking for patterns to base your messaging designs and architecture around. The way this book goes about explaining some of the asynchronous messaging patterns seemed to provide a great deal of benefit to developers and designers who were stuck in the synchronous way of doing things. Great explanations and illustrations, would recommend to anyone researching EAI or ESB technologies or just a more structured, efficient way of messaging in general.

Speech and Language Processing (2nd Edition) (Prentice Hall Series in Artificial Intelligence)

Daniel Jurafsky, James H. Martin

Speech and Language Processing (2nd Edition) (Prentice Hall Series in Artificial Intelligence) Daniel Jurafsky, James H. Martin Amazon Price: $92.00
List Price: $115.00
Usually ships in 24 hours
By: Prentice Hall
Amazon Marketplace: 28 new & used starting at $92.00

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Machine Vision
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Human Vision & Language Systems
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing

Customer Reviews:
Total reviews: 1 Average rating: 5.0 of 5

Editorial Review:

An explosion of Web-based language techniques, merging of distinct fields, availability of phone-based dialogue systems, and much more make this an exciting time in speech and language processing. The first of its kind to thoroughly cover language technology – at all levels and with all modern technologies – this book takes an empirical approach to the subject, based on applying statistical and other machine-learning algorithms to large corporations. Builds each chapter around one or more worked examples demonstrating the main idea of the chapter, usingthe examples to illustrate the relative strengths and weaknesses of various approaches. Adds coverage of statistical sequence labeling, information extraction, question answering and summarization, advanced topics in speech recognition, speech synthesis. Revises coverage of language modeling, formal grammars, statistical parsing, machine translation, and dialog processing. A useful reference for professionals in any of the areas of speech and language processing.

Dragon NaturallySpeaking for Dummies

David C. Kay, Doug Muder

Dragon NaturallySpeaking for Dummies David C. Kay, Doug Muder Amazon Price: $26.99
List Price: $29.99
Usually ships in 24 hours
By: For Dummies
Amazon Marketplace: 27 new & used starting at $19.50

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Business -> General
Subjects -> Computers & Internet -> Software -> Voice Recognition

Customer Reviews:
Total reviews: 11 Average rating: 3.0 of 5

Editorial Review:

Free at last! Finally, someone has come along to free you from your keyboard. With Dragon NaturallySpeaking, the miraculous voice-recognition software in your computer, you can browse the Web, control your applications, control your desktop, write documents, and more without ever once laying finger to plastic. But don’t run out and get yourself fitted for that Star Fleet uniform just yet, cadet. Dragon NaturallySpeaking is the most accurate voice recognition software on the market, and while it really does deliver on all its claims, it can be very finicky, and getting top results can be tricky.

The complete guide to the care of feeding or your Dragon, Dragon NaturallySpeaking For Dummies is a must-have companion for voice-recognition trailblazers who are ready to:

  • Kiss that keyboard goodbye and say hello to hands-free computing
  • Verbally control your Windows desktop and most applications
  • Dictate, edit, format and proofread documents in Word and WordPerfect
  • Browse the Web and compose and send email by voice
  • Use a pocket digital recorder on the run

Here’s all you need to fire up your Dragon and get it dancing to your tune. Your total guide to installing, configuring, fine-tuning and getting the most out of that amazing voice recognition software, Dragon NaturallySpeaking For Dummies covers all the bases, including:

  • Installing, configuring, and launching your Dragon
  • Dictating, editing, proofreading, and formatting documents in NaturallySpeaking
  • Recording speech onto the NaturallySpeaking recorder and transcribing recorded speech
  • Dictating into other applications
  • Controlling your desktop and windows by voice
  • Using NaturalWord for Word and WordPerfect
  • Browsing the Web, emailing and faxing by voice
  • Managing databases hands-free
  • Maximizing voice recognition accuracy
  • Having multiple users and vocabularies
  • Adding specialized items and verbal shortcuts to Dragon’s vocabulary

With the introduction of Dragon NaturallySpeaking the old dream of hands-free computing has finally become reality. Now let Dragon NaturallySpeaking For Dummies show you how to give your Dragon wings and make it soar.

Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition (Prentice Hall Series in Artificial Intelligence)

Daniel Jurafsky, James H. Martin

Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition (Prentice Hall Series in Artificial Intelligence) Daniel Jurafsky, James H. Martin Amazon Price: $92.00
List Price: $115.00
Usually ships in 24 hours
By: Prentice Hall
Amazon Marketplace: 40 new & used starting at $12.95

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> General
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Machine Learning
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Machine Vision

Customer Reviews:
Total reviews: 18 Average rating: 4.5 of 5

Good oveview, slightly overrated: broad and shallow 3 out of 5 stars.
27 of 33 people found this review helpful.

GENERAL IDEA: Broad coverage, it lacks depth and details - particularly practical details. That is, the presentation is often sketchy, mainly because it approaches too many subjects for its available space. I would not say that this book is strong on theory either. It is quite obvious that it avoids getting too formal and precise, probably to remain attractive for non-specialists too.

CASE STUDY: One specific problem I had with the Hidden Markov Models, that are supperficially presented (or spread I could say) in several separate sections of the book, so it's not been a pleasure trying to actually understand them properly and completely as a fundamental concept, to make them work in my particular application.

TITLE: The book's title IS misleading because it starts with "Speeech" and this book's main subject is not speech but (written) language. Actually there are only a few chapters on speech.

CONCLUSION: Get this book if you are looking for a good overview of the field. The book will introduce you to a thousand of topics. As soon as you need in-depth coverage of some particular topic, you will look for additional resources.

Editorial Review:

This book takes an empirical approach to language processing, based on applying statistical and other machine-learning algorithms to large corpora.Methodology boxes are included in each chapter. Each chapter is built around one or more worked examples to demonstrate the main idea of the chapter. Covers the fundamental algorithms of various fields, whether originally proposed for spoken or written language to demonstrate how the same algorithm can be used for speech recognition and word-sense disambiguation. Emphasis on web and other practical applications. Emphasis on scientific evaluation. Useful as a reference for professionals in any of the areas of speech and language processing.

Statistical Methods for Speech Recognition (Language, Speech, and Communication)

Frederick Jelinek

Statistical Methods for Speech Recognition (Language, Speech, and Communication) Frederick Jelinek Amazon Price: $38.88
List Price: $54.00
Usually ships in 24 hours
By: The MIT Press
Amazon Marketplace: 15 new & used starting at $32.62

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Computers & Internet -> Software -> Natural Language Processing

Customer Reviews:
Total reviews: 7 Average rating: 5.0 of 5

Thorough Overview of Stats and Algorithms for Speech Rec 5 out of 5 stars.
18 of 18 people found this review helpful.

This book provides a comprehensive introduction to the statistical models and algorithms used for speech recognition. Jelinek sets up the speech recognition problem in the traditional way as the decoding half of Shannon's noisy channel model. While Jelinek glosses over signal processing, he provides an excellent overview of the symbolic stages of processing involved in speech recognition.

After a quick introduction, Jelinek digs into the statistics behind Hidden Markov Models (HMMs), the foundation of almost all of today's speech recognizers. This is followed by chapters devoted to acoustic modeling (probability of acoustics given words) and language modeling (probability of a given sequence of words), and the algorithmic search induced by this model. There are also advanced chapters on fast match (widely used heuristics for pruning search), the Expectation-Maximization (EM) algorithm for training, and the use of decision trees, maximum entropy and backoff for language models. He covers several auxiliary topics including information theory and perplexity, the spelling to phoneme mapping, and the use of triphones for cross-phoneme modeling. Each chapter is a worthy introduction to an important topic.

This book does not presuppose much in the way of mathematical, computational, or linguistic background. A simple intro to probability and some experience with search problems would be of help, but isn't necessary -- you'll learn a lot about these topics reading the book.

All in all, this is the best thorough introduction to speech recognition that you can find. Read it along with Manning and Schuetze's "Foundations of Statistical Natural Language Processing" from the same series; there's a little overlap in language modeling, but not much. You might want to start with the gentler book by Jurafsky and Martin, "Speech and Language Processing", before tackling either Jelinek or Manning and Schuetze.

Editorial Review:

This book reflects decades of important research on the mathematical foundations of speech recognition. It focuses on underlying statistical techniques such as hidden Markov models, decision trees, the expectation-maximization algorithm, information theoretic goodness criteria, maximum entropy probability estimation, parameter and data clustering, and smoothing of probability distributions. The author's goal is to present these principles clearly in the simplest setting, to show the advantages of self-organization from real data, and to enable the reader to apply the techniques.

Discrete-Time Speech Signal Processing: Principles and Practice (Prentice Hall Signal Processing Series)

Thomas F. Quatieri

Discrete-Time Speech Signal Processing: Principles and Practice (Prentice Hall Signal Processing Series) Thomas F. Quatieri Amazon Price: $80.00
List Price: $100.00
Usually ships in 24 hours
By: Prentice Hall PTR
Amazon Marketplace: 12 new & used starting at $65.00

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition

Customer Reviews:
Total reviews: 2 Average rating: 4.0 of 5

Editorial Review:

Essential principles, practical examples, current applications, and leading-edge research.

In this book, Thomas F. Quatieri presents the field's most intensive, up-to-date tutorial and reference on discrete-time speech signal processing. Building on his MIT graduate course, he introduces key principles, essential applications, and state-of-the-art research, and he identifies limitations that point the way to new research opportunities.

Quatieri provides an excellent balance of theory and application, beginning with a complete framework for understanding discrete-time speech signal processing. Along the way, he presents important advances never before covered in a speech signal processing text book, including sinusoidal speech processing, advanced time-frequency analysis, and nonlinear aeroacoustic speech production modeling. Coverage includes:

Speech production and speech perception: a dual view

Crucial distinctions between stochastic and deterministic problems

Pole-zero speech models

Homomorphic signal processing

Short-time Fourier transform analysis/synthesis

Filter-bank and wavelet analysis/synthesis

Nonlinear measurement and modeling techniques

The book's in-depth applications coverage includes speech coding, enhancement, and modification; speaker recognition; noise reduction; signal restoration; dynamic range compression, and more. Principles of Discrete-Time Speech Processing also contains an exceptionally complete series of examples and Matlab exercises, all carefully integrated into the book's coverage of theory and applications.

Fundamentals of Speech Recognition (Prentice Hall Signal Processing Series)

Lawrence Rabiner, Biing-Hwang Juang

Fundamentals of Speech Recognition (Prentice Hall Signal Processing Series) Lawrence Rabiner, Biing-Hwang Juang Amazon Price: $88.20
List Price: $98.00
Usually ships in 24 hours
By: Prentice Hall PTR
Amazon Marketplace: 20 new & used starting at $58.25

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Computers & Internet -> Software -> Natural Language Processing

Customer Reviews:
Total reviews: 7 Average rating: 4.5 of 5

Good but contaminated with Linear Predictive Coding 3 out of 5 stars.
14 of 19 people found this review helpful.

Since this book misguides students of speech signal processing with the outdated compression technique of Linear Predictive Coding (LPC, which is far inferior to cepstral vocoding because of LPC's stateful memory of voiced excitation from one frame to the next), it ought to be half the price of Jelinek's book, not twice.

Editorial Review:

Provides a theoretically sound, technically accurate, and complete description of the basic knowledge and ideas that constitute a modern system for speech recognition by machine. Covers production, perception, and acoustic-phonetic characterization of the speech signal; signal processing and analysis methods for speech recognition; pattern comparison techniques; speech recognition system design and implementation; theory and implementation of hidden Markov models; speech recognition based on connected word models; large vocabulary continuous speech recognition; and task- oriented application of automatic speech recognition. For practicing engineers, scientists, linguists, and programmers interested in speech recognition.

The Dragon: NaturallySpeaking Guide Speech Recognition Made Fast and Simple

Dan Newman

The Dragon: NaturallySpeaking Guide Speech Recognition Made Fast and Simple Dan Newman List Price: $19.95
By: Waveside Pub
Amazon Marketplace: 1 new & used starting at $41.12

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Computers & Internet -> Software -> Natural Language Processing

Customer Reviews:
Total reviews: 7 Average rating: 5.0 of 5

Editorial Review:

The latest voice-recognition software boasts accuracy of 96 percent or better, corresponding to just a handful of errors in the average business e-mail message. Dragon NaturallySpeaking consistently comes out at or near the top of the rankings of such software. The Dragon NaturallySpeaking Guide documents the popular productivity aid fully, taking the reader from initial setup and training through fairly advanced vocabulary-expansion procedures and macro-building.

Author Dan Newman recognizes that Dragon NaturallySpeaking represents a whole new breed of program for many people, and takes time to explain the details of its efficient use. Along the way, you get a comprehensive look at NaturallySpeaking's user interface, so you can look up any detail whose function baffles you.

This book takes special care to highlight differences among the Standard, Professional, Medical, and Legal variants of Dragon NaturallySpeaking, making it a good choice if you're thinking about deploying the software but unsure about which version to buy. Coverage of the program's shortcut facilities are great too, including coverage of shorthands (which are short passages of text inserted with a single command) and macros (which can insert long passages of text and include variables, making it easy to generate form letters). Though Dragon NaturallySpeaking is far from perfect and most experts agree that it will have to improve its accuracy to gain wide acceptance, this book is a very good snapshot of the program as it exists today. --David Wall

Topics covered: Choosing a version of Dragon NaturallySpeaking, training for maximum recognition, issuing voice commands, integrating with Microsoft Word and other programs, creating shorthands and macros, and using Dragon's handheld voice recorder.

Voice User Interface Design

Michael H. Cohen, James P. Giangola, Jennifer Balogh

Voice User Interface Design Michael H. Cohen, James P. Giangola, Jennifer Balogh Amazon Price: $46.74
List Price: $54.99
Usually ships in 24 hours
By: Addison-Wesley Professional
Amazon Marketplace: 20 new & used starting at $28.95

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Programming -> Software Design, Testing & Engineering -> Software Development
Subjects -> Computers & Internet -> Programming -> General
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing

Customer Reviews:
Total reviews: 3 Average rating: 4.5 of 5

Very comprehensive 5 out of 5 stars.
11 of 11 people found this review helpful.

This book describes a methodology and gives pieces of advice for developing speech applications. The focus is on telephony applications. The book is based on the experience of the authors in developing such applications at Nuance.

The book is organized into four parts:
1. Introduction
2. Requirement gathering
3. Detailed design
4. Development and Tuning
Each part starts with a description of the general principles guiding the development of a speech application. They end with an "applied" example showing how these principles are used in a real application.

The introduction provides an overview of speech technology and an overview of the methodology (requirements, detailed design, development/tuning) used to develop a speech application. This methodology is used as a guide for the rest of the book.

The requirement gathering part covers meeting with the company that wants to deploy the speech application and getting information from them. The same kind of information as for other software projects is required: business case, target customers, environment integration, scope of the system, etc. Two interesting additions to the usual process are:
1. Specifying the persona. How should the system be perceived (serious, funny, etc.)? This will impact the prompts, the selection of the voice actor, and the design of the dialog flow.
2. Specifying the type of interaction: system directed or user directed. The former relies on grammars. The latter relies on SLM and robust parsing. This has a huge influence on design and realization.

The detailed design phase is concerned with designing the dialog flow, the prompts and the grammars. The authors put an emphasis in developing systems that (1) sound good and (2) are efficient. Sounding good means developing prompts that abide to spoken language rules (by opposition to written language) and paying attention to prosody. The sections on prompt design and prosody are very informative. Efficiency is ensured by making the dialog flow nicely. Techniques include thinking in terms of user scenarios, providing shortcuts to common tasks, educating users about efficient ways of using the system. Efficiency is also improved by helping users to recover from errors efficiently. Techniques here include quick confirmation strategies, providing help prompts, and providing access to main menu/operator.

The development and tuning part focuses mainly on tuning grammars and working with the voice actor. Tuning the grammar is done to ensure appropriate coverage while maintaining good recognition accuracy. Tuning must be based on real data since it is difficult to predict how people will use the system. Working with the voice actor is an important part of the system development. The authors give pieces of advice on how to have successful recording sessions.

The book has a nice balance of general principles and pieces of advice that can be directly applied. Compared to Kotelly's book, it has a more in-depth coverage of the topics. Compared to Balentine's book it provides a broader view of the development process as well as more detailed explanations of the principles behind the recommendations. On the minus side, the book is solely based on the experience of the authors. Although this experience is extensive, it seems that parts of the book are somewhat biased (e.g., SLM vs. grammar-based speech recognition, high focus on personas). It is not always clear when the numbers given in the books are based on real experience and when they are invented by the authors for the mock application. Some of the pieces of advice may also be difficult to directly apply in practice, since they depend on using vendor tools.

In my opinion this book should be required reading for developers of telephony applications and providers of platforms for speech application development.

Springer Handbook of Speech Processing (Springer Handbook of)

Springer Handbook of Speech Processing (Springer Handbook of) Amazon Price: $159.20
List Price: $199.00
Usually ships in 1 to 3 weeks
By: Springer
Amazon Marketplace: 19 new & used starting at $142.59

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Machine Vision
Subjects -> Computers & Internet -> Computer Science -> Circuitry -> Communication & Signal Processing
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing

Customer Reviews:
Total reviews: 1 Average rating: 5.0 of 5

Editorial Review:

From common consumer products such as cell phones and MP3 players to more sophisticated projects such as human-machine interfaces and responsive robots, speech technologies are now everywhere. Many think that it is just a matter of time before more applications of the science of speech become inescapable in our daily life. This handbook is meant to play a fundamental role for sustainable progress in speech research and development. Springer Handbook of Speech Processing targets three categories of readers: graduate students, professors and active researchers in academia and research labs, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. The handbook could also be used as a sourcebook for one or more graduate courses on signal processing for speech and different aspects of speech processing and applications. A quickly accessible source of application-oriented, authoritative and comprehensive information about these technologies, it combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.


Page 1 of 34 - Go to page: 1 2 3 4 5 6 12

Return to MagicBeanDip.com

This page was created in 1.6576 seconds.