Voice Recognition Books - Page 4

MagicBeanDip.com

Page 4 of 34 - Go to page: 1 2 3 4 5 6 7 8 9 15

Electronic Speech Synthesis

List Price: $50.00
By: McGraw-Hill Companies
Amazon Marketplace: 10 new & used starting at $1.35

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Computers & Internet -> General

Speech Recognition for the Health Professions: Using Dragon Naturally Speaking

Michael Freeman Bliss

List Price: $49.20
By: Prentice Hall
Amazon Marketplace: 22 new & used starting at $1.72

Browse similar items by category:
Subjects -> Business & Investing -> Business Life -> General
Subjects -> Business & Investing -> Business Life -> General AAS
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing

Customer Reviews:
Total reviews: 1 Average rating: 3.0 of 5

lots of info, somewhat helpful 3 out of 5 stars.
6 of 6 people found this review helpful.

i just bought DNS9 Medical for my primary care practice.

the program itself is incredibly improved from older versions. the other doc in my office tried it *without* doing any of the training, and had 100% accuracy in word recognition straight out of the box. now, she's a believer.

this book seems aimed more at a medical transcriptionist than at physicians. that said, i did find some parts useful. however, the chapter on "history of speech recognition" didn't do a lot for me, and much of the other info was available in the "help" files of the program.

also, there were a number of annoying errors: for example, the chapter on "improving recognition accuracy" talks about "the four tenants (sic) of voice recognition." there were quite a few mistakes in other areas: for example, illustrations in which the text was supposed to be changed to all caps was shown in bold italic.

still, this book would do very well if i wanted to train a transcriptionist to listen to my dictation and then dictate it into DNS.

if you're new to DNS, i'd spend a few days playing with the program first, paying attention to the help menus and especially to how much you can do with "add new command." it's trivially easy to add new boilerplate, greatly increasing the speed of dictation (especially normals).

Editorial Review:

For courses in Medical Transcription and Medical Clerical. This first-of-its-kind educational tool introduces skill sets that promote successful speech recognition to students entering the profession of healthcare documentation. The text's understandable format enables students to become familiar with the history of speech recognition, gain an understanding of the hardware requirements necessary to operate the software successfully, get an overview of a popular speech software program, and learn how to use speech recognition successfully for professional dictation.

Mathematical Models of Spoken Language

Stephen Levinson

Amazon Price: $103.36
List Price: $140.00
Usually ships in 24 hours
By: Wiley
Amazon Marketplace: 24 new & used starting at $24.00

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Computers & Internet -> Software -> Natural Language Processing

Editorial Review:

Mathematical Models of Spoken Language presents the motivations for, intuitions behind, and basic mathematical models of natural spoken language communication. A comprehensive overview is given of all aspects of the problem from the physics of speech production through the hierarchy of linguistic structure and ending with some observations on language and mind.

The author comprehensively explores the argument that these modern technologies are actually the most extensive compilations of linguistic knowledge available. Throughout the book, the emphasis is on placing all the material in a mathematically coherent and computationally tractable framework that captures linguistic structure.

It presents material that appears nowhere else and gives a unification of formalisms and perspectives used by linguists and engineers. Its unique features include a coherent nomenclature that emphasizes the deep connections amongst the diverse mathematical models and explores the methods by means of which they capture linguistic structure.

This contrasts with some of the superficial similarities described in the existing literature. Other distinctive features are the historical background and origins of the theories and models; the connections to related disciplines, e.g. artificial intelligence, automata theory and information theory; an elucidation of the current debates and their intellectual origins; many important little-known results and some original proofs of fundamental results, e.g. a geometric interpretation of parameter estimation techniques for stochastic models; and, finally, the author's own unique perspectives on the future of this discipline.

There is a vast literature on speech recognition and synthesis; however, this book is unlike any other in the field. Although it appears to be a rapidly advancing field, the fundamentals have not changed in decades. Most of the results are presented in journals, from which it is difficult to integrate and evaluate all of these recent ideas. Some of the fundamentals have been collected into textbooks, which give detailed descriptions of the techniques but no motivation or perspective. The linguistic texts are mostly descriptive and pictorial, lacking the mathematical and computational aspects. This book strikes a useful balance by covering a wide range of ideas in a common framework. It provides all the basic algorithms and computational techniques together with an analysis and perspective that allow one to intelligently read the latest literature and understand state-of-the-art techniques as they evolve.

Desirable Future: Consumer Electronics in Tomorrow's World (Science Museum TechKnow Series)

Jack Challoner

Amazon Price: $20.00
List Price: $20.00
Usually ships in 24 hours
By: Wiley
Amazon Marketplace: 1 new & used starting at $20.00

Browse similar items by category:
Subjects -> Computers & Internet -> Business & Culture -> Culture
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition

Editorial Review:

With the pace of current development in consumer electronics, it is tempting to picture a future in which everyone carries miniature gadgets whose batteries never run out, and which inform, entertain, translate, communicate, memorise and organise. Is this a realistic picture… or is the future of consumer electronics threatened by factors such as climate change, global politics and, of course, people's continued willingness to 'buy into' the modern world?

In Desirable Future? Jack Challoner takes a thought-provoking look at this powerful industry. He describes the technologies - such as new power supplies, increased miniaturisation, and convergence - that will drive it forward, and looks at the role that artificial intelligence and speech recognition will play. He asks whether the current rate of progress is sustainable, or whether the industry is heading for a state of collapse... He raises some fascinating issues along the way, such as:  

  • Will you ever be able to have intelligent conversations with your gadgets?
  • Will we have to keep buying new formats for films and music?
  • How many obsolete mobile phones and chargers do you have?
  • Can you only afford new gadgets when they're already going out of fashion?
  • Is built-in obsolescence a commercial strategy or a necessity?

If you're someone who can't make it through the day without your mobile phone, PDA, MP3 player etc, then this book will give you some food for thought about how we became such a gadget-obsessed society and what the future holds…

About the author
Jack has written nearly 30 books for children, teenagers and adults. In addition he often acts as consultant science editor for books, magazines, science activity packs and CD-ROMs. He also presents live science shows in museums, schools and libraries.

http://www.explaining-science.co.uk/

Speech Recognition: Theory and C++ Implementation

Claudio Becchetti, Lucio Prina Ricotti

List Price: $190.00
By: Wiley
Amazon Marketplace: 14 new & used starting at $103.66

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Human Vision & Language Systems
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Theory of Computing
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> General

Customer Reviews:
Total reviews: 6 Average rating: 4.0 of 5

Editorial Review:

Automatic Speech Recognition (ASR) is the enabling technology for hands-free dictation and voice-triggered computer menus. It is becoming increasingly prevalent in environments such as private telephone exchanges and real-time information services. Speech Recognition introduces the principles of ASR systems, including the theory and implementation issues behind multi-speaker continuous speech recognition. Focusing on the algorithms employed in commercial and laboratory systems, the treatment enables the reader to devise practical solutions for ASR system problems. It addresses in detail C++ programming techniques used to develop ASR applications, thus offering skills that will prove useful in any large C++ based software project. Possible extensions of the well-established ASR technology are highlighted, based on "Hidden Markov Models" applied to fields such as modelling and prediction of econometric series. Features include:
* Accompanying website containing all C++ source code of a complete laboratory multi-speaker continuous-speech ASR system (e.g. Initialisation, Training, Recognition, Evaluation, etc.) www.wiley.com/go/becchetti_speech
* Detailed theoretical, mathematical and technical explanations of ASR
* A practical account of the functioning of ASR
A crucial source of information for researchers, developers and project managers involved with ASR systems, Speech Recognition is also structured for use by students of digital signal processing, speech recognition and C++ programming techniques.

VoiceXML: Professional Developer's Guide with CDROM

Chetan Sharma, Jeff Kunins

List Price: $49.99
By: Wiley
Amazon Marketplace: 2 new & used starting at $125.00

Browse similar items by category:
Subjects -> Computers & Internet -> Digital Music -> General
Subjects -> Computers & Internet -> Digital Music -> General AAS
Subjects -> Computers & Internet -> Programming -> Languages & Tools -> General

Customer Reviews:
Total reviews: 10 Average rating: 5.0 of 5

Good coverage, up-to-date, very useful 5 out of 5 stars.
11 of 11 people found this review helpful.

This is the best VoiceXML book I've seen. Most VoiceXML books try to do too much: talk about voice hardware, telephony, the history of voice, tts, as well as be a VoiceXML reference. The weakness of these books is that one or more of these sections reveals that the authors do not really command the knowledge needed to make these sections useful. This book also attempts to do these things, but for the most part is able to carry it off.

If you're looking for a reference, this is the book to get. The reference section is current with VoiceXML 2.0 (October 2001), which is an advantage in and of itself. But the real strength of the reference section is its depth. Each element has an entry for syntax (how to invoke the element), a description (what the element is used for), a thorough discussion of its attributes (that is, a description of each attribute), a usage statement (the element's parents and children), and an example (a snippet of complete code that uses the element). The examples and discussion of attributes really set this book apart from its peers.

There is a brief discussion of the architecture of a VoiceXML app, and a couple of paragraphs discussing the differences between VoiceXML 1.0 and 2.0.

The book also gives, contrary to my expectations, a history of the voice industry, a history of VoiceXML, and a discussion of players in the industry. What makes this book's treatment of these topics unusual is that the authors (particularly Kunins, I suspect) actually know these fields. I don't normally want these sections in a reference book (it just adds bulk around the section I really want) but I found them quite compelling here. I learned quite a bit from reading them.

The book also contains sections on Dynamic VoiceXML, Security, Voice App Life Cycle, VUI Design, the Future of VoiceXML, and a case study. I haven't read these sections yet, so I can't comment on them. I do know, however, that the sections I have read are sufficiently superior to make this THE VoiceXML book on their own.

If I were to criticize the book, I would fault the authors' lavish praise of TellMe (this is minor and not unexpected) and the examples in the reference section. The examples are quite good for someone learning VoiceXML, and the authors are commended for including them. The fault (albeit a minor one) is that they are fairly vanilla. So, while I would have preferred more examples, I concede that such examples would make the book much larger and the inclusion of "advanced" examples to the exclusion of "canonical" examples would have made them less useful to developers learning VoiceXML.

Overall, if you are going to own one VoiceXML reference, THIS should be that one.

Editorial Review:

Learn how to build voice-enabled applications using VoiceXML
VoiceXML is designed for creating human-computer dialogs that feature synthesized speech, digitized audio, recognition of spoken and key input, recording of spoken input, telephony, and mixed-initiative conversations. Providing a detailed look at this markup language, VoiceXML (Version 2.0 is covered) takes the reader from the basics of voice solutions all the way through to building and running an application. It also reviews the critical success factors when designing and implementing a voice strategy and provides a glimpse into the future of voice technologies. The authors include discussions on how to generate dynamic VoiceXML content as well as the design of compelling and effective voice user interfaces.
CD-ROM includes code from the book as well as development toolkits and reference material from multiple vendors.

Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison

Amazon Price: $35.00
List Price: $35.00
Usually ships in 24 hours
By: Center for the Study of Language and Inf
Amazon Marketplace: 15 new & used starting at $23.00

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Theory of Computing
Subjects -> Computers & Internet -> Computer Science -> General AAS
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing

Customer Reviews:
Total reviews: 1 Average rating: 4.0 of 5

Still a useful source of information 4 out of 5 stars.
18 of 18 people found this review helpful.

This book, originally published in 1983, was reissued in 1999, no doubt because of the importance of genetic sequencing in recent years. What is neat about the book is it shows how algorithms from one field can be applied to solve problems in another, possibly totally disparate field, one example being computational linguistics and sequence algorithms in computational biology.

A general overview of sequence comparison is given in chapter 1, with applications to molecular biology, human speech, computer science, coding theory, gas chromatography, and bird songs discussed. The author discusses how deletion-insertion, compression-expansion, and substitution are employed in sequence comparison. Different metrics are introduced, such as the Levenshtein distance. Dynamic programming, which pretty much dominates the book, is introduced here also.
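To make the Levenshtein distance mentioned above concrete, here is a minimal Python sketch of the standard dynamic programming recurrence. It is not taken from the book; the function name `levenshtein` and the unit cost for every insertion, deletion, and substitution are assumptions chosen for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b (unit costs assumed)."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            substitution = prev[j - 1] + (0 if ca == cb else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
```

Only two rows of the table are kept at a time, so memory grows with the length of the second string rather than with the product of both lengths.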

Part 1 of the book discusses sequence comparison in molecular biology. The use of dynamic programming is emphasized and its importance continues to this day. The advantages of using the dynamic programming method are outlined, and it is shown how to find the substring of a longer sequence that agrees best with a shorter sequence. In addition, given an RNA molecule with a known nucleotide sequence, methods are discussed for predicting the way different parts of the molecule will bond to each other. These methods are based on dynamic programming. Mathematicians who are considering research in the field, or who are about to enter it, will profit from the section on the biological background. The treatment of RNA secondary structures is excellent.

In part 2, the emphasis is on speech processing and what is called "time-warping", which is a technique for comparing functions by altering the time axis. An interesting application is given to the comparison of bird songs. An algorithm is given for adjusting the time scales of two songs to bring them into optimal alignment. In addition, the differences between compression and expansion and deletion and insertion are discussed in this part.
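The idea of time-warping can be sketched with the textbook dynamic time warping recurrence. This is an illustrative sketch only, not the algorithm as the book presents it: the function name `dtw_distance`, the use of plain numeric sequences, and the absolute difference as the local cost are all assumptions.

```python
import math


def dtw_distance(x, y):
    """Cost of the cheapest alignment of two numeric sequences when
    either time axis may be locally stretched or compressed."""
    n, m = len(x), len(y)
    # cost[i][j] is the best cost of aligning x[:i] with y[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(x[i - 1] - y[j - 1])  # assumed local cost
            cost[i][j] = local + min(
                cost[i - 1][j],      # x advances, y holds (y is stretched)
                cost[i][j - 1],      # y advances, x holds (x is stretched)
                cost[i - 1][j - 1],  # both advance together
            )
    return cost[n][m]


if __name__ == "__main__":
    # The second sequence lingers on the value 2 for one extra step.
    print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

A sequence that merely lingers longer on one value aligns at zero cost, which is the behaviour that separates compression-expansion from deletion-insertion.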

In part 3, a modified Smith-Waterman algorithm is employed to find similar portions in two sequences. Called local alignment in computational biology, it is shown in detail how to define the recurrences for the alignment and how to keep track of the pointers for backtracking. This part also generalizes the operations of substitution and Levenshtein distance. In addition, the strategy of doing sequence comparison by allowing transpositions is discussed. Such a strategy entails a generalized concept of trace, wherein trace lines can intersect each other, leading to entangling of the traces into knots or plaids. The usual dynamic programming techniques must then be extended to deal with these complications. One particular algorithm for this is discussed, called CELLAR, which involves the construction of a directed graph whose paths correspond to admissible sequences of generalizations of traces, called cuts. The computational complexity of this algorithm is discussed. In addition, an O(n^2 / log n) algorithm is given for computing string-edit distances.
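For readers unfamiliar with local alignment, the following Python sketch shows the textbook Smith-Waterman recurrence with a simple traceback. It is not the modified version the book describes: the function name `smith_waterman` and the scoring values (match 2, mismatch -1, linear gap -1) are assumptions, and the traceback recomputes each step from the score table instead of storing explicit pointers.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-1):
    """Return (best score, aligned piece of a, aligned piece of b)
    for the highest-scoring local alignment of strings a and b."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    def pair_score(i, j):
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the table; a cell never drops below zero, which is what
    # lets an alignment start anywhere in either sequence.
    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + pair_score(i, j),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        if score[i][j] == score[i - 1][j - 1] + pair_score(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(smith_waterman("xxACGTyy", "zzACGTww"))
```

Storing a pointer per cell, as the review describes, trades extra memory for a traceback that does not need to re-evaluate the recurrence.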

The last part of the book deals with studying comparisons between random sequences. Combinatorial arguments are used to derive upper bounds on the expected length of the longest common subsequences of two random sequences. Other miscellaneous results dealing with comparing common subsequences of two random sequences are given.

Editorial Review:

Time Warps, String Edits and Macromolecules is a young classic in computational science. The computational perspective is that of sequence processing, in particular the problem of recognizing related sequences. The book is the first, and still the best, compilation of papers explaining how to measure distance between sequences, and how to compute that measure effectively. This is called string distance, Levenshtein distance, or edit distance. The book contains lucid explanations of the basic techniques; well-annotated examples of applications; mathematical analysis of their computational (algorithmic) complexity; and extensive discussion of the variants needed for weighted measures, timed sequences (songs), applications to continuous data, comparison of multiple sequences and extensions to tree structures. This theory finds applications in molecular biology, speech recognition, analysis of bird song and error correction in computer software.

Evolving Connectionist Systems: Methods and Applications in Bioinformatics, Brain Study and Intelligent Machines (Perspectives in Neural Computing)

Nikola Kasabov

List Price: $159.00
By: Springer
Amazon Marketplace: 7 new & used starting at $38.43

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Neural Networks
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> General
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> General AAS

Customer Reviews:
Total reviews: 2 Average rating: 4.5 of 5

Editorial Review:

Many methods and models have been proposed for solving difficult problems such as prediction, planning and knowledge discovery in application areas such as bioinformatics, speech and image analysis. Most, however, are designed to deal with static processes which will not change over time. Some processes - such as speech, biological information and brain signals - are not static, however, and in these cases different models need to be used which can trace, and adapt to, the changes in the processes in an incremental, on-line mode, and often in real time. This book presents generic computational models and techniques that can be used for the development of evolving, adaptive modelling systems. The models and techniques used are connectionist-based (as the evolving brain is a highly suitable paradigm) and, where possible, existing connectionist models have been used and extended. The first part of the book covers methods and techniques, and the second focuses on applications in bioinformatics, brain study, speech, image, and multimodal systems. It also includes an extensive bibliography and an extended glossary. Evolving Connectionist Systems is aimed at anyone who is interested in developing adaptive models and systems to solve challenging real world problems in computing science or engineering. It will also be of interest to researchers and students in life sciences who are interested in finding out how information science and intelligent information processing methods can be applied to their domains.

VoiceXML 2.0 Developer's Guide : Building Professional Voice-enabled Applications with JSP, ASP & Coldfusion

Amazon Price: $44.99
List Price: $49.99
Usually ships in 24 hours
By: McGraw-Hill Companies
Amazon Marketplace: 22 new & used starting at $9.99

Browse similar items by category:
Subjects -> Computers & Internet -> Digital Music -> General
Subjects -> Computers & Internet -> Digital Music -> General AAS
Subjects -> Computers & Internet -> Programming -> Java -> General

Customer Reviews:
Total reviews: 2 Average rating: 3.0 of 5

Bark with little bite 2 out of 5 stars.
4 of 5 people found this review helpful.

While this is the only book I could find with direct reference to VoiceXML, ASP and SALT, it was a disappointment. The title says VoiceXML 2.0, but the major examples are all given in 1.0 syntax. There are many errors and the formatting is poor. Finally, there is no discussion about mixed-initiative applications or natural language processing.

Cut the crap 4 out of 5 stars.
2 of 2 people found this review helpful.

This is a no-crap book. I didn't need a tome that would tell me what I want. I know what I need to do; I just needed a book that would help with the how-to part. Some of the applications discussed in this book are of commercial quality in their design and functionality. The material on voice command performance shows the author's experience in the matter. I could have done with more on IP telephony, but the application discussed here elaborates a design that is more or less common to a lot of IP telephony apps. Nothing really useful in itself, though you can take the concept and the code further as you please, and it gives you something to think about. The very presence of the introductory IP telephony chapter kind of completes the book and the discussion. I wouldn't mind, though, if this book had a few more pages and completed the many applications that I thought were on the verge of being turnkey solutions.

Editorial Review:

* Includes 3 full-scale enterprise-level applications with 100% source code (the only VoiceXML book on the market that delivers this)
* Includes unique mixture of VoiceXML with other hot technologies, including ASP.NET, JSP, Servlets, and ColdFusion
* All code tested and certified by DreamTech software research lab using industry-leading hardware & software; tested against VoiceXML 1.0 & 2.0

Communication Acoustics (Signals and Communication Technology)

Amazon Price: $159.00
List Price: $159.00
Usually ships in 24 hours
By: Springer
Amazon Marketplace: 25 new & used starting at $145.52

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Machine Vision
Subjects -> Computers & Internet -> Computer Science -> Software Engineering -> Information Systems
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing

Editorial Review:

Communication Acoustics deals with the fundamentals of those areas of acoustics which are related to modern communication technologies. Due to the advent of digital signal processing and recording in acoustics, these areas have enjoyed an enormous upswing during the last 4 decades. The book chapters represent review articles covering the most relevant areas of the field. They are written with the goal of providing students with comprehensive introductions. Further, they offer numerous references to the relevant literature. Besides its usefulness as a textbook, this will make the book a source of valuable information for those who want to improve or refresh their knowledge in the field of communication acoustics, and to work their way deeper into it. Due to its interdisciplinary character, Communication Acoustics is bound to attract readers from many different areas, such as acoustics, cognitive science, speech science, and communication technology.

