Voice Recognition Books - Page 2

MagicBeanDip.com

Page 2 of 34 - Go to page: 1 2 3 4 5 6 7 13

Introduction to Video Search Engines

David C. Gibbon, Zhu Liu

Introduction to Video Search Engines David C. Gibbon, Zhu Liu Amazon Price: $71.96
List Price: $89.95
Usually ships in 24 hours
By: Springer
Amazon Marketplace: 12 new & used starting at $67.96

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Artificial Intelligence -> Computer Vision
Subjects -> Computers & Internet -> Databases -> Beginning & Introductory
Subjects -> Computers & Internet -> Databases -> General

Editorial Review:

Video search engines enable users to take advantage of constantly growing video resources like, for example, video on demand, Internet television and YouTube, for a wide variety of applications including entertainment, education and communications.

David Gibbon and Zhu Liu describe the current state of video search engine technology and inform us about opportunities to contribute to the development of this field. Their book has a practical emphasis with the goal of bringing readers up to date on the state of the art in multimedia search technologies and systems. It explains the overall process of video content acquisition, indexing and retrieval with browsing, it provides overviews of constituent technologies such as information retrieval, Internet video systems, video and multimedia processing to extract index data, and it gives examples of research prototypes and existing commercial systems and describes their features. In parallel with the functional discussion, a historical perspective is provided, including many references to academic and industrial sources. Background information on digital media encoding and streaming standards, and information retrieval is also offered, making the book self-contained.

"Introduction to Video Search Engines" is intended for professionals and senior undergraduates or first-year graduate students in computer science or computer engineering, specializing in computer vision or multimedia systems. As multimedia search spans multiple disciplines, it is also valuable as a state-of-the-art reference for researchers and developers working in constituent technologies such as speech processing or information retrieval who seek to broaden their knowledge beyond their current areas of expertise.

Using Soundtrack: Produce Original Music for Video, DVD, and Multimedia

Douglas Spotted Eagle

Using Soundtrack: Produce Original Music for Video, DVD, and Multimedia Douglas Spotted Eagle Amazon Price: $36.95
List Price: $36.95
Usually ships in 1 to 2 months
By: CMP Books
Amazon Marketplace: 22 new & used starting at $6.02

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Digital Music -> General
Subjects -> Computers & Internet -> Digital Music -> General AAS
Subjects -> Computers & Internet -> Programming -> General

Customer Reviews:
Total reviews: 2 Average rating: 5.0 of 5

A Great Way to Learn 5 out of 5 stars.
4 of 4 people found this review helpful.

I am not a musician. The last instrument I played was the basoon (and no one ever asks you to play that in normal social circles).

So to summarize my music knowledge is rudimentary. Douglas Spotted Eagle's is not (cripes he has a Grammy!) But he does lord that knowledge over you, rather he generously shares his wisdom.

This book is an easy and engaging read AND you you learn something from it. Soundtrack is a powerful tool, and DSE opens it wide up so you can get the most out of it. Douglas combines years of music expereince and his extensive background in digital media to craft an excellent book.

This book is a required reference in our office and shoudl be in yours too,

Editorial Review:

Covering the basics of producing great audio tracks to accompany video projects, Using Soundtrack provides recording and editing tips and guidance on noise reduction tools, audio effects, and Final Cut Pro's powerful real-time audio mixer. Readers also learn how Soundtrack can be used to give video projects a professional finish with the addition of custom, royalty-free scoring. Theory is presented on a need-to-know basis and practical tutorials provide hands-on techniques for common tasks, including editing video to audio, editing audio to video, changing the length of a music bed, editing dialog, and mixing dialog with music and sound effects. The accompanying CDROM includes tutorial lessons and sample media.

Learn the basics of great audio, including advice on recording sound, editing tips, noise reduction tools, audio effects, and Final Cut Pro's powerful real-time audio mixer with Using Soundtrack 1.2

Discrete-Time Processing of Speech Signals (Ieee Press Classic Reissue)

John R., Jr. Deller, John H. L. Hansen, John G. Proakis

Discrete-Time Processing of Speech Signals (Ieee Press Classic Reissue) John R., Jr. Deller, John H. L. Hansen, John G. Proakis Amazon Price: $124.00
List Price: $155.00
Usually ships in 24 hours
By: Wiley-IEEE Press
Amazon Marketplace: 25 new & used starting at $94.90

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Circuitry -> Communication & Signal Processing
Subjects -> Computers & Internet -> Networking -> Networks, Protocols & APIs -> General
Subjects -> Computers & Internet -> Networking -> Networks, Protocols & APIs -> General AAS

Customer Reviews:
Total reviews: 7 Average rating: 4.0 of 5

Editorial Review:

Commercial applications of speech processing and recognition are fast becoming a growth industry that will shape the next decade. Now students and practicing engineers of signal processing can find in a single volume the fundamentals essential to understanding this rapidly developing field. IEEE Press is pleased to publish a classic reissue of Discrete-Time Processing of Speech Signals. Specially featured in this reissue is the addition of valuable World Wide Web links to the latest speech data references.

This landmark book offers a balanced discussion of both the mathematical theory of digital speech signal processing and critical contemporary applications. The authors provide a comprehensive view of all major modern speech processing areas: speech production physiology and modeling, signal analysis techniques, coding, enhancement, quality assessment, and recognition. You will learn the principles needed to understand advanced technologies in speech processing -- from speech coding for communications systems to biomedical applications of speech analysis and recognition.

Ideal for self-study or as a course text, this far-reaching reference book offers an extensive historical context for concepts under discussion, end-of-chapter problems, and practical algorithms. Discrete-Time Processing of Speech Signals is the definitive resource for students, engineers, and scientists in the speech processing field.

An Instructor's Manual presenting detailed solutions to all the problems in the book is available upon request from the Wiley Makerting Department.

Voice User Interface Design

Michael H. Cohen, James P. Giangola, Jennifer Balogh

Voice User Interface Design Michael H. Cohen, James P. Giangola, Jennifer Balogh Amazon Price: $44.40
List Price: $54.99
Usually ships in 24 hours
By: Addison-Wesley Professional
Amazon Marketplace: 20 new & used starting at $27.99

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Software Engineering -> Design Tools & Techniques
Subjects -> Computers & Internet -> Computer Science -> Software Engineering -> Information Systems
Subjects -> Computers & Internet -> Computer Science -> Human-Computer Interaction

Customer Reviews:
Total reviews: 3 Average rating: 4.5 of 5

Editorial Review:

This book is a comprehensive and authoritative guide to voice user interface (VUI) design. The VUI is perhaps the most critical factor in the success of any automated speech recognition (ASR) system, determining whether the user experience will be satisfying or frustrating, or even whether the customer will remain one. This book describes a practical methodology for creating an effective VUI design. The methodology is scientifically based on principles in linguistics, psychology, and language technology, and is illustrated here by examples drawn from the authors' work at Nuance Communications, the market leader in ASR development and deployment. The book begins with an overview of VUI design issues and a description of the technology. The authors then introduce the major phases of their methodology. They first show how to specify requirements and make high-level design decisions during the definition phase. They next cover, in great detail, the design phase, with clear explanations and demonstrations of each design principle and its real-world applications. Finally, they examine problems unique to VUI design in system development, testing, and tuning.Key principles are illustrated with a running sample application. A companion Web site provides audio clips for each example: www.VUIDesign.org The cover photograph depicts the first ASR system, Radio Rex: a toy dog who sits in his house until the sound of his name calls him out. Produced in 1911, Rex was among the few commercial successes in earlier days of speech recognition. Voice User Interface Design reveals the design principles and practices that produce commercial success in an era when effective ASRs are not toys but competitive necessities.

The VoiceXML Handbook: Understanding and Building the Phone-Enabled Web

Bob Edgar

The VoiceXML Handbook: Understanding and Building the Phone-Enabled Web Bob Edgar List Price: $39.95
By: CMP Books
Amazon Marketplace: 13 new & used starting at $7.84

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Digital Music -> General
Subjects -> Computers & Internet -> Digital Music -> General AAS
Subjects -> Computers & Internet -> Networking -> Data in the Enterprise -> General

Customer Reviews:
Total reviews: 13 Average rating: 3.5 of 5

Not very informative 2 out of 5 stars.
5 of 5 people found this review helpful.

As a VoiceXML developer, I looked forward to this book. I was disappointed. Too much time was spent speculating on Version 2.0 and not enough time explaining Version 1.0. If you are looking to learn VoiceXML this is not the book.

VoiceXML for very beginners 2 out of 5 stars.
4 of 4 people found this review helpful.

The book is a general overview of telephony application and a thin introduction to VoiceXML. It covers important matters in a very rapid and unprecise way. It contains even errors in the examples.

Not so informative. 2 out of 5 stars.
1 of 1 people found this review helpful.

I can find more information on the internet on this subject than reading this book. In fact I turned to the internet while reading this book for answers to the questions this book failed to answer.

Editorial Review:

VoiceXML combines the power of the Internet with the flexibility of voice, using the telephone as the access point to the Internet. XML is replacing HMTL as the state of the art for enabling customers and employees to access business over the Internet. VoiceXML enables companies to provide the same information via the phone with minimal additional development. This book shows how to build phone-enabled Web sites with VoiceXML. For Web developers it explains the essentials of telephony, and for telecom experts it explains the essentials of the Web.

Digital Speech: Coding for Low Bit Rate Communication Systems

A. M. Kondoz

Digital Speech: Coding for Low Bit Rate Communication Systems A. M. Kondoz Amazon Price: $80.00
List Price: $80.00
Usually ships in 24 hours
By: Wiley
Amazon Marketplace: 28 new & used starting at $49.95

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Professional & Technical -> Engineering -> Electrical & Electronics -> Electronics -> Digital Audio
Subjects -> Professional & Technical -> Engineering -> Electrical & Electronics -> Electronics -> General

Customer Reviews:
Total reviews: 1 Average rating: 4.0 of 5

Actually two books in one 4 out of 5 stars.
1 of 1 people found this review helpful.

This book has some good background material on speech coding plus some material on new speech processing and coding techniques. In order to lay the foundation of speech coding technology the book reviews sampling, quantizations, and then the basic nature of speech signals and the theory and tools applied in speech coding. The last two chapters of the book consists of some recent research in the areas of voice activity detection and speech enhancement.

This book shows a real disconnect in style between its first nine chapters and the last two. The first nine are very well written and cover all the basics of speech coding. The mathematical aspects of speech coding are clearly laid out and derived, and the book makes liberal use of some very good illustrations that model various parts of speech coders in block diagram form. In order to understand all of this, though, you should already have a good grasp of digital signal processing and probability theory including random processes. Particularly good are the discussions on the short-time Fourier transform of speech and the effects windowing has on it, the long-term prediction of speech, and pitch detection methods. The last two chapters are written like research papers, and are not very clear at all. There is quite a bit of probability theory on display there, and the notation is hard to discern. There are no exercises at all in this book, but each chapter does have a pretty good summary, even the research chapters.

I would say if you want to learn about speech coding this is a good economical way to learn, although I think Speech Coding Algorithms: Foundation and Evolution of Standardized Coders by Chu is probably a better book for beginners. You might want to stop reading after chapter nine of this book unless you are just extremely interested in the research portion.

Editorial Review:

This newly updated book covers all aspects of digital speech coding from an introduction to the background, sampling and analysis, quantisation methods and coders through to the recent research in areas such as voice activity detection and speech enhancement.

It has a number of new and updated chapters that advance the principles laid out in the first edition and bring the technology described completely up-to-date.  New chapters include:

  •  ‘Harmonic Coders’
  •  ‘Integration of harmonic and analysis by synthesis coders’
  •  ‘Voice Activity Detection’ 
  •  ‘Speech Enhancement’

It is the only book available that covers all these areas in a single text that is ideal for practising communications engineers, researchers and developers of communications systems and senior undergraduate students in electrical and electronic engineering.


 

Dynamic Speech Models (Synthesis Lectures on Speech and Audio Processing)

Li Deng

Dynamic Speech Models (Synthesis Lectures on Speech and Audio Processing) Li Deng Amazon Price: $34.30
List Price: $40.00
Usually ships in 24 hours
By: Morgan and Claypool Publishers
Amazon Marketplace: 4 new & used starting at $34.30

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition
Subjects -> Computers & Internet -> General

Editorial Review:

What are the compelling reasons for carrying out dynamic speech modeling? We provide the answer in two related aspects. First, scientific inquiry into the human speech code has been relentlessly pursued for several decades. As an essential carrier of human intelligence and knowledge, speech is the most natural form of human communication. Embedded in the speech code are linguistic (as well as para-linguistic) messages, which are conveyed through four levels of the speech chain. Underlying the robust encoding and transmission of the linguistic messages are the speech dynamics at all the four levels. Mathematical modeling of speech dynamics provides an effective tool in the scientific methods of studying the speech chain. Such scientific studies help understand why humans speak as they do and how humans exploit redundancy and variability by way of multitiered dynamic processes to enhance the efficiency and effectiveness of human speech communication. Second, advancement of human language technology, especially that in automatic recognition of natural-style human speech is also expected to benefit from comprehensive computational modeling of speech dynamics. The limitations of current speech recognition technology are serious and are well known. A commonly acknowledged and frequently discussed weakness of the statistical model underlying current speech recognition technology is the lack of adequate dynamic modeling schemes to provide correlation structure across the temporal speech observation sequence. Unfortunately, due to a variety of reasons, the majority of current research activities in this area favor only incremental modifications and improvements to the existing HMM-based state-of-the-art. For example, while the dynamic and correlation modeling is known to be an important topic, most of the systems nevertheless employ only an ultra-weak form of speech dynamics; e.g., differential or delta parameters. Strong-form dynamic speech modeling, which is the focus of this monograph, may serve as an ultimate solution to this problem. After the introduction chapter, the main body of this monograph consists of four chapters. They cover various aspects of theory, algorithms, and applications of dynamic speech models, and provide a comprehensive survey of the research work in this area spanning over past 20~years. This monograph is intended as advanced materials of speech and signal processing for graudate-level teaching, for professionals and engineering practioners, as well as for seasoned researchers and engineers specialized in speech processing.

Modern Methods of Speech Processing (The Springer International Series in Engineering and Computer Science)

Modern Methods of Speech Processing (The Springer International Series in Engineering and Computer Science) Amazon Price: $244.00
List Price: $244.00
Usually ships in 24 hours
By: Springer
Amazon Marketplace: 19 new & used starting at $105.06

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Circuitry -> Communication & Signal Processing
Subjects -> Computers & Internet -> Graphic Design -> General
Subjects -> Computers & Internet -> Graphic Design -> General AAS

Editorial Review:

The term `speech processing' refers to the scientific discipline concerned with the analysis and processing of speech signals in order to gain the best benefit in various practical scenarios. These different practical scenarios correspond to a large variety of applications of speech processing research. Examples of some applications include enhancement, coding, synthesis, recognition and speaker recognition. This field has experienced very rapid growth, particularly during the past ten years. The ideal aim is to develop algorithms for a certain task that maximize performance, are computationally feasible and are robust under a wide class of conditions. Modern Methods of Speech Processing provides a cohesive collection of chapters describing recent advances in various branches of the subject. The main focus is on describing specific research directions through a detailed analysis and review of both the theoretical and practical settings. Audience: Graduate students embarking on speech research as well as the experienced researcher already working in the field, who can utilize the book as a reference guide.

Speech and Audio Signal Processing: Processing and Perception of Speech and Music

Ben Gold, Nelson Morgan

Speech and Audio Signal Processing: Processing and Perception of Speech and Music Ben Gold, Nelson Morgan By: Wiley
Amazon Marketplace: 32 new & used starting at $36.84

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Computer Science -> Circuitry -> Communication & Signal Processing
Subjects -> Computers & Internet -> Software -> Business -> Speech Processing
Subjects -> Computers & Internet -> Software -> Voice Recognition

Customer Reviews:
Total reviews: 4 Average rating: 4.5 of 5

Speech and Audio Signal Processing: Processing and Perceptio 5 out of 5 stars.
21 of 22 people found this review helpful.

This is a book much needed in the speech and audio community because of its unique perspective on these topics. By their very nature, speech, music and other audio signals are only fully understood if one takes into account their perception, production, and the context within whcih they exist (language, symphony). To appreciate what to process about such signals, the scientist must have a broad appreciation of linguistics, hearing, vocal tract models, and the brain in general, in addition to the standard engineering tools and approaches. This is why this book is valuable. It indeed attempts to reach out to all these fields with just enough details to inspire the reader, and to provide links to existing more detailed literature. The book is well written, full of excellent illustrations, and it was the perfect choice for a class to graduate students in the Electrical Engineering Department where I teach at the University of Maryland. I highly recommend it.

Editorial Review:

Speech and music are the most basic means of adult human communication. As technology advances and increasingly sophisticated tools become available to use with speech and music signals, scientists can study these sounds more effectively, and invent new ways of applying them for the benefit of humankind. This book includes coverage of the physiology and psychoacoustics of hearing as well as the results from research on pitch and speech perception, vocoding methods and information on many aspects of automatic speech recognition (ASR) systems. The authors have made use of their own research in these fields, as well as the methods and results of many other contributors.

Techniques in Speech Acoustics (Text, Speech and Language Technology)

J. Harrington, S. Cassidy

Techniques in Speech Acoustics (Text, Speech and Language Technology) J. Harrington, S. Cassidy Amazon Price: $165.00
List Price: $165.00
Usually ships in 4 to 7 weeks
By: Springer
Amazon Marketplace: 2 new & used starting at $164.99

Buy at Amazon.com

Browse similar items by category:
Subjects -> Computers & Internet -> Digital Music -> General
Subjects -> Computers & Internet -> Digital Music -> General AAS
Subjects -> Computers & Internet -> Operating Systems -> General

Editorial Review:

Techniques in Speech Acoustics provides an introduction to the acoustic analysis and characteristics of speech sounds. The first part of the book covers aspects of the source-filter decomposition of speech, spectrographic analysis, the acoustic theory of speech production and acoustic phonetic cues. The second part is based on computational techniques for analysing the acoustic speech signal including digital time and frequency analyses, formant synthesis, and the linear predictive coding of speech. There is also an introductory chapter on the classification of acoustic speech signals which is relevant to aspects of automatic speech and talker recognition. Included with the book is a CD-ROM containing extensive speech corpora, the EMU speech analysis tools, extensions to the X-LISP-STAT programming language that are adapted to speech analysis, and numerous exercises that are linked to the major themes of the book and which can be run on Windows-95 and UNIX platforms.
The book and CD-ROM are intended for use as teaching materials on undergraduate and postgraduate speech acoustics and experimental phonetics courses; they are also aimed at researchers from phonetics, linguistics, computer science, psychology and engineering who wish to gain an understanding of the basis of speech acoustics and its application to fields such as speech synthesis and automatic speech recognition.

Page 2 of 34 - Go to page: 1 2 3 4 5 6 7 13

Return to MagicBeanDip.com

This page was created in 1.5909 seconds.