hig.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
The Algorithms of Speech Recognition: programming and simulating in MATLAB
University of Gävle, Faculty of Engineering and Sustainable Development.
2012 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

The aim of this thesis work is to investigate the algorithms of speech recognition. The author programmed and simulated the designed systems for algorithms of speech recognition in MATLAB. There are two systems designed in this thesis. One is based on the shape information of the cross-correlation plotting. The other one is to use the Wiener Filter to realize the speech recognition. The simulations of the programmed systems in MATLAB are accomplished by using the microphone to record the speaking words. After running the program in MATLAB, MATLAB will ask people to record the words three times. The first and second recorded words are different words which will be used as the reference signals in the designed systems. The third recorded word is the same word as the one of the first two recorded words. After recording words, the words will become the signals’ information which will be sampled and stored in MATLAB. Then MATLAB should be able to give the judgment that which word is recorded at the third time compared with the first two reference words according to the algorithms programmed in MATLAB. The author invited different people from different countries to test the designed systems. The results of simulations for both designed systems show that the designed systems both work well when the first two reference recordings and the third time recording are recorded from the same person. But the designed systems all have the defects when the first two reference recordings and the third time recording are recorded from the different people. However, if the testing environment is quiet enough and the speaker is the same person for three time recordings, the successful probability of the speech recognition is approach to 100%. Thus, the designed systems actually work well for the basical speech recognition.

Place, publisher, year, edition, pages
2012. , p. 71
Keywords [en]
speech recognition, MATLAB, cross-correlation, Wienner Filter
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:hig:diva-11835Archive number: TEX100808OAI: oai:DiVA.org:hig-11835DiVA, id: diva2:525564
Uppsok
Technology
Examiners
Available from: 2012-05-28 Created: 2012-05-08 Last updated: 2012-05-28Bibliographically approved

Open Access in DiVA

fulltext(1792 kB)28146 downloads
File information
File name FULLTEXT01.pdfFile size 1792 kBChecksum SHA-512
da85b6e4075632b03b712559958c01b07fd35d9b2574774a730697cce30040310da4cce0111140cda8e6c18b0d12da0a4e9a4a34e861363b21c280c6488c3416
Type fulltextMimetype application/pdf

By organisation
Faculty of Engineering and Sustainable Development
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 29837 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 2083 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf