Entry | Speech recognition software for Linux |
Definition |
As of the early 2000s, several speech recognition (SR) software packages exist for Linux. Some of them are free and open-source software and others are proprietary software. Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for communicating operational commands to a computer.
Linux native speech recognition
History
In the late 1990s, a Linux version of ViaVoice, created by IBM, was made available to users for no charge. In 2002, the free software development kit (SDK) was removed by the developer.
Development status
In the early 2000s, there was a push to get a high-quality Linux native speech recognition engine developed. As a result, several projects dedicated to creating Linux speech recognition programs were begun, such as Mycroft, which is similar to Microsoft Cortana, but open source.
Speech sample crowdsourcing
It is essential to compile a speech corpus to produce acoustic models for speech recognition projects. VoxForge is a free speech corpus and acoustic model repository that was built with the aim of collecting transcribed speech to be used in speech recognition projects. VoxForge accepts crowdsourced speech samples and corrections of recognized speech sequences. It is licensed under a GNU General Public License (GPL).
Speech recognition concept
The first step is to begin recording an audio stream on a computer. The user has two main processing options:
Remote recognition was formerly used by smartphones because they lacked sufficient performance, working memory, or storage to process speech recognition within the phone. These limits have largely been overcome although server-based SR on mobile devices remains universal.
Speech recognition in browser
Discrete speech recognition can be performed within a web browser and works well with supported browsers. Remote SR does not require installing software on a desktop computer or mobile device as it is mainly a server-based system with the inherent security issues noted above.
Free speech recognition engines
The following is a list of projects dedicated to implementing speech recognition in Linux, and major native solutions. These are not end-user applications. These are programming libraries that may be used to develop end-user applications.
Possibly active projects:
It is possible for developers to create Linux speech recognition software by using existing packages derived from open-source projects.
Inactive projects:
Proprietary speech recognition engines
Voice control and keyboard shortcuts
Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for sending operational commands to a computer or appliance. Voice control typically requires a much smaller vocabulary and thus is much easier to implement. Simple software combined with keyboard shortcuts has the earliest potential for practically accurate voice control in Linux.
Running Windows speech recognition software with Linux
Via compatibility layer
It is possible to use programs such as Dragon NaturallySpeaking in Linux by using Wine, though some problems may arise, depending on which version is used.[23]
Via virtualized Windows
It is also possible to use Windows speech recognition software under Linux through virtualization. Using no-cost virtualization software, Windows and NaturallySpeaking can run under Linux. VMware Server and VirtualBox support copy and paste to and from a virtual machine, making dictated text easily transferable between the virtual machine and the Linux host.
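To make the voice control idea from the section on voice control and keyboard shortcuts more concrete, the following is a minimal sketch, not taken from any of the projects in this article. It assumes that some speech recognition engine has already produced a text transcript, and it maps that transcript onto a small, fixed command vocabulary. The `COMMANDS` table, the `match_command` function, and the shortcut strings are all hypothetical names chosen for illustration. Fuzzy matching with Python's standard `difflib` module stands in for whatever tolerance a real system would need for slightly misrecognized phrases.

```python
import difflib

# Hypothetical command vocabulary: spoken phrase -> keyboard shortcut.
# The shortcuts are illustrative and depend on the desktop environment.
COMMANDS = {
    "open terminal": "ctrl+alt+t",
    "close window": "alt+F4",
    "switch window": "alt+Tab",
    "copy": "ctrl+c",
    "paste": "ctrl+v",
}


def match_command(heard, cutoff=0.75):
    """Return (phrase, shortcut) for the closest known command, or None.

    `heard` is the text transcript from a recognizer (assumed input).
    `cutoff` is the minimum similarity ratio, between 0 and 1.
    """
    # Normalize case and collapse repeated whitespace before matching.
    normalized = " ".join(heard.lower().split())
    candidates = difflib.get_close_matches(
        normalized, list(COMMANDS), n=1, cutoff=cutoff
    )
    if not candidates:
        return None
    phrase = candidates[0]
    return phrase, COMMANDS[phrase]


if __name__ == "__main__":
    for transcript in ["Open  Terminal", "close windows", "play some music"]:
        print(repr(transcript), "->", match_command(transcript))
```

Because the vocabulary holds only a handful of phrases, a transcript that is slightly wrong can still resolve to the intended command, which is one reason small-vocabulary voice control is easier to get right than open dictation. Sending the resulting shortcut to the desktop is left out here; on X11 a tool such as `xdotool` could be used for that step, but that pairing is an assumption rather than something this article prescribes.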
References
1. "A TensorFlow implementation of Baidu's DeepSpeech architecture". Mozilla. 2017-12-05. https://github.com/mozilla/DeepSpeech. Retrieved 2017-12-05.
2. Lera KDE git repository (2015). https://cgit.kde.org/scratch/grasch/lera.git/. Retrieved 2017-07-25.
3. "Speech to text online, Windows and Linux integration". speechpad.pw. https://speechpad.pw
4. "andre-luiz-dos-santos/speech-app". GitHub. 2018-07-12. https://github.com/andre-luiz-dos-santos/speech-app
5. "The Nerd Show – Platypus". thenerdshow.com. http://thenerdshow.com/platypus.html
6. "FreeSpeech Realtime Speech Recognition and Dictation". TheNerdShow.com. http://thenerdshow.com/freespeech.html
7. "Vedics". http://vedics.sourceforge.net/
8. "Projects/GnomeVoiceControl – GNOME Wiki!". wiki.gnome.org. https://wiki.gnome.org/Projects/GnomeVoiceControl
9. "rcorcs/NatI". GitHub. 2018-09-24. https://github.com/rcorcs/NatI
10. "worden341/sphinxkeys". GitHub. 2016-07-11. https://github.com/worden341/sphinxkeys
11. Simon KDE. Main developer until 2015: Peter Grasch. Accessed 2017-09-04.
12. "Jasper". GitHub. https://jasperproject.github.io/
13. Kiecza, Daniel. "Linux". Kiecza.net. http://www.kiecza.net/daniel/linux/
14. "Open Mind Speech – Free Speech Recognition for Linux". freespeech.sourceforge.net. http://freespeech.sourceforge.net/
15. "Open Mind Initiative". http://www.openmind.org/. Archived 2003-08-05 at https://web.archive.org/web/20030805105416/http://openmind.org/. Retrieved 2019-03-16.
16. "Perlbox.org Linux Speech Control and Voice Recognition". perlbox.sourceforge.net. http://perlbox.sourceforge.net/
17. "Xvoice". xvoice.sourceforge.net. http://xvoice.sourceforge.net/
18. "Verbio". www.verbio.com. http://www.verbio.com
19. "SRI Speech: Home". www.speechatsri.com. http://www.speechatsri.com
20. Roedder, Margit (IAR). "KIT – Janus Recognition Toolkit". isl.ira.uka.de. 26 January 2018. http://isl.ira.uka.de/english/1406.php
21. "Speech and Multifactor Authentication Technologies". LumenVox. http://www.lumenvox.com. Retrieved 2013-02-28.
22. "Speech to Text Software & Service – Speech Recognition Software". Vocapia Research. 2018-12-30. http://www.vocapia.com. Retrieved 2019-03-16.
23. "WineHQ – Dragon Naturally Speaking". appdb.winehq.org. http://appdb.winehq.org/objectManager.php?sClass=application&iId=2077