Tag:speech recognition
-
ffmpeg installation
ffmpeg installation FFmpeg is a set of open source computer programs that can be used to record, convert digital audio and video to streams. Under the LGPL or GPL license. It provides a complete solution for recording, converting, and streaming audio and video. There are four ways to install ffmpeg, namely apt installation, precompiled version […]
-
springboot integration vosk to achieve simple voice recognition function
vosk open source speech recognition Vosk is the open source speech recognition toolkit.Things that Vosk supports include: Nineteen languages are supported – Chinese, English, Indian English, German, French, Spanish, Portuguese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Persian, Filipino, Ukrainian, Kazakh. Work offline on mobile devices – Raspberry Pi, Android, iOS. Install it with […]
-
Speech coding techniques, AMR, AMR-NB, AMR-WB, EVS summary
I’ve recently become a bit interested in real-time speech coding technology, so I learned a bit about it. At first I heard about AMR-NB narrowband coding, and searched only to find more coding techniques, which are summarized here for future viewing. I. What is AMR, AMR-WB Full name Adaptive Multi-Rate and Adaptive Multi-Rate Wideband, mainly […]
-
OpenAI’s Artificial Intelligence Speech Recognition Model Whisper Explained and Used
1 whisper Introduction OpenAI, the company that owns the ChatGPT language model, has open-sourced the Whisper automated speech recognition system, and OpenAI emphasizes that Whisper’s speech recognition ability has reached the human level. Whisper is a general-purpose speech recognition model trained using a large amount of multilingual and multi-task supervised data, capable of achieving near-human […]
-
Speech recognition in action (python code)
Speech recognition in action (python : pyttsx, SAPI, SpeechLib example code) (I) Table of Contents for this article: I. Basic Principles of Speech Recognition (1) The origin and development of speech recognition (2) Basic principles of speech recognition (3) Speech recognition process (4) Recent developments in speech recognition II. Python Speech Recognition (1), text-to-speech conversion […]
-
Librosa Library – Speech Recognition, Speech Tone Recognition Training and Applications
Many students think that speech recognition is very difficult, but it is not, at first I also think so, but later found that speech recognition is the easiest, because students may not know that Python has an audio processing library Librosa, this library is very powerful, can be audio processing,spectrogramRepresentation, amplitude conversion, time-frequency conversion, feature […]
-
Introduction to javacv
Understand the history and development background of javacv JavaCV is an open source Java framework that provides Java-based interfaces for accessing various computer vision libraries and toolkits such as OpenCV, FFmpeg, etc. JavaCV is designed to provide Java developers with fast, simple and reliable image and video processing capabilities. The history of JavaCV dates back […]
-
STM32F103 Driving LD3320 Speech Recognition Module
STM32F103 Driving LD3320 Speech Recognition Module LD3320 Speech Recognition Module IntroductionModule Pin DefinitionsSTM32F103ZET6 development board and module wiringtest codeResults LD3320 Speech Recognition Module Introduction Based on LD3320, voice recognition/voice control/human-machine dialogue functions can be easily realized in any electronic products, even the simplest system with 51 as the main controller. Add VUI (Voice User Interface) […]
-
Whisper JAX Speech Recognition Local Deployment
https://nlpcloud.com/zh/how-to-install-and-deploy-whisper-the-best-open-source-alternative-to-google-speech-to-text.html whisperX Speech Recognition Local Deployment Video Tutorial whisper-jax most detailed installation tutorial | A claim than the whisper 70 times faster than the speech recognition project | Free and open source speech recognition projects whisperX Speech Recognition Local Deployment_JoeManba’s Blog – Blogs GitHub – sanchit-gandhi/whisper-jax: JAX implementation of OpenAI’s Whisper model for up to […]