💁 Awesome Treasure of Transformers Models for Natural Language Processing: contains papers, videos, blogs, and official repos along with Colab notebooks. 🛫☑️
Jupyter notebooks to fine-tune Whisper models on Vietnamese using Colab, Kaggle, and/or AWS EC2.
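A minimal sketch of how such a fine-tuning setup typically starts, assuming the Hugging Face transformers toolchain; the checkpoint name and decoder-prompt approach are illustrative assumptions, not the repo's exact code.

```python
# Sketch: load Whisper for Vietnamese fine-tuning with Hugging Face transformers
# (assumed toolchain; the notebooks in the repo may do this differently).
from transformers import WhisperProcessor, WhisperForConditionalGeneration

model_name = "openai/whisper-small"  # assumed checkpoint size
processor = WhisperProcessor.from_pretrained(model_name, language="vi", task="transcribe")
model = WhisperForConditionalGeneration.from_pretrained(model_name)

# Force Vietnamese transcription tokens during generation.
model.config.forced_decoder_ids = processor.get_decoder_prompt_ids(
    language="vietnamese", task="transcribe"
)
```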
WhisPad is a note management tool where you can write or dictate your notes using local or API AI models (supports speaker diarization). Rewrite your texts in different styles, explore them with AI, translate, summarize, and create mind maps, node graphs, and even quizzes and flashcards from each note. A powerful companion for researchers and students.
In this notebook, I implemented a script to transcribe YouTube videos (and audio files in general) using Google's speech-to-text API.
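A hedged sketch of the transcription step, using the SpeechRecognition package's Google Web Speech backend on an already-downloaded audio file; the notebook may instead use the Google Cloud Speech client, and the file path is illustrative.

```python
# Transcribe a downloaded audio file with the Google Web Speech backend
# of the SpeechRecognition package (assumed approach, not the repo's exact code).
import speech_recognition as sr

recognizer = sr.Recognizer()
with sr.AudioFile("downloaded_audio.wav") as source:  # illustrative path
    audio = recognizer.record(source)

text = recognizer.recognize_google(audio, language="en-US")
print(text)
```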
Self-contained notebooks for experimenting with particular concepts in Deep Learning.
High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.
A speech emotion recognition notebook that trains a model to identify the emotion in human speech with an accuracy of roughly 60%.
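The usual pipeline for this task is MFCC features fed to a small classifier; the sketch below assumes librosa and scikit-learn, while the notebook's actual features and model may differ.

```python
# Illustrative speech-emotion pipeline: MFCC features -> small MLP classifier.
import librosa
import numpy as np
from sklearn.neural_network import MLPClassifier

def extract_mfcc(path):
    y, sr = librosa.load(path, sr=16000)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=40)
    return mfcc.mean(axis=1)  # average over time -> fixed-length feature vector

# X: list of feature vectors, y: emotion labels (hypothetical training data)
# clf = MLPClassifier(hidden_layer_sizes=(128,), max_iter=500).fit(X, y)
```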
Simple Jupyter Notebook including a Speech Recognition implementation with CMUSphinx
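A minimal example of offline decoding with CMUSphinx via the SpeechRecognition package (requires pocketsphinx installed); the notebook may call pocketsphinx directly instead, and the file name is illustrative.

```python
# Offline speech recognition with CMUSphinx through SpeechRecognition.
import speech_recognition as sr

r = sr.Recognizer()
with sr.AudioFile("sample.wav") as source:  # illustrative file name
    audio = r.record(source)

print(r.recognize_sphinx(audio))  # decodes locally, no API key needed
```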
Notebook implementation of Michael Nielsen's online book: Neural Networks and Deep Learning.
Detect 18+ languages instantly using machine learning (BERT, LSTM, SVM) and NLP. Includes a Flask web app for real-time predictions, trained models, and detailed notebooks.
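A toy sketch of the classical-ML branch (SVM) of language identification; the BERT and LSTM models in the repo are separate, and character n-gram features are an assumed choice rather than the repo's documented setup.

```python
# Toy language-identification pipeline: character n-gram TF-IDF + linear SVM.
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC

texts = ["hello world", "bonjour le monde", "hola mundo"]   # toy training data
labels = ["English", "French", "Spanish"]

model = make_pipeline(
    TfidfVectorizer(analyzer="char_wb", ngram_range=(1, 3)),
    LinearSVC(),
)
model.fit(texts, labels)
print(model.predict(["bonjour tout le monde"]))
```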
In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.
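A compact sketch matching that description: torchaudio's SPEECHCOMMANDS dataset plus an M5-style 1-D convolutional network. Layer sizes follow the public PyTorch audio-classification tutorial and may differ from the notebook.

```python
# SPEECHCOMMANDS dataset + M5-style 1-D CNN (illustrative layer sizes).
import torch.nn as nn
import torchaudio

train_set = torchaudio.datasets.SPEECHCOMMANDS("./data", download=True, subset="training")

class M5(nn.Module):
    def __init__(self, n_classes=35, n_channel=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(1, n_channel, kernel_size=80, stride=16),
            nn.BatchNorm1d(n_channel), nn.ReLU(), nn.MaxPool1d(4),
            nn.Conv1d(n_channel, n_channel, kernel_size=3),
            nn.BatchNorm1d(n_channel), nn.ReLU(), nn.MaxPool1d(4),
            nn.Conv1d(n_channel, 2 * n_channel, kernel_size=3),
            nn.BatchNorm1d(2 * n_channel), nn.ReLU(), nn.MaxPool1d(4),
            nn.Conv1d(2 * n_channel, 2 * n_channel, kernel_size=3),
            nn.BatchNorm1d(2 * n_channel), nn.ReLU(), nn.MaxPool1d(4),
            nn.AdaptiveAvgPool1d(1), nn.Flatten(),
            nn.Linear(2 * n_channel, n_classes),
        )

    def forward(self, x):  # x shape: (batch, 1, samples)
        return self.net(x)
```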
This repository transforms audio files into mel-spectrogram images and was created for the "UrbanSound8k Mel Spectrogram Images" dataset on Kaggle. Key features include sound visualization and dataset creation for sound analysis; the Audio-to-Spectrogram.ipynb notebook generates the spectrograms.
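A hedged sketch of the audio-to-mel-spectrogram conversion the repo describes, assuming librosa and matplotlib; the UrbanSound8K clip path and image settings are illustrative.

```python
# Convert one audio clip to a mel-spectrogram image (illustrative settings).
import librosa
import librosa.display
import matplotlib.pyplot as plt
import numpy as np

y, sr = librosa.load("urbansound_clip.wav")                  # illustrative file
mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=128)
mel_db = librosa.power_to_db(mel, ref=np.max)                # convert power to dB

plt.figure(figsize=(6, 3))
librosa.display.specshow(mel_db, sr=sr, x_axis="time", y_axis="mel")
plt.axis("off")                                              # image-only output for the dataset
plt.savefig("urbansound_clip.png", bbox_inches="tight", pad_inches=0)
```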
This repository provides a Jupyter notebook for a Connectionist Temporal Classification (CTC) based Automatic Speech Recognition (ASR) system using TensorFlow and Keras. The primary focus is to demonstrate the implementation of a CTC ASR model and to show how to train it effectively on the "Yes No" dataset.
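A minimal sketch of the CTC training objective in Keras; the shapes and the loss wrapper below are illustrative assumptions, not the repo's exact code.

```python
# CTC loss wrapper for a Keras ASR model (illustrative shapes).
import tensorflow as tf

def ctc_loss(y_true, y_pred):
    # y_pred: (batch, time, vocab) softmax outputs; y_true: padded label indices.
    batch = tf.shape(y_true)[0]
    input_len = tf.fill([batch, 1], tf.shape(y_pred)[1])   # full time dimension per sample
    label_len = tf.fill([batch, 1], tf.shape(y_true)[1])   # padded label length per sample
    return tf.keras.backend.ctc_batch_cost(y_true, y_pred, input_len, label_len)

# model.compile(optimizer="adam", loss=ctc_loss)  # used when compiling the ASR model
```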
✭ MAGNETRON ™ ✭: This is a Google Colab/Jupyter Notebook for developing a HEARING PROXIA (B) when working with ARTIFICIAL INTELLIGENCE 2.0 ™ (ARTIFICIAL INTELLIGENCE 2.0™ is part of MAGNETRON ™ TECHNOLOGY).
In this notebook, we recognize digits from 0 to 9 from audio recording files. The input is a speech signal and the output is a single digit.
Whisper AI is an automatic speech recognition (ASR) system. It is open source and can be accessed via GitHub or Hugging Face. This is the simplest way to run Whisper AI from GitHub in a Python Google Colab notebook.
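In practice that simplest path looks roughly like the sketch below; the model size and file name are illustrative, and the pip install line runs in a Colab cell.

```python
# In a Colab cell first run:  !pip install git+https://github.com/openai/whisper.git
import whisper

model = whisper.load_model("base")          # illustrative model size
result = model.transcribe("speech.mp3")     # illustrative audio file
print(result["text"])
```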
In this notebook, we convert an audio file of an English speaker to text using IBM Watson's Speech to Text API, then translate the English text to Spanish using the Language Translator API.
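A hedged sketch of that two-step pipeline with the ibm_watson SDK; API keys, service URLs, and file names are placeholders, and the notebook's exact calls may differ.

```python
# Speech to Text followed by English -> Spanish translation (placeholder credentials).
from ibm_watson import SpeechToTextV1, LanguageTranslatorV3
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

stt = SpeechToTextV1(authenticator=IAMAuthenticator("STT_APIKEY"))
stt.set_service_url("STT_URL")
with open("english_audio.wav", "rb") as audio:
    stt_result = stt.recognize(audio=audio, content_type="audio/wav").get_result()
english_text = stt_result["results"][0]["alternatives"][0]["transcript"]

translator = LanguageTranslatorV3(version="2018-05-01",
                                  authenticator=IAMAuthenticator("LT_APIKEY"))
translator.set_service_url("LT_URL")
spanish = translator.translate(text=english_text, model_id="en-es").get_result()
print(spanish["translations"][0]["translation"])
```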