Skip to content
GitHub Universe 2025
Explore 100+ talks, demos, and workshops at Universe 2025. Choose your favorites.
#

speech-recognition

Here are 17 public repositories matching this topic...

💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️

  • Updated Aug 1, 2025
  • Jupyter Notebook

WhisPad is a note management tool where you can write or dictate your notes using local or API AI models (supports speaker diarization). Rewrite your texts using different styles, dive in using AI, translate, summarize, create mind maps, node graphs and even quizs and flashcards based on each note. A powerful companion for researchers and students.

  • Updated Aug 6, 2025
  • C++
whisper-colab-tpu-transcriber

High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.

  • Updated Jun 8, 2025
  • Jupyter Notebook

In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.

  • Updated Jun 18, 2024
  • Jupyter Notebook

The GitHub repository focuses on transforming audio files into mel-spectrogram images. It was created for the "UrbanSound8k Mel Spectrogram Images" dataset on Kaggle. Key features include sound visualization and dataset creation for sound analysis. The repository includes an Audio-to-Spectrogram.ipynb notebook for creating spectrograms.

  • Updated Dec 14, 2023
  • Jupyter Notebook
magnetron.artificial-intelligence-2.0.mincloud.proxia--INSTINCTIVE-MIND-5

✭ MAGNETRON ™ ✭: This is a Google Colab/Jupyter Notebook for developing a HEARING PROXIA (B) when working with ARTIFICIAL INTELLIGENCE 2.0 ™ (ARTIFICIAL INTELLIGENCE 2.0™ is part of MAGNETRON ™ TECHNOLOGY).

  • Updated Sep 22, 2022
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more