GitHub topics: nvidia-nemo
GoogleCloudPlatform/nvidia-nemo-on-gke
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
Language: HCL - Size: 964 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 12 - Forks: 7

bunyaminergen/WavLMMSDD
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.
Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 3

cr4yfish/nouv
Free AI & Community powered Learning Experience
Language: TypeScript - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 41 - Forks: 0

j3soon/LLM-Tutorial
LLM tutorial materials include but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.
Language: Jupyter Notebook - Size: 3.07 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

HROlive/Poland-End-To-End-LLM-Bootcamp
This bootcamp is designed to give NLP researchers an end-to-end overview on the fundamentals of NVIDIA NeMo framework, complete solution for building large language models. It will also have hands-on exercises complimented by tutorials, code snippets, and presentations to help researchers kick-start with NeMo LLM Service and Guardrails.
Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

GameOfPods/PAT
PodcastProject Analytics Toolkit - Project that creates analytics various input data. Exported data is intended to be used in a PodcastProject website
Language: Python - Size: 150 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

transiteration/stt_kz_quartznet15x5
Implementation of a Kazakh Speech-to-Text Model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh speech into text.
Language: Python - Size: 55.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Rumeysakeskin/Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Language: Python - Size: 8.84 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 3

Rumeysakeskin/ASR-Quantization
Post-training quantization on Nvidia Nemo ASR model
Language: Jupyter Notebook - Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

InfiniteHelios/nemo-audio-profanity-detector-app
Audio profanity detector desktop app developed with PyQt5 using NVidia-Nemo tech.
Language: Python - Size: 238 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

aaaastark/NeMo-WeightsBiases-TTS
Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases
Language: Jupyter Notebook - Size: 5.04 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Rumeysakeskin/NLP-Onnx-TensorRT
Joint Intent/Slot Classification for Jetson Nano, TX1/TX2, Xavier NX, and AGX Xavier
Language: Jupyter Notebook - Size: 146 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

ssharkov03/ru-speech-recognition
Module for russian speech recognition using NVIDIA Nemo.
Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Rumeysakeskin/ASR-fine-tuning-for-low-resource-languages
Transfer learning for ASR with subword encoding CTC model (NVIDIA NeMo Citrinet) on low-resource languages
Language: Jupyter Notebook - Size: 455 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

denizariyan/Real-Time-Auto-Transcriber
Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to transcribe speech to text in real-time from any source. Requires CUDA capable GPU to run on the local machine, if setup using virtual audio cables can transcribe the audio that is being played in real-time without any other requirements.
Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

JINHXu/tutorial-speaker-identification-with-nemo
The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.
Language: Python - Size: 176 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 2
