GitHub topics: nvidia-nemo

Repositories

wcks13589/LLM-Tutorial

LLM tutorial materials, including but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.

Language: Python - Size: 3.11 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 1

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach to speaker diarization from NVIDIA.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 3

GoogleCloudPlatform/nvidia-nemo-on-gke

Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine

Language: HCL - Size: 964 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 7

cr4yfish/nouv

Free AI & Community powered Learning Experience

Language: TypeScript - Size: 1.41 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 41 - Forks: 0

HROlive/Poland-End-To-End-LLM-Bootcamp

This bootcamp is designed to give NLP researchers an end-to-end overview of the fundamentals of the NVIDIA NeMo framework, a complete solution for building large language models. It also includes hands-on exercises complemented by tutorials, code snippets, and presentations to help researchers get started with NeMo LLM Service and Guardrails.

Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

GameOfPods/PAT

PodcastProject Analytics Toolkit - Project that creates analytics from various input data. Exported data is intended to be used in a PodcastProject website.

Language: Python - Size: 150 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

transiteration/stt_kz_quartznet15x5

Implementation of a Kazakh Speech-to-Text Model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh speech into text.

Language: Python - Size: 55.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Rumeysakeskin/Turkish-Text-to-Speech

Speech synthesis (TTS) in low-resource languages by training from scratch with FastPitch and fine-tuning with HiFi-GAN

Language: Python - Size: 8.84 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 3

Rumeysakeskin/ASR-Quantization

Post-training quantization on an NVIDIA NeMo ASR model

Language: Jupyter Notebook - Size: 32.2 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

InfiniteHelios/nemo-audio-profanity-detector-app

Audio profanity detector desktop app developed with PyQt5 using NVIDIA NeMo.

Language: Python - Size: 238 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

aaaastark/NeMo-WeightsBiases-TTS

Training and tuning a text-to-speech model with NVIDIA NeMo and Weights and Biases

Language: Jupyter Notebook - Size: 5.04 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Rumeysakeskin/NLP-Onnx-TensorRT

Joint Intent/Slot Classification for Jetson Nano, TX1/TX2, Xavier NX, and AGX Xavier

Language: Jupyter Notebook - Size: 146 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ssharkov03/ru-speech-recognition

Module for Russian speech recognition using NVIDIA NeMo.

Language: Python - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Rumeysakeskin/ASR-fine-tuning-for-low-resource-languages

Transfer learning for ASR with subword encoding CTC model (NVIDIA NeMo Citrinet) on low-resource languages

Language: Jupyter Notebook - Size: 455 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

denizariyan/Real-Time-Auto-Transcriber

Automatic transcriber made with the NVIDIA NeMo AI toolkit. Used to transcribe speech to text in real time from any source. Requires a CUDA-capable GPU to run on the local machine. If set up using virtual audio cables, it can transcribe audio that is being played in real time without any other requirements.

Language: Python - Size: 25.4 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

JINHXu/tutorial-speaker-identification-with-nemo

The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.

Language: Python - Size: 176 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 2

Related Keywords

nvidia-nemo (16), nemo (4), pytorch (3), speech-recognition (3), nvidia (3), asr (2), audio (2), speaker-recognition (2), pytorch-lightning (2), fastpitch (2), nvidia-cuda (2), nvidia-gpu (2), hifigan (2), llm (2), nemo-guardrails (2), speaker-diarization (2), speech-to-text (2), low-resource-languages (2), profanity-detection (1), pyqt5 (1), text-to-speech (1), quantization (1), post-training-quantization (1), weights-and-biases (1), intent-slot-classification (1), model-deployment (1), nvidia-jetson (1), onnx-models (1), onnxruntime (1), tensorrt-inference (1), tutorial (1), speaker-identification (1), neural-networks (1), neural-network (1), machine-learning (1), classification (1), transcriber (1), subtitle (1), real-time (1), hearing-impaired (1), audio-processing (1), accesibility (1), transfer-learning (1), tokenizer (1), google-sentencepiece (1), fine-tuning (1), ctc-model (1), citrinet (1), spelling-correction (1), russian-language (1), chunking (1), waveform-generator (1), llama2 (1), gpt (1), react (1), nextjs (1), mistral (1), generative-ai (1), gemini (1), education (1), ai (1), megatron-lm (1), gke (1), wavlm (1), speech-embedding (1), speech (1), microsoft (1), embedding (1), diarization (1), tensorrt-llm (1), turkish-text-to-speech (1), tts (1), speech-synthesis (1), spectrogram-generator (1), phonetical-conversion (1), nvidia-docker (1), stt (1), transcription (1), summary (1), podcast (1), openai (1), books (1), triton (1), tensorrt (1), prompt-tuning (1), p-tuning (1), llm-training (1), llm-inference (1)
