# Transformer

A transformer is a deep learning architecture based on self-attention mechanisms, designed to process sequential data in parallel. Transformers are the foundation of modern large language models and are widely used in natural language processing, computer vision, and generative AI.
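The self-attention mechanism mentioned above can be sketched in a few lines. Below is a minimal, single-head example in NumPy; the function name, weight matrices, and shapes are illustrative assumptions rather than any particular library's API, and real transformer layers add multi-head projections, masking, residual connections, and normalization around this core.

```python
# Minimal sketch of scaled dot-product self-attention (single head, no masking).
# All names and shapes are illustrative assumptions.
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """x: (seq_len, d_model); w_q/w_k/w_v: (d_model, d_k) projection matrices."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v             # project tokens to queries, keys, values
    scores = q @ k.T / np.sqrt(k.shape[-1])         # pairwise similarity between all positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the key dimension
    return weights @ v                               # each output mixes values from every position

# Usage: 5 tokens, model width 8, head width 4 (shapes chosen only for the demo).
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))
out = self_attention(x, *(rng.normal(size=(8, 4)) for _ in range(3)))
print(out.shape)  # (5, 4)
```

Because the score matrix relates every position to every other position in one matrix product, the whole sequence is processed in parallel, which is the property the definition above refers to.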

Here are 6,835 public repositories matching this topic...

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

  • Updated Jan 22, 2026
  • Python

RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNN and transformer: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

  • Updated Mar 30, 2026
  • Python
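To illustrate the "constant space (no kv-cache)" claim in the description above, the sketch below processes tokens one at a time with a fixed-size state. It is a toy exponentially decayed recurrence, not the RWKV-7 update rule; the decay factor and readout are assumptions made only to show that memory does not grow with context length, unlike an attention cache.

```python
# Toy illustration of constant-space recurrent processing.
# NOT the RWKV update rule; only shows that per-token state stays fixed in size.
import numpy as np

def recurrent_readout(tokens, decay=0.9):
    d = tokens.shape[-1]
    state = np.zeros(d)              # fixed-size state, independent of context length
    outputs = []
    for x in tokens:                 # one token at a time, O(d) memory throughout
        state = decay * state + x    # fold the new token into the running state
        outputs.append(state.copy())
    return np.stack(outputs)

print(recurrent_readout(np.ones((1000, 16))).shape)  # (1000, 16), with only a length-16 state
```

A transformer decoding the same 1000 tokens would keep keys and values for every past position, so its cache grows linearly with context; the recurrence above keeps a single vector regardless of sequence length.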

An easy-to-use speech toolkit including self-supervised learning models, SOTA/streaming ASR with punctuation, streaming TTS with a text frontend, a speaker verification system, end-to-end speech translation, and keyword spotting. Winner of the NAACL 2022 Best Demo Award.

  • Updated Apr 2, 2026
  • Python
167 followers · Website: github.com/topics/transformer · Wikipedia