An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pre-training

yassinelahdiy/page-language-model

Open-source framework for defining Page Language Models (PLMs) for intelligent app understanding and AI-assisted testing.

Language: Python - Size: 26.4 KB - Last synced at: about 12 hours ago - Pushed at: about 12 hours ago - Stars: 0 - Forks: 0

Event-AHU/Medical_Image_Analysis

Foundation models based medical image analysis

Language: Python - Size: 28.3 MB - Last synced at: about 11 hours ago - Pushed at: about 17 hours ago - Stars: 126 - Forks: 3

princeton-nlp/LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language: Python - Size: 19 MB - Last synced at: about 15 hours ago - Pushed at: about 1 year ago - Stars: 599 - Forks: 52

RUCAIBox/LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language: Python - Size: 43.1 MB - Last synced at: about 15 hours ago - Pushed at: about 1 month ago - Stars: 11,383 - Forks: 882

GAIR-NLP/ProX

Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Language: Python - Size: 15.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 236 - Forks: 18

modelscope/data-juicer

Data processing for and with foundation models! 🍎 πŸ‹ 🌽 ➑️ ➑️🍸 🍹 🍷

Language: Python - Size: 169 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 4,222 - Forks: 227

Zehong-Wang/Awesome-Foundation-Models-on-Graphs

A collection of graph foundation models including papers, codes, and datasets.

Size: 3.61 MB - Last synced at: 2 days ago - Pushed at: 6 days ago - Stars: 6 - Forks: 0

ChandlerBang/awesome-self-supervised-gnn

Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).

Language: Python - Size: 730 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 1,662 - Forks: 161

zjunlp/KnowLM

An Open-sourced Knowledgable Large Language Model Framework.

Language: Python - Size: 38.7 MB - Last synced at: about 14 hours ago - Pushed at: 3 months ago - Stars: 1,304 - Forks: 132

Southla/Supervised_Learning

Supervised Learning project from TripleTen

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

helicalAI/helical

This repository contains the python package for Helical

Language: Python - Size: 21.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 107 - Forks: 14

linwhitehat/ET-BERT

The repository of ET-BERT, a network traffic classification model on encrypted traffic. The work has been accepted as The Web Conference (WWW) 2022 accepted paper.

Language: Python - Size: 13.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 458 - Forks: 92

OpenDriveLab/ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Language: Python - Size: 35.6 MB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 306 - Forks: 21

dbiir/UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Language: Python - Size: 50.5 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 3,055 - Forks: 523

SalesforceAIResearch/uni2ts

Unified Training of Universal Time Series Forecasting Transformers

Language: Jupyter Notebook - Size: 7.11 MB - Last synced at: 7 days ago - Pushed at: 23 days ago - Stars: 1,094 - Forks: 127

yzhuoning/Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Size: 60.5 KB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 1,191 - Forks: 57

EgoAlpha/prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

Language: Jupyter Notebook - Size: 44.2 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 1,576 - Forks: 96

cxcscmu/Craw4LLM

Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

Language: Python - Size: 79.1 KB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 608 - Forks: 56

nancheng58/Awesome-LLM4RS-Papers

Large Language Model-enhanced Recommender System Papers

Size: 159 KB - Last synced at: 8 days ago - Pushed at: 2 months ago - Stars: 662 - Forks: 53

qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM

A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.

Size: 40 KB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 1,032 - Forks: 77

sayakpaul/probing-vits

Probing the representations of Vision Transformers.

Language: Jupyter Notebook - Size: 33.3 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 324 - Forks: 20

koudounasalkis/voc2vec

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Language: Python - Size: 19.5 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 16 - Forks: 0

csiro-robotics/Pair-VPR

[IEEE RA-L 2025] The official repository for Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers

Language: Python - Size: 18.9 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 26 - Forks: 1

koalazf99/Awesome-DataCentric-LLM

Trending projects & awesome papers about data-centric llm studies.

Size: 13.7 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 34 - Forks: 2

SiyuanYan1/PanDerm

PanDerm: A General-Purpose Multimodal Foundation Model for Dermatology

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 15 - Forks: 0

WHU-Sigma/HyperSIGMA

The official repo for [TPAMI'25] "HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model"

Language: Python - Size: 80.5 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 210 - Forks: 18

jusiro/DLILP

[IPMI'25] A Reality-check of vision-language pre-training for radiology.

Language: Python - Size: 104 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

LirongWu/awesome-graph-self-supervised-learning

Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"

Size: 444 KB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 1,401 - Forks: 167

microsoft/Oscar πŸ“¦

Oscar and VinVL

Language: Python - Size: 715 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 1,048 - Forks: 252

balavenkatesh3322/audio-pretrained-model

A collection of Audio and Speech pre-trained models.

Size: 134 KB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 188 - Forks: 26

Tencent/TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Language: Python - Size: 41.2 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 1,072 - Forks: 148

ViTAE-Transformer/MTP

The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"

Language: Python - Size: 18 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 215 - Forks: 11

ViTAE-Transformer/SAMRS

The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"

Language: Python - Size: 30 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 323 - Forks: 16

michiyasunaga/DrRepair

[ICML 2020] DrRepair: Learning to Repair Programs from Error Messages

Language: Python - Size: 1.77 MB - Last synced at: 12 days ago - Pushed at: almost 4 years ago - Stars: 194 - Forks: 32

sagizty/PuzzleTuning

The official repo of PuzzleTuning: Explicitly Bridge Pathological and Natural Image with Puzzles (arXiv: 2311.06712)

Language: Jupyter Notebook - Size: 41.5 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 13 - Forks: 3

GAIR-NLP/MathPile

[NeurlPS D&B 2024] Generative AI for Math: MathPile

Language: Python - Size: 3.39 MB - Last synced at: 12 days ago - Pushed at: 16 days ago - Stars: 410 - Forks: 22

OpenDriveLab/MPI

[RSS 2024] Learning Manipulation by Predicting Interaction

Language: Python - Size: 1.77 MB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 103 - Forks: 1

brightmart/bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

Language: Python - Size: 16 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 964 - Forks: 211

ViTAE-Transformer/APTv2

The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K

Language: Python - Size: 9.89 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 19 - Forks: 0

StefanHeng/ECG-Representation-Learning

Self-supervised pre-training for ECG representation with inspiration from transformers & computer vision

Language: Python - Size: 37.7 MB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 24 - Forks: 6

microsoft/XPretrain

Multi-modality pre-training

Language: Python - Size: 3.59 MB - Last synced at: 15 days ago - Pushed at: 12 months ago - Stars: 490 - Forks: 37

zjunlp/MolGen

[ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback

Language: Python - Size: 16.4 MB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 155 - Forks: 13

korovod/nanotron Fork of huggingface/nanotron

Experimental fork of Nanotron, a minimalistic large language model 3D-parallelism training

Language: Python - Size: 12.3 MB - Last synced at: 6 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

lucidrains/mlm-pytorch

An implementation of masked language modeling for Pytorch, made as concise and simple as possible

Language: Python - Size: 18.6 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 179 - Forks: 24

Lupin1998/Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Language: Python - Size: 6.67 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 324 - Forks: 17

lucidrains/electra-pytorch

A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch

Language: Python - Size: 92.8 KB - Last synced at: 14 days ago - Pushed at: almost 2 years ago - Stars: 225 - Forks: 46

NVlabs/PS3

Scaling Vision Pre-Training to 4K Resolution

Size: 5.06 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 6 - Forks: 0

hongfz16/HCMoCo

[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception

Language: Python - Size: 2.71 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 120 - Forks: 7

ViTAE-Transformer/RSP

The official repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining"

Language: Python - Size: 16.5 MB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 142 - Forks: 8

wangxiao5791509/MultiModal_BigModels_Survey

[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models

Size: 13.2 MB - Last synced at: 17 days ago - Pushed at: 2 months ago - Stars: 286 - Forks: 17

jackroos/VL-BERT

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

Language: Jupyter Notebook - Size: 5.41 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 740 - Forks: 111

camalab-ai/FIP

Frame Interpolatyion Pretraining for Video Denoising

Language: Python - Size: 26.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

showlab/all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

Language: Python - Size: 1.53 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 281 - Forks: 17

ZhangYuanhan-AI/Bamboo

Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.

Language: Python - Size: 5.41 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 175 - Forks: 7

ShinoharaHare/LLM-Training

A distributed training framework for large language models powered by Lightning.

Language: Python - Size: 281 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 4

VITA-Group/CV_LTH_Pre-training

[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang

Language: Python - Size: 1.44 MB - Last synced at: about 8 hours ago - Pushed at: over 2 years ago - Stars: 69 - Forks: 14

THUDM/GCC

GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training @ KDD 2020

Language: Python - Size: 563 KB - Last synced at: 14 days ago - Pushed at: almost 2 years ago - Stars: 326 - Forks: 54

HROlive/Computer-Vision-for-Industrial-Inspection

How to create an end-to-end hardware-accelerated industrial inspection pipeline to automate defect detection.

Language: Jupyter Notebook - Size: 145 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

bigdata-ustc/Zero-1-to-3

Source codes and datasets for paper "Zero-1-to-3: Domain-level Zero-shot Cognitive Diagnosis via One Batch of Early-bird Students towards Three Diagnostic Objectives" (AAAI2024)

Language: Python - Size: 18.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 3

CVMI-Lab/SlotCon

(NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping

Language: Python - Size: 1.63 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 96 - Forks: 9

graphixxxx/Denoising_AutoEncoder

This project integrates Autoencoders, PCA, and CNNs for efficient image processing, combining dimensionality reduction, denoising, and enhanced feature extraction for image analysis and compression.

Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

zhjohnchan/awesome-vision-and-language-pretraining

A curated list of vision-and-language pre-training (VLP). :-)

Size: 125 KB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 58 - Forks: 7

haofanwang/awesome-vision-language-modeling

Recent Advances in Vision-Language Pre-training!

Size: 18.6 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 2

kakaobrain/helo-word

Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task

Language: Python - Size: 4.1 MB - Last synced at: 17 days ago - Pushed at: over 5 years ago - Stars: 92 - Forks: 22

google-research-datasets/conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

Size: 97.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 380 - Forks: 20

VITA-Group/BERT-Tickets

[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Zhangyang Wang, Michael Carbin

Language: Python - Size: 3.29 MB - Last synced at: about 8 hours ago - Pushed at: over 3 years ago - Stars: 140 - Forks: 19

wxl1999/PLMPapers

A paper list of pre-trained language models (PLMs).

Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 80 - Forks: 32

hexuandeng/DRPruning

Language: Python - Size: 10.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

Shakilkhan24/Playground_DL

Random exploration of different large language models...

Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

HaoranZhuExplorer/AD-L-JEPA-Release

Source code repo for "AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data"

Language: Python - Size: 746 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

RS2002/Adversarial-MidiBERT

Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial Pre-training

Language: Python - Size: 1.11 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 0

DeepGraphLearning/SiamDiff

Code for Pre-training Protein Encoder via Siamese Sequence-Structure Diffusion Trajectory Prediction (https://arxiv.org/abs/2301.12068)

Language: Python - Size: 1.86 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 39 - Forks: 5

playerony/TensorFlowTTS-ts

This project implements TensorflowTTS in Tensorflow.js using Typescript, enabling real-time text-to-speech in the browser. With pre-trained model for English language, you can generate high-quality speech from text input.

Language: TypeScript - Size: 31.5 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 5

YuanchenBei/Awesome-Pretraining-for-Graph-Neural-Networks

A curated list of papers on pre-training for graph neural networks (Pre-train4GNN).

Size: 169 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 178 - Forks: 12

VITA-Group/Adv-SS-Pretraining

[CVPR 2020] Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning

Language: Python - Size: 974 KB - Last synced at: about 8 hours ago - Pushed at: over 3 years ago - Stars: 85 - Forks: 13

DeepGraphLearning/GearNet

GearNet and Geometric Pretraining Methods for Protein Structure Representation Learning, ICLR'2023 (https://arxiv.org/abs/2203.06125)

Language: Python - Size: 512 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 278 - Forks: 29

floatingstarZ/GeRSP

The official implementation of GeRSP (Generic Knowledge Boosted Pre-training For Remote Sensing Images).

Language: Jupyter Notebook - Size: 25.6 MB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 23 - Forks: 4

HICAI-ZJU/OpenProtein

Open-Protein is an open source pre-training platform that supports multiple protein pre-training models and downstream tasks.

Language: Python - Size: 2.75 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 18 - Forks: 1

ZechengLi19/Awesome-Sign-Language

Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick start your awesome work with us!! 🀟🀟🀟

Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 93 - Forks: 1

maxkokot/CrowdCounting

Crowd Counting using Xception

Language: Jupyter Notebook - Size: 1.52 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

wenhuchen/KGPT

Code and Data for EMNLP2020 Paper "KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation"

Language: Python - Size: 1020 KB - Last synced at: 16 days ago - Pushed at: almost 4 years ago - Stars: 149 - Forks: 19

ChenRocks/UNITER

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

Language: Python - Size: 172 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 786 - Forks: 109

zhanghm1995/Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

Size: 34.8 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 243 - Forks: 10

Shen-Lab/GraphCL

[NeurIPS 2020] "Graph Contrastive Learning with Augmentations" by Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, Yang Shen

Language: Python - Size: 244 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 558 - Forks: 103

wangtz19/NetMamba

Efficient Network Traffic Classification via Pre-training Unidirectional Mamba

Language: Python - Size: 1.22 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 63 - Forks: 5

KoreaMGLEE/Concept-based-curriculum-masking

Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking

Language: Python - Size: 171 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 1

louisbrulenaudet/tsdae

Tranformer-based Denoising AutoEncoder for Sentence Transformers Unsupervised pre-training.

Language: Python - Size: 87.9 KB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 6 - Forks: 3

mczhuge/Kaleido-BERT

πŸ’Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

Language: Python - Size: 9.98 MB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 262 - Forks: 19

acbull/GPT-GNN

Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"

Language: Python - Size: 9.51 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 487 - Forks: 87

lucidrains/coco-lm-pytorch

Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch

Language: Python - Size: 120 KB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 45 - Forks: 7

westlake-repl/Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review

Paper List of Pre-trained Foundation Recommender Models

Size: 444 KB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 307 - Forks: 25

FudanDISC/ReForm-Eval

An benchmark for evaluating the capabilities of large vision-language models (LVLMs)

Language: Python - Size: 10 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 33 - Forks: 4

SLAMPAI/large-scale-pretraining-transfer

Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training on full and few-shot transfer learning for natural and medical images" (https://arxiv.org/abs/2106.00116)

Language: Jupyter Notebook - Size: 401 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 4

fajieyuan/SIGIR2021_Conure

Pre-training and Lifelong learning for User Embedding and Recommender System

Language: Python - Size: 3.44 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 44 - Forks: 5

Event-AHU/VehicleMAE

[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang

Language: Python - Size: 6.71 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 19 - Forks: 1

fajieyuan/universal_user_representation

papers of universal user representation learning for recommendation

Size: 56.6 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 1

akanyaani/gpt-2-tensorflow2.0

OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0

Language: Python - Size: 4.67 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 259 - Forks: 83

GentleZhu/EGI

Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization (NeurIPS 21')

Language: Python - Size: 240 KB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 5

lucidrains/marge-pytorch

Implementation of Marge, Pre-training via Paraphrasing, in Pytorch

Language: Python - Size: 166 KB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 75 - Forks: 11

GanjinZero/KeBioLM

Improving Biomedical Pretrained Language Models with Knowledge [BioNLP 2021]

Language: Python - Size: 1.45 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 65 - Forks: 5

Related Keywords
pre-training 164 deep-learning 34 pytorch 24 transfer-learning 20 bert 19 llm 18 self-supervised-learning 18 large-language-models 15 transformer 15 fine-tuning 14 nlp 13 representation-learning 13 transformers 12 machine-learning 11 vision-and-language 11 graph-neural-networks 10 gpt 10 contrastive-learning 10 language-model 10 foundation-models 9 pre-trained-model 9 computer-vision 8 unsupervised-learning 8 instruction-tuning 7 natural-language-processing 7 classification 7 llama 7 artificial-intelligence 6 recommender-system 6 foundation-model 6 benchmark 5 mae 5 multimodal 5 continual-learning 5 remote-sensing 5 multimodal-learning 5 vision-language 4 lifelong-learning 4 few-shot-learning 4 transfer 4 pretraining 4 vision-transformer 4 chinese 4 efficiency 4 video-question-answering 4 vqa 4 huggingface 4 robotics 3 masked-image-modeling 3 self-attention 3 cold-start 3 semantic-segmentation 3 data-augmentation 3 synthetic-data 3 python 3 weakly-supervised-learning 3 video-understanding 3 ner 3 gpt-2 3 large-language-model 3 user-modeling 3 medical-image-analysis 3 user-representation 3 vision-language-model 3 clip 3 pruning 3 chatgpt 3 in-context-learning 3 llms 3 pre-trained-language-models 3 tensorflow 3 autonomous-driving 3 cross-domain-recommendation 3 multi-modal 3 prompt-tuning 3 survey 3 dataset 3 image-recognition 2 t5 2 object-detection 2 keras 2 change-detection 2 unilm 2 attention 2 audio 2 xlm-roberta 2 forecasting 2 awesome-list 2 pretext-task 2 chest-xray-images 2 vit 2 medical-imaging 2 segmentation 2 prompt 2 language-understanding 2 remote-sensing-foundation-model 2 bert-model 2 world-models 2 protein 2 protein-representation-learning 2