GitHub topics: simcse
hppRC/simple-simcse-ja
Exploring Japanese SimCSE
Language: Python - Size: 1.29 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 68 - Forks: 4

MoleculeTransformers/smiles-featurizers
Extract Molecular SMILES embeddings from language models pre-trained with various objectives architectures.
Language: Python - Size: 39.1 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 1

DolbyUUU/Reinforcement-Calibration-SimCSE
Reinforcement Calibration SimCSE, combining contrastive learning, artificial potential fields, perceptual loss, and RLHF to achieve improved Semantic Textual Similarity (STS) embeddings. PyTorch-based implementations of PerceptualBERT and ForceBasedInfoNCE, along with fine-tuning capabilities via RLHF and evaluation using SentEval.
Language: Python - Size: 371 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

daekeun-ml/KoSimCSE-SageMaker
This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and distributed training is possible with Amazon SageMaker.
Language: Jupyter Notebook - Size: 158 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 7

sn2727/finetuning-embedding-models
Domain adaption for an embedding model using unsupervised and supervised finetuning on scientific texts for the SciFact retrieval task.
Language: Jupyter Notebook - Size: 406 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

luozhouyang/transformers-keras
Transformer-based models implemented in tensorflow 2.x(using keras).
Language: Python - Size: 696 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 13

hellonlp/sentence-similarity
文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT
Language: Python - Size: 221 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 37 - Forks: 11

4AI/langml
A Keras-based and TensorFlow-backend NLP Models Toolkit.
Language: Python - Size: 16.7 MB - Last synced at: about 16 hours ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 1

luozhouyang/DeepSE
Sentence Embeddings using Deep Nerual Networks in PRODUCTION!
Language: Python - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 1

Lollipop/CRLT
CRLT: A Unified Contrastive Learning Toolkit for Unsupervised Text Representation Learning
Language: Python - Size: 737 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

shawroad/Semantic-Textual-Similarity-Pytorch
experiments of some semantic matching models and comparison of experimental results.
Language: Python - Size: 9.21 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 140 - Forks: 15

jifei/simcse-tf2
A TensorFlow 2 Keras implementation of SimCSE with unsupervised and supervised.
Language: Python - Size: 18.6 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 2

perceptiveshawty/RankCSE
Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)
Language: Python - Size: 392 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 4

muyuuuu/E-commerce-Search-Recall
天池阿里灵杰问天引擎电商搜索算法赛非官方 baseline,又名 NLP 从入门到 22/2771。
Language: Python - Size: 1.78 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 68 - Forks: 12

naivenlp/rapidnlp-datasets
Data pipelines for both TensorFlow and PyTorch!
Language: Python - Size: 117 KB - Last synced at: about 1 hour ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0
