An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: simcse

hppRC/simple-simcse-ja

Exploring Japanese SimCSE

Language: Python - Size: 1.29 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 68 - Forks: 4

MoleculeTransformers/smiles-featurizers

Extract Molecular SMILES embeddings from language models pre-trained with various objectives architectures.

Language: Python - Size: 39.1 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 1

DolbyUUU/Reinforcement-Calibration-SimCSE

Reinforcement Calibration SimCSE, combining contrastive learning, artificial potential fields, perceptual loss, and RLHF to achieve improved Semantic Textual Similarity (STS) embeddings. PyTorch-based implementations of PerceptualBERT and ForceBasedInfoNCE, along with fine-tuning capabilities via RLHF and evaluation using SentEval.

Language: Python - Size: 371 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

daekeun-ml/KoSimCSE-SageMaker

This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and distributed training is possible with Amazon SageMaker.

Language: Jupyter Notebook - Size: 158 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 7

sn2727/finetuning-embedding-models

Domain adaption for an embedding model using unsupervised and supervised finetuning on scientific texts for the SciFact retrieval task.

Language: Jupyter Notebook - Size: 406 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

luozhouyang/transformers-keras

Transformer-based models implemented in tensorflow 2.x(using keras).

Language: Python - Size: 696 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 13

hellonlp/sentence-similarity

文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT

Language: Python - Size: 221 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 37 - Forks: 11

4AI/langml

A Keras-based and TensorFlow-backend NLP Models Toolkit.

Language: Python - Size: 16.7 MB - Last synced at: about 16 hours ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 1

luozhouyang/DeepSE

Sentence Embeddings using Deep Nerual Networks in PRODUCTION!

Language: Python - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 1

Lollipop/CRLT

CRLT: A Unified Contrastive Learning Toolkit for Unsupervised Text Representation Learning

Language: Python - Size: 737 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

shawroad/Semantic-Textual-Similarity-Pytorch

experiments of some semantic matching models and comparison of experimental results.

Language: Python - Size: 9.21 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 140 - Forks: 15

jifei/simcse-tf2

A TensorFlow 2 Keras implementation of SimCSE with unsupervised and supervised.

Language: Python - Size: 18.6 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 2

perceptiveshawty/RankCSE

Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)

Language: Python - Size: 392 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 4

muyuuuu/E-commerce-Search-Recall

天池阿里灵杰问天引擎电商搜索算法赛非官方 baseline,又名 NLP 从入门到 22/2771。

Language: Python - Size: 1.78 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 68 - Forks: 12

naivenlp/rapidnlp-datasets

Data pipelines for both TensorFlow and PyTorch!

Language: Python - Size: 117 KB - Last synced at: about 1 hour ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0