GitHub / Ighina 5 Repositories
PhD Candidate in Speech and Language Processing, passionate about Digital Humanities and building things from scratch.
Ighina/docling Fork of DS4SD/docling
Get your documents ready for gen AI
Language: Python - Size: 29.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Ighina/Ninety_Words
Repo for project for the British Council
Size: 0 Bytes - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Ighina/ARP_Score
Average Relative Proximity metrics and experiments used in the paper "When Cohesion Lies in the Embedding Space: New Framework and Methodologies for Embedding-Based Reference-Free Metrics for Topic Segmentation".
Language: Python - Size: 986 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Ighina/LatinWSD
Repository for Paper "Language Pivoting from Parallel Corpora for Word Sense Disambiguation of Historical Languages: a Case Study on Latin" presented at COLING-LREC 2024
Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Ighina/TextGeneration
Qui c'è la ricetta di base per addestrare un nuovo modello neurale di generazione di testo partendo da testi arbitrari.
Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Ighina/CERTIFAI
A python implementation of CERTIFAI framework for machine learning models' explainability as discussed in https://www.aies-conference.com/2020/wp-content/papers/099.pdf
Language: Python - Size: 41 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 4

Ighina/NSE-TopicSegmentation
A repository including a variety of neural architectures for supervised topic segmentation
Language: Python - Size: 159 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Ighina/git-is-great
RSE git Module
Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/MultiModalSA
MultiModal Sentiment Analysis architectures for CMU-MOSEI.
Language: Python - Size: 2.88 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 6

Ighina/DeepTiling
A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive summarization and semantic search applications built on top of it.
Language: Python - Size: 4.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 2

Ighina/Audio-Topic-Segmentation
Repository for the paper "Exploring pre-trained Audio Neural Representations for Audio Topic Segmentation"
Language: Python - Size: 11.2 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/Language-Modelling-with-RNNs
A simple series of programs to train gated recurrent neural networks with PyTorch and generate text based on them.
Language: Python - Size: 6.62 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

Ighina/Latin-ISE-WSD
A large scale automatic analysis of selected lemmas sense change across centuries based on the Latin-ISE corpus and the original BERT-based word sense disambiguation system by Bamman et al. (2020)
Language: Jupyter Notebook - Size: 48.2 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/latin-bert-ise-wsd Fork of dbamman/latin-bert
Using Latin BERT for large scale word sense disambiguation on ISE corpus
Size: 6.7 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Ighina/git-is-great-1 Fork of Iain-S/git-is-great
RSE Git Module
Size: 31.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/VQ-VAE_Topic
An implementation of the paper [Vector-Quantization-Based Topic Modeling](https://dl.acm.org/doi/10.1145/3450946), providing a series of VQ-VAE models for topic modelling. The model reaches state-of-the-art performance on Ng20 and enables the extraction of dense topic vectors for downstream tasks.
Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

Ighina/bad-boids Fork of alan-turing-institute/bad-boids
A deliberately badly programmed implementation of Boids for teaching
Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Ighina/demorepo
It's a demo
Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Ighina/Coursera_Capstone
Repository for the data science specialisation by IBM on Coursera
Language: Jupyter Notebook - Size: 3.38 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Ighina/SemanticNetworkVizR
codes to perform semantic network analysis on multiple concepts (defined as multiple words-set, i.e. dictionaries) across multiple texts with R
Language: R - Size: 2.35 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Ighina/DigitRecogniser
A very, very basic digit recogniser and gaussian calculators functions with basic Python
Language: Python - Size: 17.6 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Ighina/SemanticEgoNetwork
codes to perform exploratory semantic network analysis on one concept of interest
Language: R - Size: 84 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Ighina/FrequencyApp
Shiny app to discover and visualise the occurrences of words and/or word-sets (i.e. dictionaries) in given txt files (up to 5)
Language: R - Size: 93.8 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
