An open API service providing repository metadata for many open source software ecosystems.

GitHub / Ighina 5 Repositories

PhD Candidate in Speech and Language Processing, passionate about Digital Humanities and building things from scratch.

Ighina/docling Fork of DS4SD/docling

Get your documents ready for gen AI

Language: Python - Size: 29.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Ighina/Ninety_Words

Repo for project for the British Council

Size: 0 Bytes - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Ighina/ARP_Score

Average Relative Proximity metrics and experiments used in the paper "When Cohesion Lies in the Embedding Space: New Framework and Methodologies for Embedding-Based Reference-Free Metrics for Topic Segmentation".

Language: Python - Size: 986 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Ighina/LatinWSD

Repository for Paper "Language Pivoting from Parallel Corpora for Word Sense Disambiguation of Historical Languages: a Case Study on Latin" presented at COLING-LREC 2024

Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Ighina/TextGeneration

Qui c'è la ricetta di base per addestrare un nuovo modello neurale di generazione di testo partendo da testi arbitrari.

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Ighina/CERTIFAI

A python implementation of CERTIFAI framework for machine learning models' explainability as discussed in https://www.aies-conference.com/2020/wp-content/papers/099.pdf

Language: Python - Size: 41 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 4

Ighina/NSE-TopicSegmentation

A repository including a variety of neural architectures for supervised topic segmentation

Language: Python - Size: 159 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Ighina/git-is-great

RSE git Module

Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/MultiModalSA

MultiModal Sentiment Analysis architectures for CMU-MOSEI.

Language: Python - Size: 2.88 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 6

Ighina/DeepTiling

A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive summarization and semantic search applications built on top of it.

Language: Python - Size: 4.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 2

Ighina/Audio-Topic-Segmentation

Repository for the paper "Exploring pre-trained Audio Neural Representations for Audio Topic Segmentation"

Language: Python - Size: 11.2 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/Language-Modelling-with-RNNs

A simple series of programs to train gated recurrent neural networks with PyTorch and generate text based on them.

Language: Python - Size: 6.62 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

Ighina/Latin-ISE-WSD

A large scale automatic analysis of selected lemmas sense change across centuries based on the Latin-ISE corpus and the original BERT-based word sense disambiguation system by Bamman et al. (2020)

Language: Jupyter Notebook - Size: 48.2 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/latin-bert-ise-wsd Fork of dbamman/latin-bert

Using Latin BERT for large scale word sense disambiguation on ISE corpus

Size: 6.7 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Ighina/git-is-great-1 Fork of Iain-S/git-is-great

RSE Git Module

Size: 31.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Ighina/VQ-VAE_Topic

An implementation of the paper [Vector-Quantization-Based Topic Modeling](https://dl.acm.org/doi/10.1145/3450946), providing a series of VQ-VAE models for topic modelling. The model reaches state-of-the-art performance on Ng20 and enables the extraction of dense topic vectors for downstream tasks.

Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

Ighina/bad-boids Fork of alan-turing-institute/bad-boids

A deliberately badly programmed implementation of Boids for teaching

Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Ighina/demorepo

It's a demo

Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Ighina/Coursera_Capstone

Repository for the data science specialisation by IBM on Coursera

Language: Jupyter Notebook - Size: 3.38 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Ighina/SemanticNetworkVizR

codes to perform semantic network analysis on multiple concepts (defined as multiple words-set, i.e. dictionaries) across multiple texts with R

Language: R - Size: 2.35 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Ighina/DigitRecogniser

A very, very basic digit recogniser and gaussian calculators functions with basic Python

Language: Python - Size: 17.6 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Ighina/SemanticEgoNetwork

codes to perform exploratory semantic network analysis on one concept of interest

Language: R - Size: 84 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Ighina/FrequencyApp

Shiny app to discover and visualise the occurrences of words and/or word-sets (i.e. dictionaries) in given txt files (up to 5)

Language: R - Size: 93.8 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0