Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

codeberg.org / haining

haining/stylistic_similarity_corpus

A corpus used for measuring stylistic similarity.

Language: - Size: 23.4 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/lower_level_features_for_chinese_authorship_attribution

Test a range of lower-level features for Chinese authorship attribution.

Language: - Size: 101 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/scientific_abstract_simplification

Simplify academic language found in scientific paper abstracts into more accessible levels with PLMs.

Language: - Size: 492 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/weibo_creative_language_use

A repo for creative language use on Weibo.

Language: - Size: 101 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/blog8965

The Blog8965 corpus is an English authorship attribution testbed for contemporary English prose. It has 8,965 candidate authors, 542k+ posts, and pre-defined data split (train/dev/test proportional to ca. 8:1:1).

Language: - Size: 138 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/sas

scientific abstract simplification

Language: Python - Size: 157 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/reduce-idiosyncratic-spelling

Reduce idiosyncratic spellings that tell the identity with T5 v1.1.

Language: - Size: 691 MB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/bible_style_transfer

Transfer style found in the Bible corpus with PLMs.

Language: - Size: 74.9 MB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/chinese_authorship_attribution

In search of better, lower-level stylistic signals for Chinese authorship attribution.

Language: - Size: 1.36 GB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/luxun

Demystifying the disputed texts between Lu Xun and his brother Zhou Zuoren (周 作人).

Language: - Size: 4.93 MB - Last synced: over 1 year ago - Stars: 1 - Forks: 0

haining/scientific_abstract_simplification_ πŸ“¦

The proposed project is to simplify academic language found in scientific paper abstracts into more accessible levels.

Language: - Size: 1.84 GB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/Classical-Modern

ιžεΈΈε…¨ηš„ζ–‡θ¨€ζ–‡οΌˆε€ζ–‡οΌ‰-ηŽ°δ»£ζ–‡εΉ³θ‘Œθ―­ζ–™

Language: - Size: 138 MB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/prompting_adversarial_stylometry

Learning disentanglement representation with ByT5-based VAE to conceal privacy-sensitive information found in one's writing style.

Language: - Size: 99.6 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/cctaa

Language: - Size: 281 KB - Last synced: over 1 year ago - Stars: 0 - Forks: 0

haining/cross-register-authorship-attribution-corpus

This corpus contains writing samples of eight authors living during the Ming and Qing Dynasties who were known capable of writing in both classical Chinese and vernacular Chinese.

Language: - Size: 8.37 MB - Last synced: over 1 year ago - Stars: 0 - Forks: 0