An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: reddit-dataset

giocoal/reddit-tldr-summarizer-and-topic-modeling

Extreme Extractive Text Summarization and Topic Modeling (using LSA and LDA techniques) over Reddit Posts from TLDRHQ dataset.

Language: Python - Size: 52.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

luminati-io/Reddit-dataset-samples

A sample dataset of over 1000 Reddit posts , extracted using the Bright Data API, ideal for sentiment analysis, consumer monitoring, trend identification, and competitor analysis.

Size: 4.14 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

DataSenseiAryan/TS3000_TheChatBOT

Its a social networking chat-bot trained on Reddit dataset . It supports open bounded queries developed on the concept of Neural Machine Translation. Beware of its being sarcastic just like its creator :stuck_out_tongue_closed_eyes: BDW it uses Pytorch framework and Python3.

Language: Python - Size: 704 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 5

eric810905/twiddit

Data Engineering Project @ Insight Data Science

Language: Python - Size: 979 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 3

will-molloy/MapReduce-K-means-image-processing 📦

K-means image/video data clustering via. MapReduce using Apache Spark. SOFTENG751 High Performance Computing (A+)

Language: Scala - Size: 46.5 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

koushikvikram/reddit-organization-sentiment

🤷👍👎 Extracting organization names using Named Entity Recognition (NER) and performing Sentiment Analysis on their Reddit posts.

Language: Jupyter Notebook - Size: 975 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0