An open API service providing repository metadata for many open source software ecosystems.

GitHub / CamilaJaviera91 / sql-mock-data

Generate a synthetic dataset with one million records of employee information from a fictional company, load it into a PostgreSQL database, create analytical reports using PySpark and large-scale data analysis techniques, and implement machine learning models to predict trends in hiring and layoffs on a monthly and yearly basis.
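
The first and last steps of that description (generating synthetic employee records, then summarizing hiring and layoffs by month) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the repository's actual code: the topics suggest the project uses Faker, PostgreSQL and PySpark, whereas the name pools, field names, 30% termination rate, date range and the helper functions `generate_employees` and `monthly_counts` here are all assumptions made for the example.

```python
import random
from collections import Counter
from datetime import date, timedelta

# Hypothetical name pools; the real project likely uses Faker for this.
FIRST_NAMES = ["Ana", "Luis", "Marta", "Diego", "Sofia"]
LAST_NAMES = ["Rojas", "Soto", "Vega", "Mora", "Pinto"]
DEPARTMENTS = ["Sales", "Engineering", "Finance", "HR"]


def generate_employees(n, seed=0, start=date(2015, 1, 1), end=date(2024, 12, 31)):
    """Return n synthetic employee records as dictionaries (assumed schema)."""
    rng = random.Random(seed)  # seeded so the output is reproducible
    span_days = (end - start).days
    records = []
    for employee_id in range(1, n + 1):
        hire_date = start + timedelta(days=rng.randint(0, span_days))
        # Assumption: roughly 30% of employees have left the company.
        termination_date = None
        if rng.random() < 0.3:
            remaining = (end - hire_date).days
            termination_date = hire_date + timedelta(days=rng.randint(0, remaining))
        records.append(
            {
                "employee_id": employee_id,
                "first_name": rng.choice(FIRST_NAMES),
                "last_name": rng.choice(LAST_NAMES),
                "department": rng.choice(DEPARTMENTS),
                "hire_date": hire_date,
                "termination_date": termination_date,
            }
        )
    return records


def monthly_counts(records, field):
    """Count records per (year, month) for a date field, skipping missing values."""
    return Counter(
        (r[field].year, r[field].month) for r in records if r[field] is not None
    )


employees = generate_employees(1000)
hires = monthly_counts(employees, "hire_date")
layoffs = monthly_counts(employees, "termination_date")
```

At the scale the description mentions (one million records), the same records would more plausibly be written to PostgreSQL and aggregated with PySpark rather than held in a Python list; the in-memory version above is only meant to show the shape of the data and of the monthly aggregation.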

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CamilaJaviera91%2Fsql-mock-data
PURL: pkg:github/CamilaJaviera91/sql-mock-data

Stars: 1
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 217 MB
Dependencies parsed at: Pending

Created at: 4 months ago
Updated at: 3 months ago
Pushed at: 3 months ago
Last synced at: 17 days ago

Topics: connection, faker, locale, logging, matplotlib, os, postgresql, psycopg2, pyspark, pyspark-sql, python, random, random-python, shutil, sparksession, sql, sys, unicode
