Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / Akankshaaaa / Crawling-and-Analysis-of-Internship-Job-Portal
The project explores internships and jobs in the current market. The dataset is built by scraping publically available web pages of leading websites, Internshala and Monster India, as well as merging well known public dataset - stackoverflow developer survey from the years 2015 to 2020. We performed extensive data exploration to derive some insightful conclusions based on current trends in the software industry.
Stars: 0
Forks: 0
Open Issues: 0
License: None
Language: Jupyter Notebook
Repo Size: 20.3 MB
Dependencies:
128
Created: almost 3 years ago
Updated: 27 days ago
Last pushed: almost 3 years ago
Last synced: 27 days ago
Topics: data-mining, data-modeling, eda, jupyter-notebook, python
Files
Dependencies
- Automat20.2.0 *
- Jinja23.0.1 *
- MarkupSafe2.0.1 *
- PackageVersion *
- Pillow8.2.0 *
- Protego0.1.16 *
- PyDispatcher2.0.5 *
- PyPrind2.11.3 *
- PyYAML5.4.1 *
- Pygments2.9.0 *
- Scrapy2.5.0 *
- Send2Trash1.5.0 *
- Twisted21.2.0 *
- appnope0.1.2 *
- argon2-cffi20.1.0 *
- async-generator1.10 *
- attrs21.2.0 *
- backcall0.2.0 *
- beautifulsoup44.9.3 *
- bleach3.3.0 *
- certifi2021.5.30 *
- cffi1.14.5 *
- chardet4.0.0 *
- click8.0.1 *
- colorama0.4.4 *
- configparser5.0.2 *
- constantly15.1.0 *
- crayons0.4.0 *
- cryptography3.4.7 *
- cssselect1.1.0 *
- cycler0.10.0 *
- decorator4.4.2 *
- defusedxml0.7.1 *
- entrypoints0.3 *
- gensim4.0.1 *
- h23.2.0 *
- hpack3.0.0 *
- hyperframe5.2.0 *
- hyperlink21.0.0 *
- idna2.10 *
- incremental21.3.0 *
- ipykernel5.5.5 *
- ipython-genutils0.2.0 *
- ipython7.24.0 *
- ipywidgets7.6.3 *
- itemadapter0.2.0 *
- itemloaders1.0.4 *
- jedi0.18.0 *
- jellyfish0.8.2 *
- jmespath0.10.0 *
- joblib1.0.1 *
- jsonschema3.2.0 *
- jupyter-client6.1.12 *
- jupyter-contrib-core0.3.3 *
- jupyter-contrib-nbextensions0.5.1 *
- jupyter-core4.7.1 *
- jupyter-highlight-selected-word0.2.0 *
- jupyter-latex-envs1.4.6 *
- jupyter-nbextensions-configurator0.4.1 *
- jupyterlab-pygments0.1.2 *
- jupyterlab-widgets1.0.0 *
- kiwisolver1.3.1 *
- lxml4.6.3 *
- matplotlib-inline0.1.2 *
- matplotlib3.4.2 *
- mistune0.8.4 *
- msedge-selenium-tools3.141.3 *
- nbclient0.5.3 *
- nbconvert6.0.7 *
- nbformat5.1.3 *
- nest-asyncio1.5.1 *
- networkx2.5.1 *
- nltk3.6.2 *
- notebook6.4.0 *
- numpy1.20.3 *
- packaging20.9 *
- pandas1.2.4 *
- pandocfilters1.4.3 *
- parsel1.6.0 *
- parso0.8.2 *
- pexpect4.8.0 *
- pickleshare0.7.5 *
- pip21.1.1 *
- plotly4.14.3 *
- priority1.3.0 *
- prometheus-client0.10.1 *
- prompt-toolkit3.0.18 *
- ptyprocess0.7.0 *
- pyOpenSSL20.0.1 *
- pyasn1-modules0.2.8 *
- pyasn10.4.8 *
- pycparser2.20 *
- pyparsing2.4.7 *
- pyrsistent0.17.3 *
- python-dateutil2.8.1 *
- pytz2021.1 *
- pyzmq22.1.0 *
- queuelib1.6.1 *
- regex2021.4.4 *
- requests2.25.1 *
- retrying1.3.3 *
- scikit-learn0.24.2 *
- scipy1.6.3 *
- seaborn0.11.1 *
- segtok1.5.10 *
- selenium3.141.0 *
- service-identity21.1.0 *
- setuptools56.0.0 *
- six1.16.0 *
- smart-open5.1.0 *
- soupsieve2.2.1 *
- tabulate0.8.9 *
- terminado0.10.0 *
- testpath0.5.0 *
- textblob0.15.3 *
- threadpoolctl2.1.0 *
- tornado6.1 *
- tqdm4.61.0 *
- traitlets5.0.5 *
- urllib31.26.5 *
- w3lib1.22.0 *
- wcwidth0.2.5 *
- webdriver-manager3.4.2 *
- webencodings0.5.1 *
- widgetsnbextension3.5.1 *
- wordcloud1.8.1 *
- yake0.4.8 *
- zope.interface5.4.0 *