An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-projects

Noissey24/Applied-Data_Science-Capstone-Project

This is the final project I developed for the Coursera's IBM Data Science Professional Certificate, from Data Collection to the Presentation of valuable Insights.

Language: Jupyter Notebook - Size: 3.03 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

vishalv91/Global-Trade-Analytics

The objective of the project was to create innovative and interactive Tableau dashboards that focus on potential commodities, countries, year, trade amount and quantity. The client wanted to launch a new business unit, focusing on global trade and logistics, majorly in the countries such as USA, Canada and Australia The dataset provided by the client contained 59090 observations of 10 variables. The client insisted the data to be cleaned using Excel or R. The Dataset contained missing values and was cleaned using the R programming language. Tableau dashboards were created from the cleaned dataset.

Language: R - Size: 6.69 MB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 43 - Forks: 11

abhi-deshpande/guided-analytics-projects

These repository includes all my data analytics projects that were basically completed by following their respective tutorials.

Size: 6.31 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

lsauchanka/Project-3

Analiza danych z portalu stackexchange.com.

Language: R - Size: 4.11 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

vishalv91/capstoneproject-realestate

Data: Boston Housing Dataset (HousingData.csv) Programming language(s): R Tool(s): RStudio Business problem: To understand the drivers behind the value of houses in Boston and provide data-driven recommendation to the client on how they can increase the value of housing.The Boston housing dataset consisted of 506 observations and 14 variables. Project challenge(s): MEDV (Median value of homes in Boston) was identified as the dependent variable. While the rest, were the independent variables. The goal was to find out which among the independent variables were statistically significant in driving the house prices (MEDV). The dataset consisted of missing values and outliers. Some of the variables had a skewed distribution. There was multicollinearity among few independent variables. Our Approach: Prior to model building, we tidied up our dataset by eliminating the rows that contained missing values. Replacing the missing values with median and mean of those variables were also done. Considering the three approaches, median imputation(replacing missing values with mean) was found to be the best approach. As the dependent variable "MEDV" (median value of houses) was continuous(numerical) in nature, we implemented the Multiple linear regression to build our model. Additional models were built from Decision trees and Random forest. On further investigation, we discovered that the dependent variable had a skewed distribution. By log transformation of this variable, we were able to get a normal distribution. Post transformation, we found out that the model built from Multiple linear regression with log transformed MEDV was the best in terms of MSE (Mean squared error) value and Adjusted R^2. All the assumptions of linear regression were met.

Language: R - Size: 1.63 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 5

thao-phan23/Proposal_for_image_search_feature_for_NITA_fashion_ecommerce_company

Implemented an image search feature on the website to improve the experience of 35% of customers dissatisfied with keyword searches. This enhancement incorporated the CNN model with 95% accuracy for image classification and the Siamese model for the top 5 similar products, leveraging the data set of 24,000+ images across 10 different classes.

Language: Jupyter Notebook - Size: 78.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

iweld/data-analyst-job-postings

A PostgreSQL project using a dataset that pulls job postings from Google's search results for Data Analyst positions in the United States. Dataset created by Luke Barousse.

Size: 69.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Kashish-Rastogi/Tableau

A portfolio of useful Tableau visualizations and dashboard are located in this repository. The README.md file within this repo contains a summary of each of the different workbooks.

Size: 4.23 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

jossus657/hack4la-projects

Portfolio of projects taken on by working for Hack for LA. As a member of the Data Science Community of Practice Team at Hack for LA, data science and analysis projects are taken on through a lens of community improvement and service. Code and datasets will be organized based on their respective project.

Language: Jupyter Notebook - Size: 6.52 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scaredmeow/template-py-data-project

Template for open-source and personal project related to data using python. This will work best if you are working with teams!

Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

tinoswe/tinoswe.github.io

My Jekyll site

Language: HTML - Size: 915 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

kirkdotcam/dummydataproject2

Language: Python - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

delaney-data/SQL-CreateTablesImport

A demonstration of how to create tables in PostgreSQL and import data for analysis.

Size: 65.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

delaney-data/SQL-DataAnalysis

Using SQL to build insights and analysis.

Size: 62.5 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

simonschoe/dynamic-programming-with-rmarkdown

This lecture is part of the "Machine Learning in R" graduate course held at University of Münster, School of Business and Economics (winter term 2021/22). :mortar_board:

Language: HTML - Size: 33.2 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

kirkdotcam/dummydataproject1

Dummy project to act as a scaffold example for project 1

Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 3