Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-projects

vishalv91/Global-Trade-Analytics

The objective of the project was to create innovative, interactive Tableau dashboards covering candidate commodities, countries, years, trade amounts, and quantities. The client wanted to launch a new business unit focused on global trade and logistics, mainly in the USA, Canada, and Australia. The dataset provided by the client contained 59,090 observations of 10 variables. The client required the data to be cleaned using Excel or R. The dataset contained missing values and was cleaned using the R programming language. Tableau dashboards were then created from the cleaned dataset.

Language: R - Size: 6.69 MB - Last synced: about 23 hours ago - Pushed: over 6 years ago - Stars: 35 - Forks: 11

abhi-deshpande/guided-analytics-projects

This repository includes all my data analytics projects, which were completed by following their respective tutorials.

Size: 6.31 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

lsauchanka/Project-3

Analysis of data from the stackexchange.com portal.

Language: R - Size: 4.11 MB - Last synced: 3 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

vishalv91/capstoneproject-realestate

Data: Boston Housing Dataset (HousingData.csv). Programming language(s): R. Tool(s): RStudio. Business problem: To understand the drivers behind the value of houses in Boston and provide data-driven recommendations to the client on how they can increase the value of housing. The Boston housing dataset consisted of 506 observations and 14 variables. Project challenge(s): MEDV (median value of homes in Boston) was identified as the dependent variable, and the remaining variables were treated as independent variables. The goal was to find out which of the independent variables were statistically significant in driving house prices (MEDV). The dataset contained missing values and outliers. Some of the variables had a skewed distribution. There was multicollinearity among a few independent variables. Our approach: Prior to model building, we tidied up our dataset by eliminating the rows that contained missing values. We also tried replacing the missing values with the median and with the mean of the affected variables. Of the three approaches, median imputation (replacing missing values with the median) was found to be the best. As the dependent variable "MEDV" (median value of houses) was continuous (numerical) in nature, we used multiple linear regression to build our model. Additional models were built with decision trees and random forest. On further investigation, we discovered that the dependent variable had a skewed distribution. By log-transforming this variable, we were able to get an approximately normal distribution. After the transformation, we found that the multiple linear regression model with log-transformed MEDV was the best in terms of MSE (mean squared error) and adjusted R^2. All the assumptions of linear regression were met.

Language: R - Size: 1.63 MB - Last synced: 4 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 5
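
The approach described for vishalv91/capstoneproject-realestate (median imputation, a log-transformed MEDV, and multiple linear regression) can be sketched roughly as follows. This is a minimal illustration in Python rather than the repository's own R code; the function names and the small normal-equation solver are assumptions made for this example and are not taken from the project.

```python
import math
import statistics


def impute_median(column):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in column if v is not None]
    med = statistics.median(observed)
    return [med if v is None else v for v in column]


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations apply to both sides at once.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_log_linear(features, target):
    """Ordinary least squares on log(target).

    `features` is a list of equally long columns; the result is
    [intercept, slope_1, slope_2, ...] on the log scale.
    """
    rows = [[1.0] + list(r) for r in zip(*features)]
    y = [math.log(t) for t in target]
    k = len(rows[0])
    # Normal equations: (X^T X) beta = X^T y.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(coeffs, row):
    """Predict on the original scale by exponentiating the log-scale fit."""
    return math.exp(coeffs[0] + sum(c * v for c, v in zip(coeffs[1:], row)))
```

Calling `impute_median` on each feature column before `fit_log_linear` mirrors the order described above: fill the gaps first, then fit on the log of the target. The hand-rolled solver only keeps the sketch dependency-free; in practice a statistics package would normally do the fitting.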

thao-phan23/Proposal_for_image_search_feature_for_NITA_fashion_ecommerce_company

Implemented an image search feature on the website to improve the experience of the 35% of customers dissatisfied with keyword searches. This enhancement incorporated a CNN model with 95% accuracy for image classification and a Siamese model to find the top 5 similar products, using a dataset of 24,000+ images across 10 different classes.

Language: Jupyter Notebook - Size: 78.7 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

iweld/data-analyst-job-postings

A PostgreSQL project using a dataset that pulls job postings from Google's search results for Data Analyst positions in the United States. Dataset created by Luke Barousse.

Size: 69.3 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 3 - Forks: 0

Kashish-Rastogi/Tableau

A portfolio of useful Tableau visualizations and dashboards is located in this repository. The README.md file within this repo contains a summary of each of the different workbooks.

Size: 4.23 MB - Last synced: 9 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

lukzmu/data-science

Collection of courses and projects related to Data Science.

Language: Jupyter Notebook - Size: 137 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

jossus657/hack4la-projects

Portfolio of projects undertaken while working for Hack for LA. As a member of the Data Science Community of Practice Team at Hack for LA, I take on data science and analysis projects through a lens of community improvement and service. Code and datasets are organized by project.

Language: Jupyter Notebook - Size: 6.52 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

scaredmeow/template-py-data-project

Template for open-source and personal data projects using Python. It works best if you are working in a team!

Language: Python - Size: 22.5 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 3 - Forks: 0

kirkdotcam/dummydataproject2

Language: Python - Size: 8.79 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

delaney-data/SQL-CreateTablesImport

A demonstration of how to create tables in PostgreSQL and import data for analysis.

Size: 65.4 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

delaney-data/SQL-DataAnalysis

Using SQL to build insights and analysis.

Size: 62.5 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1

simonschoe/dynamic-programming-with-rmarkdown

This lecture is part of the "Machine Learning in R" graduate course held at the University of Münster, School of Business and Economics (winter term 2021/22). 🎓

Language: HTML - Size: 33.2 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

kirkdotcam/dummydataproject1

Dummy project to act as a scaffold example for project 1

Language: Jupyter Notebook - Size: 27.3 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 3