Project Ideas

Team 3 | CIS 4400 | 5:50 pm Tues/Thur

      1. MD Abir A. Choudhury – mdabir.choudhury@baruchmail.cuny.edu
      2. Steven Amadou – steven.amadou@baruchmail.cuny.edu
      3. Luan Da Silva – luan.dasilva@baruchmail.cuny.edu
      4. Pete Destil – pete.destil@baruchmail.cuny.edu

First Idea Dump: Wellbeing of Restaurants in NYC

The goal of this project is to build a health data warehouse for analyzing the health of cities through restaurant data. There are several ways to approach this, but we theorize that analyzing the restaurant rating dataset can give us a health outlook for the city. We also want to bring in a rodent population dataset for the city and relate it to the health of the city's restaurants. It would be interesting to see whether the population density of rodents in a neighborhood correlates with the health of the restaurants in that neighborhood.
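
As a rough illustration of the neighborhood-level comparison described above, the following is a minimal sketch, not a finished analysis. It assumes both datasets can be reduced to rows keyed by ZIP code. The field names `zipcode`, `score`, and `active` are hypothetical stand-ins for whatever columns the real exports use, and the rows are placeholder values, not real data.

```python
from collections import defaultdict
from math import sqrt


def mean_by_key(rows, key, value):
    """Average `value` per `key`, skipping rows where either field is missing."""
    totals = defaultdict(lambda: [0.0, 0])
    for row in rows:
        if row.get(key) is None or row.get(value) is None:
            continue
        totals[row[key]][0] += float(row[value])
        totals[row[key]][1] += 1
    return {k: total / count for k, (total, count) in totals.items()}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Placeholder rows only (NOT real data). In the NYC inspection data a higher
# score means more violation points, so higher is worse.
restaurants = [
    {"zipcode": "10001", "score": 10},
    {"zipcode": "10001", "score": 14},
    {"zipcode": "10002", "score": 20},
    {"zipcode": "10003", "score": 30},
]
# Hypothetical flag: 1 if a rodent inspection found active signs, else 0.
rodents = [
    {"zipcode": "10001", "active": 0}, {"zipcode": "10001", "active": 0},
    {"zipcode": "10001", "active": 1}, {"zipcode": "10001", "active": 0},
    {"zipcode": "10002", "active": 1}, {"zipcode": "10002", "active": 0},
    {"zipcode": "10003", "active": 1}, {"zipcode": "10003", "active": 1},
    {"zipcode": "10003", "active": 0}, {"zipcode": "10003", "active": 1},
]

avg_score = mean_by_key(restaurants, "zipcode", "score")
rodent_rate = mean_by_key(rodents, "zipcode", "active")

# Compare only ZIP codes that appear in both datasets.
shared = sorted(set(avg_score) & set(rodent_rate))
r = pearson([avg_score[z] for z in shared], [rodent_rate[z] for z in shared])
print(f"ZIP codes compared: {len(shared)}, Pearson r = {r:.3f}")
```

A correlation like this would only suggest an association between the two measures, and the choice of ZIP code as the neighborhood unit is itself an assumption that the warehouse design may revisit.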

If time permits, we hope to extend this project with a mobile application built on the same problem. The application would show users a list of healthy restaurants near their location. Users would also be able to review the health of a restaurant to confirm whether the NYC ratings are accurate. If the city's rating does not match the users' reviews, the city would be able to go back to the restaurant and conduct another health inspection.
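
The location-based lookup could work roughly as follows. This is a minimal sketch under stated assumptions: each restaurant record carries a latitude, a longitude, and a letter grade, "healthy" means grade A, and the field names, restaurant names, and coordinates are all hypothetical placeholders rather than values from the dataset.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two lat/lon points."""
    lat1, lon1, lat2, lon2 = map(radians, (lat1, lon1, lat2, lon2))
    a = (sin((lat2 - lat1) / 2) ** 2
         + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


def nearby_healthy(restaurants, user_lat, user_lon, radius_km=1.0, grades=("A",)):
    """Names of restaurants with an accepted grade within radius_km, nearest first."""
    hits = []
    for r in restaurants:
        if r["grade"] not in grades:
            continue
        distance = haversine_km(user_lat, user_lon, r["lat"], r["lon"])
        if distance <= radius_km:
            hits.append((distance, r["name"]))
    return [name for _, name in sorted(hits)]


# Placeholder records only (NOT real restaurants or real grades).
sample = [
    {"name": "Placeholder Bistro", "grade": "A", "lat": 40.7440, "lon": -73.9830},
    {"name": "Placeholder Diner", "grade": "A", "lat": 40.7410, "lon": -73.9835},
    {"name": "Placeholder Grill", "grade": "B", "lat": 40.7405, "lon": -73.9830},
    {"name": "Placeholder Cafe", "grade": "A", "lat": 40.8000, "lon": -73.9500},
]

results = nearby_healthy(sample, 40.7400, -73.9830)
print(results)
```

The one-kilometer radius and the grade cutoff are arbitrary defaults here; a real application would likely let the user adjust both.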

Data Sources:

  1. NYC Open Data Restaurant Dataset:

    https://data.cityofnewyork.us/Health/DOHMH-New-York-City-Restaurant-Inspection-Results/43nn-pn8j

    https://data.cityofnewyork.us/Health/DOHMH-New-York-City-Restaurant-Inspection-Results/rs6k-p7g6

    26 Dimensions

  2. NYC Open Data Rodent Population Dataset:

    https://data.cityofnewyork.us/Health/Rodent-Inspection/p937-wjvj

    20 Dimensions

 

Second Idea Dump: Healthy eating, fried food consumption, and mortality

Goal: What is the relationship between weight or BMI and meal preparation patterns, consumption of fresh or fast food, and snacking patterns? Why do grocery shopping patterns differ by income?

Limiting the amount of fried food we consume has long been a common theme in North American culture. Most of the fried food that we consume comes from fast-food restaurants. Frying strips food of key nutrients and increases the formation of advanced glycation end products and acrylamide, which contribute to oxidative stress and inflammation. Across the different datasets, we want to look for relationships between healthy eating, income, and lifestyle to see who is at greater risk of developing diseases.

Data Sources:

  1. https://www.kaggle.com/jleibow27/fried-food-consumption-and-mortality
  2. https://www.kaggle.com/bls/eating-health-module-dataset

 

Third Idea Dump:

The Centers for Medicare & Medicaid Services has a dataset on quality of care at over 4,000 Medicare-certified hospitals. An interesting question to look at would be: how well can life expectancy be predicted from the quality of these Medicare-certified hospitals? What other factors, beyond good-quality versus poor-quality hospitals, can influence a country's life expectancy?

Data Sources:

  1. Life-expectancy dataset – cdc.gov/nchs/nvss/usaleep/usaleep.html#life-expectancy

    13 Dimensions

  2. CMS dataset – https://data.medicare.gov/data/hospital-compare#

    Undetermined number of Dimensions