Skip to content

some data science projects in R, including reports in markdown format, codes, output plots & models.

Notifications You must be signed in to change notification settings

pariyamd/datascience_analyze_and_report

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

datascience_analyze_and_report

some data science projects in R, including reports in markdown format, codes, output plots & models.

This repo includes three of my datascience projects:

A Passage on Literature

this report includes questions about charles dickens' books & novels.

  1. most used top 20 words in dickens’ books
  2. wordcloud of dickens’ books
  3. five most used names in dickens’ books
  4. emtion (positivity and negativity in words) time series in les miserables
  5. most used verbs for men and women in dickens’ books
  6. unigram vs bigram distributions per chapter/ per book
  7. unigram, bigram distributions comparison between austins' and dickens' novels

Murder or Death

this data is death records from 90's to 2017 in this report i've illustrated some of the features to better understand the data:

  1. correlation of main attributes
  2. ratio of murder/suicide based on race, sex, age and education.

Then I've trained a Generalized linear model using H2o and plotted the False Positives and False negatives overally and detailed( based on race and sex ). and then some plots are drawn for better understanding the model such as DOC or acc/threshold. then i try to tune the depth of model and based on the best depth i tune other parameters. this model is later used in a shiny app to help judges better predict wheather a death was caused by a murder or suicide

Mobile Devices

in this report you can see:

  1. top mobile device manufacturers
  2. evolution of mobile dimensions in time
  3. Box plot of device thickness based on headphone jack feature
  4. evolution of ppi of deviced in timeage
  5. how to filter old nokia phones!
  6. which devices can float on water?
  7. correlation of battery capaity and weight
  8. Samsung brand series prices change over time
  9. Competition of companies and the amount of their production over time
  10. Growth rate and changes of price, RAM and camera size
  11. Mobile phone area change chart

Once upon a time in Netflix

You can also check out this post

About

some data science projects in R, including reports in markdown format, codes, output plots & models.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages