Exploratory data analysis (EDA) is generally the first step in any data science project with the goal being to summarize the main features of the dataset. It helps the analyst gain a better understanding of the available data and often can unearth powerful insights. Data visualization is the most common technique in EDA. During this post, I...
Frequently, we encounter projects that require the combined use of Python, Microsoft Excel and some external databases that can only be accessed via Excel, or use cases that require the end product to be output to that format. Excel is still used as a key program for the vast majority of businesses and we are often challenged to create...
Our team recently designed a dashboard using R Shiny Leaflet allowing users to select many locations at one go on an interactive map. We created the map using the package leaflet.extras, which enables users to draw shapes on R Shiny Leaflet maps. When combined with the package sp and a function called findLocations, the leaflet.extras drawing...
The apply function in R is used as a fast and simple alternative to loops. It allows users to apply a function to a vector or data frame by row, by column or to the entire data frame. Below are a few basic uses of this powerful function as well as one of it’s sister functions lapply. There are other functions in the apply family (sapply,...
There were a lot of predictions from pundits in the lead up to the 2016 Election, many of which predicted a Hillary Clinton victory. We now know that Donald Trump earned the victory with key wins in a handful of surprise swing states. With this still settling in for some, we took a hard look at the data behind his victory and what drove his...
This year, the Red Oak Strategic team decided to undertake a new challenge in the world of polling and analytics : conduct a bi-weekly, public, national survey that we would execute and release using the Google Surveys platform. Beginning in August and continuing through the final week of the 2016 election, our partnership with GCS led to the...
Political polling faces a crisis of confidence. Major news outlets repeatedly ask “What’s the matter with polling?” after major misses like the Bernie Sanders’s primary upset in Michigan, where he beat Hillary Clinton 50–48 despite the fact that she was leading by up to 20 points in reputable polls. There is, however, hope for a polling...
Last Saturday, in what has now been widely publicized and discussed, Uber and Lyft lost an effort, Proposition 1, that would have rolled back a number of regulations on their services. As a result, in one of America’s most forward-thinking tech centers, the services stopped operating almost immediately.
Background Recognizing an opportunity to expand...
Time and again, across Red Oak Strategic’s...
The pace of our modern world, and the impressive...
While it might be tempting to liven up a report...
Interaction Design for Data Exploration...
- 2016 Election
- Apache Spark
- Business Intelligence
- Case Studies
- Data Processing
- Data Science
- Data Visualization
- Donald Trump
- Exploratory Data Science
- Financial Analytics
- Hillary Clinton
- Machine Learning
- Political Analytics
- Predictive Analytics
- Private Equity
- Python 3
- R Shiny
- Sparkling Water
- Time Series