## 2018 Big Data Analytics textbook

Welcome! Here is the textbook for IRGN 453, which starts in April 2018. Other information from last year's course is available on this website. The required textbook is Data Mining for Business Analytics: Concepts, Techniques, and Applications in R by Galit Shmueli et al. It's a good survey of the topic. It uses…

## Project idea: Russian Twitter disinformatzia posts

NBC has a nice database of Russian disinformation tweets. If you are interested in Twitter or politics, it might provide the data for an interesting course project. Twitter deleted 200,000 Russian troll tweets. Read them here. Twitter makes it hard to find them, but NBC got some help putting together this data set. By the way,…

## Articles about “data science”

The next iteration of my course starts on April 2, 2018. For people who are baffled by all the buzzwords and conflicting advice (and who isn't?), I'm going to post some article links here. It will be a potpourri. When the course starts, I may go back and reorganize the material by topic. I…

## Predicting polluted swimming

Many cities should be doing something like this, including my hometown of San Diego. Measuring water quality is vital, but the results can take days. Doing this for another city could make a good class project, since historical data is often available. These developers have also turned it into a "beat the forecasters" game…

## Staying familiar with R analytics

I subscribe to an email list called R-bloggers. Every day it summarizes ~5 blog posts about statistical analysis using R. Roughly one third are about some function or feature of R. The second, more interesting, third are short cases about a wide variety of topics that someone has analyzed: sports results, NAFTA, analyzing financial…

## Chart Relationship diagram from Financial Times

This diagram of about 80 kinds of charts, with clear explanations of their purposes, is impressive. It is the most comprehensive such list I have seen, and it's quite easy to understand. I have not looked for an R/ggplot version of this, but if one does not exist yet, I suspect someone will soon create it. Here…