- Hadoop is named after the toy elephant that the original developer's son was playing with when he started the project (based on Google's white papers on its distributed file system and MapReduce) ten years ago.
- The commercial distributions from Cloudera (and others) let you run data analysis and queries on Amazon servers (AWS = Amazon Web Services). This can be very expensive. For batch projects that can run at off-peak times, you can bid a low price for spare capacity (Spot Instances).
- Another version runs on Microsoft Azure (HDInsight). Microsoft provides a sensor data analysis example for it.
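- Pig is a scripting platform for querying and transforming data, developed at Yahoo. Its language, Pig Latin, plays a role similar to a shell scripting language for data (it is unrelated to Microsoft PowerShell), and its scripts are compiled into jobs that run on Hadoop.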
- Hadoop is used for batch analysis of large volumes of behavioral data (logs, clickstreams, and similar), not for processing transactional data. This is how a site such as Facebook, for example, can reportedly record your cursor position and use it to send you ads even though you haven't actually clicked on anything.
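
To make the batch-processing idea in these notes more concrete, here is a minimal sketch of the map, shuffle, and reduce steps that Hadoop's programming model is built around. It is plain Python running in one process, not Hadoop code, and it is only meant to show the shape of the computation. The event data and the function names (`map_phase`, `shuffle`, `reduce_phase`, `count_events`) are hypothetical and chosen for illustration.

```python
from collections import defaultdict

# Hypothetical behavioral events as (user_id, event_type) pairs. Not real data.
EVENTS = [
    ("alice", "hover"),
    ("bob", "scroll"),
    ("alice", "scroll"),
    ("alice", "hover"),
]


def map_phase(events):
    """Emit one ((user, event_type), 1) pair per raw event."""
    for user, event_type in events:
        yield (user, event_type), 1


def shuffle(pairs):
    """Group values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the counts collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def count_events(events):
    """Run the three steps in sequence on an in-memory list of events."""
    return reduce_phase(shuffle(map_phase(events)))


if __name__ == "__main__":
    for key, total in sorted(count_events(EVENTS).items()):
        print(key, total)
```

In a real cluster the map and reduce steps would run in parallel across many machines and the grouping would be handled by the framework. The sketch only illustrates why this style suits large, append-only behavioral logs better than individual transactional updates.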