Data Exercises

Setup

We will start on these in class; you will finish them outside of class.
cleanData .csv
 
Save the output to a file and then use that .psv file instead of the original one.

The delimiter for this file is a pipe (|). Be sure to use single quotes to protect it from the shell:
-d '|'
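
As a quick illustration of why the quotes matter, here is a minimal sketch. The file name sample.psv and its columns are made up for the example; your real file will differ. Without the quotes, the shell reads the bare | as a pipeline operator, so cut never receives its delimiter.

```shell
# Hypothetical sample data; the real .psv file will have different columns.
printf 'county|town|price\nDublin|Swords|250000\nCork|Cobh|180000\n' > sample.psv

# Quoted: cut receives the pipe as its delimiter and prints the first field.
cut -d '|' -f 1 sample.psv

# sort takes its field separator with -t; the same quoting applies.
# tail -n +2 skips the header line before sorting numerically on field 3.
tail -n +2 sample.psv | sort -t '|' -k 3,3n
```

Note that cut uses -d for the delimiter while sort uses -t; in both cases the pipe is quoted the same way.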

Written Assignment

Answer these questions directly in your email submission. Please copy the questions into your email, followed by your answer.

  1. Give a brief explanation of 2 useful/cool tips you learned about Excel.
  2. Why does "Dublin 10" get sorted before "Dublin 2"?
  3. What kind of file is your downloaded csv data? Show the output from the command you use to find out.
  4. What is CRLF?
  5. Give a brief description of 2 questions to ask of the data; one should be easier in Linux and one should be easier in Excel. Note: I do not mean because Excel can create charts or other graphics; it should be a basic question about the data, for instance, how many counties are there? And, no, you cannot use this as one of your 2 questions. :)

    For each, briefly explain why it is easier in the specific tool.

Programs

For the programs, attach your files to your submission email.

Very Optional!