B. Demo of Unix Programming - continued A Data Set ------------- Say it was several years ago and this website was being planned. In preparation, someone typed the printed scheduled into a spreadsheet and produced this plain-text file: We look at the sched file: a list of all arrivals/departures Compare these events to entries in the schedule. A trip is sequence of these events. We can see all trains stopping in brockton by searching for stn=brockton We can see all stops on train 017 by searching for TR=017 Operations on this Data Set --------------------------- - clean it (find typos, missing data, format errors..) - analyze it: get stats and info - report: generate schedules - search : trip planning - webify : build remote access to reports, stats, and searches How Would You Do it? ---------------------- Some of you have worked with datasets. What tools would you use, given this plain text file, to clean, analyze, report, and provide web access to this data set? How long would it take?