Yesterday (October 14, 2012), Felix Baumgartner made history by becoming the first person to break the speed of sound during a free fall. He also set some other records (e.g., longest free fall, etc.) during the Red Bull Stratos Mission–which was broadcast live on the internet. Kind of cool, but imagine the conversation that took place daydreaming this one…
Red Bull Creative Person: What if we got some idiot to float up into the stratosphere in a space capsule and then had him step out of it and free fall four minutes breaking the sound barrier?
Another Red Bull Creative Person: Great idea! Lets’ also broadcast it live on the internet.
Well anyway, after the craziness ensued, It was suggested on Facebook that, “I think this data should be on someone’s blog!”. Rising to the bait, I immediately looked at the mission page, but the data was no longer there. Thank goodness for Wikipedia [Red Bull Stratos Mission Data]. The data can be copied and pasted into an Excel sheet, or read in to R using the readHTMLTable() function from the XML package.
mission <- readHTMLTable( doc = "http://en.wikipedia.org/wiki/Red_Bull_Stratos/Mission_data", header = TRUE )
We can then write it to an external file, I called it Mission.csv and put it on my desktop, using the
write.csv(mission, file = "/Users/andrewz/Desktop/Mission.csv", row.names = FALSE, quote = FALSE )
Opening the new file in a text editor, we see some issues to deal with (these are also apparent from looking at the data on the Wikipedia page).
The first line is the first table header, Elevation Data, which spanned three columns in the Wikipedia page. Delete it.
The last row are the re-printed variable names. Delete it.
Change the variable names in the current first row to be statistical software compliant (e.g., remove the commas and spaces from each variable). My first row looks like the following:
Remove the commas from the values in the last column. With a comma separated value (CSV) file, they are trouble.
There are nine rows which have parentheses around their value in the last column. I don’t know what this means. For now, I will remove those values.
The file can be downloaded here.
Then you can plot (or analyze) away to your heart’s content.
# read in data to R mission <- read.csv(file = "/Users/andrewz/Desktop/Mission.csv") # Load ggplot2 library library(ggplot2) # Plot speed vs. time ggplot(data = mission, aes(x = Time, y = Speed)) + geom_line() # Plot elevation vs. time ggplot(data = mission, aes(x = Time, y = Elevation)) + geom_line()
Since I have no idea what these really represent other than what the variable names tell me, I cannot interpret these very well. Perhaps someone else can.