layout | title | subtitle |
---|---|---|
page |
Working With Data on the Web |
Reference |
- Many sites make data available for download via URLs that are formatted in specific ways.
- Use the
requests
library to download data in Python programs.
- Use the
io
library to treat text as input or output files. - Use the
csv
library to read comma-separated values.
- Write tests.
- Use the
pyplot
library frommatplotlib
for simple visualizations. - Viewing data is often the first step toward understanding it.
- Publish data by putting files with predictable names in a publicly-accessible location.
- Create a machine-readable index to explicitly tell people what data sets are available.
Representational State Transfer (REST): : a set of patterns for sharing data on the web.
comma-separated values (CSV): : A common textual representation for tables in which the values in each row are separated by commas.
escape sequence:
: A sequence of characters used to represent another character.
For example,
the two-letter escape sequence \n
represents a newline character in Python,
while the multi-letter escape sequence &
respresents an ampersand in HTML.
index: : A document or data set that contains information about, and pointers to, actual data sets. An index contains the metadata that makes actual data findable.
silent failure: : Failing without producing any warning messages. Silent failures are hard to detect and debug.
status code: : A numerical value that indicates whether a function or other procedure succeeded, or if it failed, why.
unit testing tool: : A software library and associated tool or tools that helps programmers write short tests for their code and run them systematically.