Skip to content
This repository has been archived by the owner on Sep 24, 2024. It is now read-only.

Commit

Permalink
update project summary
Browse files Browse the repository at this point in the history
  • Loading branch information
nickeubank committed Oct 24, 2023
1 parent e807168 commit d70958a
Show file tree
Hide file tree
Showing 8 changed files with 17 additions and 6 deletions.
Binary file modified docs/html/.doctrees/environment.pickle
Binary file not shown.
Binary file modified docs/html/.doctrees/exercises/Exercise_dask.doctree
Binary file not shown.
2 changes: 1 addition & 1 deletion docs/html/sitemap.xml
Original file line number Diff line number Diff line change
@@ -1 +1 @@
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"><url><loc>http://cm4ss.comdistributed_starting_dask_cluster.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_bigdata.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_cloud_dukesubscription.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_dask.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_dask_realdata.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_git.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_jupyterlab.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_scikit_learn.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_sklearn.html</loc></url><url><loc>http://cm4ss.comindex.html</loc></url><url><loc>http://cm4ss.comparallelism.html</loc></url><url><loc>http://cm4ss.compr_review.html</loc></url><url><loc>http://cm4ss.comgenindex.html</loc></url><url><loc>http://cm4ss.comsearch.html</loc></url></urlset>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"><url><loc>http://cm4ss.comdistributed_starting_dask_cluster.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_cloud_dukesubscription.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_dask.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_dask_realdata.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_git.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_jupyterlab.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_scikit_learn.html</loc></url><url><loc>http://cm4ss.comexercises/Exercise_sklearn.html</loc></url><url><loc>http://cm4ss.comindex.html</loc></url><url><loc>http://cm4ss.comparallelism.html</loc></url><url><loc>http://cm4ss.compr_review.html</loc></url><url><loc>http://cm4ss.comgenindex.html</loc></url><url><loc>http://cm4ss.comsearch.html</loc></url></urlset>
11 changes: 11 additions & 0 deletions lesson_plan_notes/2023/2023_10_24.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
Feedback:

Pros of class:

- hands-on assignments / learning by doing
- Working in randomized pairs throughout the semester so far has really helped me get to know my classmates and share knowledge with each other.
- I like the partner activities... when we actually get to do them in class together. The independent ones don't feel like I am learning as much.

Could be improved:

- Pairing MIDS people with non MIDS people can be quite tough as non MIDS people have different availability and may have different working styles that can start to be detrimental rather than useful especially when the differences create a communication barrier with your randomly assigned pair and you have a hard time working with them.
Binary file modified opioid_project/PDS_ProjectSummary.pdf
Binary file not shown.
10 changes: 5 additions & 5 deletions opioid_project/PDS_ProjectSummary.tex
Original file line number Diff line number Diff line change
Expand Up @@ -123,11 +123,11 @@ \subsection*{Data}

\subsubsection*{Opioid Prescriptions}

A core component of this project will be a recently released dataset of all prescription opioid drug shipments in the United States from 2006 to 2014. This was only recently released (in 2020) by the \emph{Washington Post}, which obtained the data through a Freedom of Information Act (FOIA) request to the US Drug Enforcement Agency.
A core component of this project will be a recently released dataset of all prescription opioid drug shipments in the United States from 2006 to 2019. This was only recently released (in 2020) by the \emph{Washington Post}, which obtained the data through a Freedom of Information Act (FOIA) request to the US Drug Enforcement Agency.

\begin{itemize}
\item \href{https://www.washingtonpost.com/graphics/2019/investigations/dea-pain-pill-database/}{Read about data here.}
\item \href{https://www.washingtonpost.com/national/2019/07/18/how-download-use-dea-pain-pills-database/?arc404=true}{Download here.}
\item \href{https://www.washingtonpost.com/national/2019/07/18/how-download-use-dea-pain-pills-database/?arc404=true}{Download here. Please use the \emph{Click here to download the national data from The Washington Post} link in the fourth bullet point under ``How to download this data'' to get the most updated data.}
\item \href{https://github.com/wpinvestigative/arcos-api/blob/master/data/data_dictionary.csv}{More about variables in the data here.}
\end{itemize}

Expand All @@ -137,7 +137,7 @@ \subsubsection*{Vital Statistics Mortality Data}

Several notes about this data:
\begin{itemize}
\item I've made a couple of small changes to this data, but they are very small. Basically, this is the raw data (in the format originally provided) by the US Vital Statistics system. So beware of formatting and cleanliness issues.
\item I've made a couple of small changes to this data, but they are very small. This is the raw data (in the format originally provided) by the US Vital Statistics system. So beware of formatting and cleanliness issues.
\item For privacy, the US Vital Statistics Agency censors some data. If the number of people in a given category (i.e. one county / year / cause of death category) is less than 10, that data does \emph{not} appear in the data. Similarly, zero counts are also not reported. So if a county has only 2 deaths in a given year, that county just doesn't appear in the data for a given year. And if a county has 20 deaths unrelated to drugs or alcohol, but 7 deaths due to overdose, the former statistic will appear in the data, but not the latter.
\begin{itemize}
\item It is for this reason that we will use \emph{annual} data on mortality -- by summing deaths over full years, fewer counties end up near below this 10-death threshold, so the data is more complete.
Expand Down Expand Up @@ -249,7 +249,7 @@ \subsection*{Your Task}
\item Your analysis,
\item Your interpretation of that analysis.
\end{itemize}
\item In the report for a policy maker (someone who is NOT trained in statistics), include:
\item In the report for a policymaker (someone who is NOT trained in statistics), include:
\begin{itemize}
\item The motivation for the project,
\item Overview of the data being used,
Expand All @@ -263,7 +263,7 @@ \subsection*{Your Task}

\subsection*{Due Dates}
\begin{itemize}
\item An outline of your project strategy is due on November 8th.
\item An outline of your project strategy is due on November 7th.
\begin{itemize}
\item In writing this strategy, use a ``backwards design'' organizational scheme: Start by establishing what you want to achieve (i.e. the plots I've already specified). Then ask: What dataset do I need to make these plots? Or more specifically:
\begin{itemize}
Expand Down
Binary file added source/class_schedule.xlsx
Binary file not shown.
Binary file modified source/class_schedule_xlsx.xlsx
Binary file not shown.

0 comments on commit d70958a

Please sign in to comment.