Data Carpentry with Python for interns

NMNH

16-17 Jul 2024

1:00 pm - 4:00 pm EDT

Instructors: Adam Mansur, Mike Trizna

Helpers: Carpentries Instructor Team

General Information

Data Carpentry develops and teaches workshops on the fundamental data skills needed to conduct research. Its target audience is researchers who have little to no prior computational experience, and its lessons are domain specific, building on learners' existing knowledge to enable them to quickly apply skills learned to their own research. Participants will be encouraged to help one another and to apply what they have learned to their own research problems.

For more information on what we teach and why, please see our paper "Good Enough Practices for Scientific Computing".

This workshop will use Python to analyze, aggregate, and plot data. We'll be working with an ecology dataset, but don't let that scare you away! The approaches taught in this course can be useful across a wide variety of disciplines.

Who: This course is open to interns from any Smithsonian unit. You don't need to have any previous knowledge of the tools that will be presented at the workshop.

Where: This is a hybrid workshop. Attendees will receive a Zoom link prior to the workshop. In-person attendance may be limited, and attendees wishing to participate in person should note this preference during registration. The in-person workshop will take place in WG33, National Museum of Natural History, 10th & Constitution NW, Washington, DC 20560. Get directions with OpenStreetMap or Google Maps. Additional instructions for reaching the room will be provided to attendees prior to the workshop.

When: 16-17 Jul 2024; 1:00 pm - 4:00 pm EDT. Add to your Google Calendar.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.). They should have a few specific software packages installed (listed below).

Accessibility: We are committed to making this workshop accessible to everybody. For workshops at a physical location, the workshop organizers have checked that:

Materials will be provided in advance of the workshop, and large-print handouts are available if you notify the organizers in advance. If we can help make learning easier for you (e.g. sign-language interpreters, lactation facilities), please get in touch (using the contact details below) and we will attempt to provide them.

Contact: Please email mansura@si.edu for more information.

Roles: To learn more about the roles at the workshop (who will be doing what), refer to our Workshop FAQ.


Code of Conduct

Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.


Collaborative Notes

We will use this collaborative document for chatting, taking notes, and sharing URLs and bits of code.


Surveys

Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey


Schedule

Day 1

Before starting Pre-workshop survey
1:00 pm Workshop introduction
1:10 pm Introducing Python
2:00 pm Introducing Pandas
3:00 pm Accessing Data in a Dataframe
4:00 pm End of Day 1

Day 2

1:00 pm Review and Questions
1:10 pm Aggregating and Grouping Data
2:00 pm Combining Dataframes
3:00 pm Visualization with `altair`
3:50 pm Post-workshop survey
4:00 pm End of Day 2

Setup

To participate in this workshop, you will need access to a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) with an up-to-date web browser.

All attendees must complete the instructions on the Summary and Setup page for the lesson prior to the workshop.

Virtual attendees must have Zoom installed.