Skip to content

Contains resources and material for BASH/SHELL programming for data science

License

Notifications You must be signed in to change notification settings

skhub12/BASH_workshop_2021

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BASH_workshop_2021

BASH/SHELL Programming Workshop for Data Science 2021

Venue: Solbosch Campus of ULB, at Avenue Fr. Roosevelt, 42, 1050 Brussels (the Solvay building). Room: R42.3.103 Time:10-12AM. Speaker: Dr.Shruti Kulkarni


1.1 Why BASH for Data Science?

Bash/Shell environment may not be the most ideal way to handle all kinds of data formats and it is often not the tool that comes to mind when we talk about data science! However, it is definitely useful and powerful in automating your workflows in a structured way. Sometimes, universities or institutional infrastructures often come with a pure Bash environment, such as what you get in the common linux based super-computers and you just want havea high-level view of the data before you drive into the real programming or other exploratory analysis, using Python, R and SQL, SAS, SPSS, and so on. Expertise in these data-science intensive languages can be time consuming. Whereas bash scripting is simple, easy to learn tool and might save some time in optimising a project.

1.2 Who this course is for?

• Anyone who wants to learn bash shell programming.

• Researchers who want to add Bash and other command line nifty tools to their bag of tricks.

1.3 What you’ll learn -

• Basics, the environment, other tools available to bash environment, practical tips.

• Basic bash commands and various features of bash sell.

• Advanced bash commands.

• Creating & running scripts to automate repeated tasks to save time.

• Tutorial: Makefiles, interplaying with bash scripts, and introduction to project structure.

• Tutorial: Manipulating and managing Github tasks in BASH environment.

• Tutorial: Data mining and wrangling in BASH environment.

• Use bash on processing real-world data sets.

1.4 Requirements or prerequisites -

• Desire to learn bash shell programming.

• Access to a Linux or Unix or Mac system (physical or virtual).

1.5 Getting started: Setup instructions for access to shell environment -

Instructions

  • Install a linux OS, recommend Ubuntu, on a virtual machine on windows, such as virtualbox (or vmware) https://www.wikihow.com/Install-Ubuntu-on-VirtualBox This works well for those who have older Windows versions.

  • It is recommended to install WSL (windows subsystem for linux) on Windows 10. It allows you to install Ubuntu on windows, in a limited way, but enough to give you access to bash shell right in your Windows environment.

  • Create AWS account – Free tier, Create EC2 instance, SSH (Possible with azure too).

  • Optional – GIT BASH, Atom editor (very limited features, not recommended).

For set up instructions please refer to the repo and slides.

About

Contains resources and material for BASH/SHELL programming for data science

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages