
sanand0/tools-in-data-science-public


Tools in Data Science - Sep 2025

Tools in Data Science is a diploma-level data science course at IIT Madras.

It teaches the tools used to build & deploy data science applications.

Anyone can audit this course. It's public.

Anyone can access this course content for free and submit assessments.

Those auditing can use GitHub notifications and watch activity on the course repository.

Enrolled IITM students can additionally participate in Discourse, get projects evaluated, take the final end-term, and get a certificate.

You MUST know Python, JavaScript, APIs, etc.

You need a good understanding of Python, JavaScript, HTML, APIs, Excel, ChatGPT, and data science concepts.

Try Graded Assignment 1. Drop the course if you score low.

It's a practical course. Just get it done. "How" matters less.

The course models real life: unclear problems, messy data, ridiculous deadlines, limited support.

Find your own unique ways of solving the problems. There's no one right approach.

It's a hard course. Take it in your last IITM BS Diploma term.

It's good for learning, maybe not for grades.

Here's students' feedback from past terms:

  • It used to be an easy course until 2024.
  • Now it's hard and covers more. Take it in your last semester if possible.
  • Plan extra time. It takes more time than typical 3-credit courses.
  • LLMs grade you -- unpredictably.
  • The ROE is hard.
  • Should you take Tools in Data Science this term? (Ans: take it in your last term)

Take Graded Assignment 1 to check if you're ready for this course. Please drop this course (do it in a later term) if you score low. It'll be too tough for you now.

But the learnings may be worth the effort.


Copying & ChatGPT are encouraged.

You CAN copy from friends and LLMs. You can work in groups. You can use the Internet, WhatsApp, ChatGPT, your notes, your friends, your pets...

You can share code. Even in projects, assignments, and exams (except the final in-person end-term exam).

  • Why copy? Because in real life, there's no time to re-invent the wheel. You'll be working in teams on the shoulders of giants. It's important to learn how to do that well.
  • To learn well, understand what you're copying. If you're short of time, prioritize.
  • To learn better, share what you've learnt. Learn from others' feedback.

Check system requirements.

Check system-requirements.md for permissions you need, software to install, and websites to access. You may need to speak with your system administrator for access.

8 modules in 12 weeks

This course covers:

  1. Programming tools from building to deploying (with & for AI).
    1. Development Tools used to build apps.
    2. Deployment Tools to publish for others to use.
    3. AI Coding to write better code faster.
    4. Large Language Models for apps that use AI.
  2. Data science tools from sourcing to narrating.
    1. Data Sourcing to get the data: download, query, scrape.
    2. Data Preparation to clean and transform.
    3. Data Analysis with statistics, geospatial & network analysis.
    4. Data Visualization using charts, slides, and narratives.

Don't bother with the content. Start with assessments. Ask ChatGPT for help. If stuck, read content.

| Content | Assessment | Weight | Release Date | Submission Date |
|---|---|---|---|---|
| Development Tools | GA1 - Chat | | Wed 03 Sep 2025 | Sun 05 Oct 2025 |
| Deployment Tools | GA2 - Chat | | Wed 24 Sep 2025 | Sun 05 Oct 2025 |
| AI Coding | GA3 - Chat | | Fri 03 Oct 2025 | Sun 12 Oct 2025 |
| Project 1 | P1 - Chat | 20% | Fri 26 Sep 2025 | Fri 17 Oct 2025 |
| Large Language Models | GA4 - Chat | | Sat 11 Oct 2025 | Sun 19 Oct 2025 |
| Data Sourcing | GA5 - Chat | | Wed 15 Oct 2025 | Mon 27 Oct 2025 |
| Data Preparation | GA6 - Chat | | Wed 23 Oct 2025 | Mon 10 Nov 2025 |
| Data Analysis | GA7 - Chat | | Thu 30 Oct 2025 | Sun 16 Nov 2025 |
| Project 2 | P2 - Chat | 20% | Thu 30 Oct 2025 | 3:00 pm, Sat 29 Nov 2025 |
| Data Visualization | GA8 - Chat | | Wed 19 Nov 2025 | Mon 08 Dec 2025 |
| Remote Online Exam (hard) | ROE - Chat | 20% | 1 pm, Sun 09 Nov 2025 | 1:45 pm, Sun 09 Nov 2025 |
| Final end-term | F | 20% | Sun 21 Dec 2025 | Sun 21 Dec 2025 |
| GA = Graded Assignment | Best 5 of 8 | 20% | | |
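
To see how the weights combine, here is a minimal Python sketch. It assumes every score is a percentage from 0 to 100, that "Best 5 of 8" means the average of your five highest GA scores, and that a missing GA counts as zero. The course does not spell out these details, so treat the function `total_score` and its parameter names as hypothetical, not as the official grading formula.

```python
# Weights from the schedule, in percent: GA (best 5 of 8), P1, P2, ROE, F.
WEIGHTS = {"ga": 20, "p1": 20, "p2": 20, "roe": 20, "final": 20}


def total_score(ga_scores, p1, p2, roe, final):
    """Estimate the course total (0-100) from component percentages.

    Assumption: the GA component is the mean of the five highest GA scores,
    and assignments that were not submitted count as zero.
    """
    if len(ga_scores) > 8:
        raise ValueError("There are only 8 graded assignments")
    # Keep the five highest scores; dividing by 5 treats missing ones as 0.
    best_five = sorted(ga_scores, reverse=True)[:5]
    components = {
        "ga": sum(best_five) / 5,
        "p1": p1,
        "p2": p2,
        "roe": roe,
        "final": final,
    }
    # Weighted sum, divided by 100 because the weights are in percent.
    return sum(WEIGHTS[name] * score for name, score in components.items()) / 100


if __name__ == "__main__":
    # Best five GAs here are 100, 90, 80, 70, 60, so the GA component is 80.
    print(total_score([100, 90, 80, 70, 60, 50, 40, 30], p1=50, p2=60, roe=70, final=80))
```

Under these assumptions, three weak GAs cost you nothing, but skipping a project or the ROE drops 20% of the total.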

Updates:

  • 23 Sep 2025: We swapped modules 2 & 3. Deployment Tools is now before AI coding - because the AI coding graded assignment needs deployment. Dates revised:
    • GA2 release is on 24 Sep (was 18 Sep).
    • GA3 release is on 1 Oct (was 25 Sep)
  • 05 Nov 2025: Project 2 will be live evaluated between 3:00 pm - 4:00 pm IST on Sat 29 Nov 2025.

Notes

  • Graded Assignment 1 checks course pre-requisites. Please drop this course (do it in a later term) if you score low. It'll be too tough for you now.
  • We may post bonus activities on Discourse. See previous bonus activities
  • LLMs will evaluate you. Results vary each run. Learn to answer them robustly. LLM decisions are final.

Contacts

Links

IITM student links:

BS in Data Science Program links:

Check communications

Check these three links regularly to keep up with the course.

  1. Seek Notifications for Course Notifications. Log into seek.onlinedegree.iitm.ac.in and click on the bell icon 🔔 on the top right corner ↗️. Check notifications daily.
  2. Your email for Course Announcements. Seek Inbox messages are forwarded to your email. Check daily. Check spam folders too.
  3. TDS Discourse: Faculty, instructors, and TAs will share updates and address queries here. Email [email protected] cc: [email protected] if you can't access Discourse.

Previous terms

About

Official content for the IITM BS course on Tools in Data Science
