Tools in Data Science is a diploma level data science course at IIT Madras.
It teaches the tools used to build & deploy data science applications.
Anyone can audit this course. It's public.
Anyone can access this course content for free and submit assessments.
Those auditing can use GitHub notifications and watch activity on the course repository.
Enrolled IITM students can additionally participate in Discourse, get projects evaluated, take the final end-term, and get a certificate.
You MUST know Python, JavaScript, APIs, etc.
You need a good understanding of Python, JavaScript, HTML, APIs, Excel, ChatGPT, and data science concepts.
Try Graded Assignment 1. Drop it if you score low.
It's a practical course. Just get it done. "How" matter less.
The course models real-life. Unclear problems, messy data, ridiculous deadlines, limited support.
Find your own unique ways of solving the problems. There's no one right approach.
It's a hard course. Take it in your last IITM BS Diploma term.
It's good for learning, maybe not for grades.
- In Jan 2025, 40% of the batch failed.
- The ROE is intentionally impossible
Here's students' feedback from past terms:
- It used to be an easy course until 2024. # # #
- Now it's hard and covers more. Take it in your last semester if possible. # # #
- Plan extra time. It takes more time than typical 3-credit courses. # # #
- LLMs grade you -- unpredictably. # #
- The ROE is hard. #
- Should you take Tools in Data Science this term? (Ans: take it in your last term)
Take Graded assignment 1 to check if you're ready for this course. Please drop this course (do it in a later term) if you score low. It'll be too tough for you now.
But the learnings may be worth the effort.
- May 2025 feedback indicates that students know it's hard -- and still rate the learning high.
- Jan 2025 course experience and farewell post.
Copying & ChatGPT are encouraged.
You CAN copy from friends and LLMs. You can work in groups. You can use the Internet, WhatsApp, ChatGPT, your notes, your friends, your pets...
You can share code. Even in projects, assignments, and exams (except the final in-person end-term exam).
- Why copy? Because in real life, there's no time to re-invent the wheel. You'll be working in teams on the shoulders of giants. It's important to learn how to do that well.
- To learn well, understand what you're copying. If you're short of time, prioritize.
- To learn better, share what you've learnt. Learn from others' feedback.
Check system requirements.
Check system-requirements.md for permissions you need, software to install, and websites to access. You may need to speak with your system administrator for access.
This course covers:
- Programming tools from building to deploying (with & for AI).
- Development Tools used to build apps.
- Deployment Tools to publish for others to use.
- AI Coding to write better code faster.
- Large Language Models for apps that use AI.
- Data science tools from sourcing to narrating.
- Data Sourcing to get the data: download, query, scrape.
- Data Preparation to clean and transform.
- Data Analysis with statistics, geospatial & network analysis.
- Data Visualization using charts, slides, and narratives.
Don't bother with the content. Start with assessments. Ask ChatGPT for help. If stuck, read content.
| Content | Assessment | Weight | Release Date | Submission Date |
|---|---|---|---|---|
| Development Tools | GA1 - Chat | Wed 03 Sep 2025 | Sun 05 Oct 2025 | |
| Deployment Tools | GA2 - Chat | Wed 24 Sep 2025 | Sun 05 Oct 2025 | |
| AI Coding | GA3 - Chat | Fri 03 Oct 2025 | Sun 12 Oct 2025 | |
| Project 1 | P1 - Chat | 20% | Fri 26 Sep 2025 | Fri 17 Oct 2025 |
| Large Language Models | GA4 - Chat | Sat 11 Oct 2025 | Sun 19 Oct 2025 | |
| Data Sourcing | GA5 - Chat | Wed 15 Oct 2025 | Mon 27 Oct 2025 | |
| Data Preparation | GA6 - Chat | Wed 23 Oct 2025 | Mon 10 Nov 2025 | |
| Data Analysis | GA7 - Chat | Thu 30 Oct 2025 | Sun 16 Nov 2025 | |
| Project 2 | P2 - Chat | 20% | Thu 30 Oct 2025 | 3:00 pm, Sat 29 Nov 2025 |
| Data Visualization | GA8 - Chat | Wed 19 Nov 2025 | Mon 08 Dec 2025 | |
| Remote Online Exam (hard) | ROE - Chat | 20% | 1 pm, Sun 09 Nov 2025 | 1:45 pm, Sun 09 Nov 2025 |
| Final end-term | F | 20% | Sun 21 Dec 2025 | Sun 21 Dec 2025 |
| GA = Graded Assignment | Best 5 of 8 | 20% |
Updates:
- 23 Sep 2025: We swapped modules 2 & 3. Deployment Tools is now before AI coding - because the AI coding graded assignment needs deployment. Dates revised:
- GA2 release is on 24 Sep (was 18 Sep).
- GA3 release is on 1 Oct (was 25 Sep)
- 05 Nov 2025: Project 2 will be live evaluated between 3:00 pm - 4:00 pm IST on Sat 29 Nov 2025.
- Graded Assignment 1 checks course pre-requisites. Please drop this course (do it in a later term) if you score low. It'll be too tough for you now.
- We may post bonus activities on Discourse. See previous bonus activities
- LLMs will evaluate you. Results vary each run. Learn to answer them robustly. LLM decisions are final.
- Faculty creates content: Anand S, [email protected] | @s.anand
- Instructors teach the course:
- Carlton D'Silva. [email protected] | @carlton
- Prasanna S, [email protected] | @iamprasna
- Teaching assistants answer your doubts:
- Hritik Roshan Maurya, [email protected] | @HritikRoshan_HRM
- Jivraj Singh, [email protected] | @Jivraj | LinkedIn Profile
- Virtual TA (GPT Instructions)
- TDS: Live Video Sessions + YouTube Channel
- TDS: Course files -- Jupyter notebooks, datasets, etc.
IITM student links:
- TDS: Discourse (for IITM students) - Ask questions, get help, and discuss with your peers.
- TDS: Course page - Sep 2025
- TDS: Announcement group - Sep 2025
BS in Data Science Program links:
Check these three links regularly to keep up with the course.
- Seek Notifications for Course Notifications. Log into seek.onlinedegree.iitm.ac.in and click on the bell icon 🔔 on the top right corner
↗️ . Check notifications daily.
- Your email for Course Announcements. Seek Inbox are forwarded to your email. Check daily. Check spam folders too.
- TDS Discourse: Faculty, instructors, and TAs will share updates and address queries here. Email [email protected] cc: [email protected] if you can't access Discourse.