Data Science Institute < Brown University

Brown University's Data Science Institute (DSI) serves as a campus hub for research and education in data science. Through our research and academic programs, we strive to ensure that those most in need are not the last to benefit from fundamental research in data science or data-driven applied research. Brown’s DSI is unique because we:

Equally value both domain-driven and fundamental methodological research in data science;
Increase data fluency and educate the next generation of data scientists, through our master’s program and providing outreach to students and researchers at a variety of career stages;
Explore the impact of the data revolution on culture, education, health and genomics, society, and social justice by engaging with research partners within the University and beyond.

The DSI offers multiple academic programs:

Residential master’s degree in Data Science, including a fifth-year master's option for Brown undergraduates
Online master's in Data Science: Policy, Governance, and Society
Doctoral certificate for Brown graduate students
Undergraduate certificate in Data Fluency

The DSI also supports two research centers, the Center for Computational Molecular Biology (CCMB) and the Center for Technological Responsibility, Reimagination, and Redesign (CNTR).

To support data science research and education across Brown’s campus, the DSI hosts seminars and public lectures, offers workshops in data science skills, and offers small grants to Brown researchers in all disciplines.

For additional information, please visit the institute's website: http://dsi.brown.edu/.

DATA 0050. What is Data? The great, good, and perils of data in society.

We know data is all around us and used in so many ways, but have you ever really thought about what data is? When did we start to think about ‘data’? What goes into ‘curating’ data, where do we get data and how do we know good data when we see it? This course will explore these questions from a non-technical, non disciplinary perspective. Regardless of your ultimate professional goals you will be interacting with data, so join us in discovering more about data from an interrogatory lens!

DATA 0080. Data, Ethics and Society.

A course on the social, political, and philosophical issues raised by the theory and practice of data science. Explores how data science is transforming not only our sense of science and scientific knowledge, but our sense of ourselves and our communities and our commitments concerning human affairs and institutions generally. Students will examine the field of data science in light of perspectives provided by the philosophy of science and technology, the sociology of knowledge, and science studies, and explore the consequences of data science for life in the first half of the 21st century. Fulfills requirement for Certificate in Data Fluency

Fall

DATA0080

S01

14681

MWF

9:00-9:50(09)

'To Be Arranged'

DATA 0150. Data Detectives: Critical Thinking in Our Data-Driven World.

What is data? Where does it come from? How do we use data? In this First Year Seminar, students will develop critical thinking skills: critical thinking with data – how to use, analyze and interpret data to answer questions and make decisions, and critical thinking about data – how to question, contextualize and evaluate data. Students will examine the history of data science, practice the art of effective data communication, explore the ways in which data is shaped by the analyst and society, and identify data science applications across a range of disciplines. Students will engage in hands-on activities such as exploring existing datasets and creating their own, while developing skills in data literacy, ethical analysis, and data visualization. Open to students of all disciplines, no prior experience is required!

Fall

DATA0150

S01

15268

TTh

1:00-2:20(06)

(K. Bergen)

DATA 0200. Data Science Fluency.

As data science becomes more visible, are you curious about its unique amalgamation of computer programming, statistics, and visualizing or storytelling? Are you wondering how these areas fit together and what a data scientist does? This course offers all students regardless of background the opportunity for hands-on data science experience, following a data science process from an initial research question, through data analysis, to the storytelling of the data. Along the way, you will learn about the ethical considerations of working with data, and become more aware of societal impacts of data science. Course does not count toward CS concentration requirements.

Spr

DATA0200

S01

24387

TTh

1:00-2:20(08)

'To Be Arranged'

DATA 0250. Applied Statistics in Python.

As more students engage in data science there is a need to provide guidance on conducting basic statistical analysis in Python. This course will provide a non-specialist approach to applied statistics, specifically linear models Python. Students will learn how to conduct linear modules using the Statsmodels package in Python. Students should have good working knowledge of descriptive statistics (equivalent to a high school AP level). Python coding experience is helpful but not required. Student learning would be assessed through hands-on Python coding activities and written interpretation of statistical reports. Students from the humanities and social sciences are particularly encouraged to enroll in this course. Pre-req- Basic High School A.P. knowledge of statistics.

DATA 1010. Probability, Statistics, and Machine Learning.

An introduction to the mathematical methods of data science through a combination of computational exploration, visualization, and theory. Students will learn scientific computing basics, topics in numerical linear algebra, mathematical probability (probability spaces, expectation, conditioning, common distributions, law of large numbers and the central limit theorem), statistics (point estimation, confidence intervals, hypothesis testing, maximum likelihood estimation, density estimation, bootstrapping, and cross-validation), and machine learning (regression, classification, and dimensionality reduction, including neural networks, principal component analysis, and unsupervised learning).

DATA 1030. Hands-on Data Science.

Successful machine learning systems require more than fitting complex models on data. You need to understand your dataset and all the decisions that go into data collection and modeling, carefully validate your results, and you need to be able to explain and defend the conclusions of the model. You will learn all these aspects and more in this course. Topics include exploratory data analysis, data splitting and preprocessing, cross validation and the bias-variance trade-off, training linear and nonlinear supervised machine learning models, measuring uncertainties, and making the model predictions interpretable/explainable. You will also learn about techniques to handle missing values. We use the Python data science ecosystem (e.g., sklearn, pandas and polars, matplotlib, XGBoost, SHAP). Prerequisites: Python coding experience (this course is not suitable for students with no prior python experience). Familiarity with linear algebra and calculus.

Fall

DATA1030

S01

14835

TTh

10:30-11:50(13)

(A. Zsom)

DATA 1050. Data Engineering.

The course will cover the storage, retrieval, and management of various types of data and the computing infrastructure (such as various types of databases and data structures) and algorithmic techniques (such as searching and sorting algorithms) and query languages (such as SQL) for interacting with data, both in the context of transaction processing (OLTP) and analytical processing (OLAP). Students will be introduced to measures for evaluating the efficacy of different techniques for interacting with data (such as ‘Big-Oh’ measure of complexity and the number of I/O operations) and various types of indexes for the efficient retrieval of data. The course will also cover several components of the Hadoop ecosystem for the processing of ‘big data.’ Additional topics include cloud computing and NoSQL databases. Introduction to concepts and techniques of computer science essential for data science will also be covered.

Fall

DATA1050

S01

14959

TTh

9:00-10:20(05)

(A. Ashraf)

DATA 1080. Large Language Models and Generative AI.

The primary emphasis of this course will be on the technical foundations of large language models (LLMs) and generative AI (Gen AI). The course will cover the use of agents and tools to combine the power of large language models and Gen AI with a wide range of traditional computer algorithms. Additionally, the course will cover the social impacts of this technology and the safety concerns it raises. Students will be required to complete a project to build a multi-agent system in which each agent uses both the power of generative AI and traditional algorithms and in which the collaboration of the agents produces a system which can perform a complex task such as planning a trip or building a predictive model or creating a database or a creating a website. Programming assignments will be done in Python or PyTorch.

DATA 1150. Data Science Fellows.

DATA 1150 is for juniors and seniors possessing data science skills, seeking to apply these skills and collaborate with faculty to integrate data science content into Brown courses. The course teaches communication, teaching and learning strategies, and determining project requirements. Qualified students have a combination of programming experience (intermediate level or above in R or Python), statistical knowledge (intermediate level or above) and knowledge of how data and computing can be used in applied fields. Students in the data fluency certificate must have completed DATA 0200 prior to DATA 1150. Students are required to complete an application (https://forms.gle/Je3Prrzs3NDEo4eG9) for the course. Students should apply by May 1 for full consideration, and must apply by August 1 at the latest. Qualified students must participate in an interview with the instructor and override requests will be granted only to students by instructor.

Fall

DATA1150

S01

14957

3:00-5:30(10)

(L. Clark)

DATA 1200. Reality Remix - Experimental VR.

This course pursues collaborative experimentation with virtual and augmented reality (AR and VR). The class will work as a team to pursue research (survey of VR/AR experiences, scientific and critical literature review), reconnaissance (identifying VR/AR resources on campus, in Providence and the region), design (VR/AR prototyping). Research findings are documented in a class wiki. The course makes use of Brown Arts Initiative facilities in the Granoff Center where an existing VR laboratory will be expanded through the course of the semester based on student needs. Class culminates in the release the class wiki as a resource for the Brown community.

DATA 1250. Artificial Intelligence Law and Policy.

This course will explore how courts, regulators, and policymakers are responding to the complex questions posed by artificial intelligence and machine learning technology. The course will cover several areas of law and policy, including liability theories for individual harms, privacy, discrimination, copyright, and new regulations emerging both in the U.S. and abroad. Students will read and discuss court decisions, statutes, policy memos, academic research, and more as they build a picture of this new legal and policy landscape. At the same time, they will learn the basics of the technology underlying AI systems and how law and technology influence one another.

Fall

DATA1250

S03

15641

4:00-6:30(07)

(T. Lazovich)

DATA 1340. Machine Learning for the Earth and Environment (EEPS 1340)..

Interested students must register for EEPS 1340.

DATA 1450. Text Analytics.

This course will first cover techniques for compiling textual corpora from web pages, pdfs, scanned pdfs, images, audio clips, etc. Secondly, it will look at processes for extracting some common types of information from these corpora. In particular, we will cover extracting named entities (persons, locations, organizations, etc.), relations between entities, events, transactions, topics, document summaries, abstracts, legal clauses, etc. This course is different from standard courses in Natural Language Processing and Computational Linguistics in that we will spend significant amount of course time on compiling textual corpora from documents in a variety of formats and our emphasis will be on extracting information that can be fed to analytics pipelines.

DATA 1491. Fairness in Automated Decision Making (CSCI 1491)..

Interested students must register for CSCI 1491 S02.

DATA 1500. Data Visualization & Narrative.

Data visualization is an essential tool in both discovering and communicating key analytical findings. However, data practitioners and developers can sometimes undervalue the visual polish that goes into creating the most effective graphics. This course will act as a technical primer for building data visualization using code, but a core focus will be the graphic design decisions – color, hierarchy, font selection, labeling – that elevate visualizations. Additional topics will include cartography, web design, and interaction.

Spr

DATA1500

S01

24965

4:00-6:30(17)

'To Be Arranged'

DATA 1720. Tackling Climate Change with Machine Learning (EEPS 1720).

Interested students must register for EEPS 1720.

DATA 1954S. Technology, Data, and the Authoritarian State (HIST 1954S).

Interested students must register for HIST 1954S.

Fall

DATA1954S

S01

15860

Arranged

'To Be Arranged'

DATA 2020. Statistical Learning.

A modern introduction to inferential methods for regression analysis and statistical learning, with an emphasis on application in practical settings in the context of learning relationships from observed data. Topics will include basics of linear regression, variable selection and dimension reduction, and approaches to nonlinear regression. Extensions to other data structures such as longitudinal data and the fundamentals of causal inference will also be introduced.

Spr

DATA2020

S01

24388

TTh

10:30-11:50(09)

'To Be Arranged'

DATA 2030. Forces of Influence in AI Governance.

Artificial Intelligence (AI) technologies have infused every industry, government, and field of research. It’s a part of the vast majority of every person’s daily life. Indeed, it is no longer viable to consider AI/tech as a field on its own. To navigate this rapidly moving, and increasingly complicated space as effectively as possible, we have to first step back and take an ecosystem-wide view. We’ll start there, with a field-wide overview, then dive into the theories of change and the different actors that influence the governance of AI systems globally (think big tech, governments, investors, media, etc). This course will give an overview of those who are influencing the governance of AI technologies, and their dynamics, and help students make informed choices for themselves and their work in or adjacent to tech & AI.

DATA 2040. Deep Learning and Special Topics in Data Science.

A hands-on introduction to neural networks, reinforcement learning, and related topics. Students will learn the theory of neural networks, including common optimization methods, activation and loss functions, regularization methods, and architectures. Topics include model interpretability, connections to other machine learning models, and computational considerations. Students will analyze a variety of real-world problems and data types, including image and natural language data.

DATA 2050. Data Science Practicum.

This course is a requirement for master’s students in Data Science and is only open to them. The course includes a semester-long capstone project as well as instruction on topics that prepare students for working as data scientists, such as requirements gathering, version control, bug tracking, software deployment, and other professional development. Capstone projects will be sourced from entities in Brown (departments, labs, researchers, etc.) as well as external entities (companies, nonprofits, etc.) and will engage most of the core components of data science: Data sculpting (data cleaning, formatting, feature selection, etc.), exploratory data analysis, data modeling, and data visualization. Students will also address any social and ethical issues raised by their project. Students will usually work in teams of 3 - 4.

Spr

DATA2050

S01

24568

3:00-5:30(10)

(A. Ashraf)

DATA 2060. Machine Learning: from Theory to Algorithms.

Data science techniques and tools are all around us. Machine learning is a term used across many different disciplines, and often people use machine learning tools without a thorough understanding of how and why the tools work. This course will provide a foundation of machine learning grounded in the mathematical models behind the techniques. We will cover the theory, computational methods, and visualization inherent in the application of machine learning models. Students will learn the statistical learning framework, common assumptions in the data generation process, the mathematics behind machine learning models, including supervised and unsupervised techniques, as well as how to implement machine learning models in Python from scratch. Strong python skills (preferably object-oriented programing) are required. All DSI and CS ScM students will receive overrides as this is a required course for them.

Fall

DATA2060

S01

14686

TTh

2:30-3:50(12)

(A. Zsom)

DATA 2080. Data and Society.

DATA 2110. Topics in Econometrics.

This course will begin with a survey of the literature on identification using instrumental variables, including identification bounds, conditional moment restrictions, and control function approaches. The next part of class will cover some of the theoretical foundations of machine learning, including regularization and data-driven choice of tuning parameters. We will discuss in some detail the canonical normal means model, Gaussian process priors, (empirical) Bayes estimation, and reproducing kernel Hilbert space norms. We will finally cover some selected additional topics in machine learning, including (deep) neural nets, text as data (topics models), multi-armed bandits, and data visualization.

DATA 2450. Exchange Scholar Program.

Fall

DATA2450

S01

13302

Arranged

'To Be Arranged'

DATA 2980. Research in Data Science.

Section numbers vary by instructor. Please check Banner for the correct section number and CRN to use when registering for this course.

DSIO 2000. Technical Foundation for Data Science Success.

Focused on practical applications, this course covers the foundational concepts from programming, linear algebra, calculus, probability, and statistics that are most relevant to technical data science skills in the curriculum. At the end of the course, all students will achieve a baseline level of technical knowledge to prepare them for the technical components of the program. Learners will have the opportunity to explore more in-depth topics to discover how foundational topics like derivatives, matrices, and distributions are applied in machine learning and data modeling. Examples are drawn from advanced courses to highlight how these tools can be applied to theories and practice examined in this course.

Fall	DSIO2000	S01	13409	Arranged		(L. Clark)
Spr	DSIO2000	S01	23327	Arranged		(L. Clark)

DSIO 2010. Data Engineering in Disguise.

This course introduces students to the core principles of data engineering, emphasizing the often-hidden ethical choices that shape how massive datasets are managed. Students will learn about the fundamentals of data architecture, storage, and processing, while exploring critical values issues such as privacy, biases, data provenance, ownership, and copyright. This course interweaves the ethical considerations with the technical mechanics of data engineering, which exemplifies the real-world choices data engineers make as well as their broader societal implications. By the end of the course, students will understand not just the technical foundations of data engineering, but also the value-laden decisions involved in handling large-scale data.

Spr

DSIO2010

S01

23325

Arranged

(D. Firrincieli)

DSIO 2020. Machine Learning/DL/LLM.

This course offers a comprehensive introduction to machine learning (ML), deep learning (DL), and generative AI, preparing students to become informed users of these powerful technologies. The first half of the course focuses on classical ML techniques, while the second half is split between deep learning applications and the emerging field of generative AI, particularly large language models (LLMs). Students will explore key concepts like backpropagation to understand how models are updated with new data, and the differences between pretraining, fine-tuning, and alignment strategies, including Deep Policy Optimization (DPO) and Reinforcement Learning from Human Feedback (RLHF).

Fall

DSIO2020

S01

13407

Arranged

(M. Mneimneh)

DSIO 2030. Applied Learning Experience.

In this capstone course, students will apply their comprehensive knowledge of data science to a significant project centered on policy and governance issues and their societal impacts. This course integrates students' understanding of policy, governance, machine learning, and data science into a practical project, which may include case studies, practicums, projects related to current employment, or research papers. Students will engage in hands-on work with data, reflecting key aspects of the data science pipeline. Additionally, the course offers career-oriented skills development to enhance students' professional readiness. Through this culminating project, students will demonstrate their ability to synthesize and apply their learning in real-world scenarios.

Fall	DSIO2030	S01	13406	Arranged		(T. Lazovich)
Spr	DSIO2030	S01	23324	Arranged		(T. Lazovich)

DSIO 2100. Basic AI & Policy Ethics.

This course provides an introduction to the ethical and policy considerations surrounding artificial intelligence (AI) in today's society. Students will explore key ethical concerns, such as data privacy, bias, and accountability, as well as the societal and historical contexts that have shaped current AI governance. In addition, the course will offer a high-level overview of how AI systems are developed, including the basics of data collection, usage, and the training process for machine learning models. By examining these topics, students will gain a foundational understanding of the complex interactions between AI technologies and societal impacts, preparing them for deeper discussions in future courses on AI governance and responsible innovation.

Fall	DSIO2100	S01	13410	Arranged		(A. Hillery)
Spr	DSIO2100	S01	23328	Arranged		(A. Hillery)

DSIO 2110. Evidence-Driven Policy Making.

This course explores the role of artificial intelligence (AI) and machine learning (ML) in shaping evidence-driven policy decisions across various sectors. Using case studies from various AI/ML sectors, students will critically examine how AI/ML tools influence policy outcomes. Rather than delving into the technical intricacies of ML, this course emphasizes a "black box" approach, where data inputs lead to predictions. Students will learn to distinguish between prediction and intervention, recognize the limitations of AI/ML, and develop a transparent, precise language for discussing these technologies. By fostering healthy skepticism, this course equips students to make informed decisions about AI's role in evidence-based policy making.

Spr

DSIO2110

S01

23326

Arranged

(M. Berlin)

DSIO 2120. Fairness & Bias.

This course investigates the pursuit of building equitable technology by addressing fairness and bias in algorithmic systems. Students will review the latest advancements in creating more equitable algorithms, exploring definitions and types of (un)fairness. The course covers the challenges of explaining machine learning processes, ensuring accountability in algorithmic decisions, and addressing systemic biases. Through a combination of theoretical insights and practical approaches, students will gain a comprehensive understanding of how to design and implement fair and accountable AI systems.

Fall

DSIO2120

S01

13408

Arranged

(N. Marda)

DSIO 2130. Advanced Topics of AI Governance.

Drawing from the lessons of earlier courses, this course provides a thorough exploration of data governance within the context of artificial intelligence (AI) and machine learning (ML). Students will start by defining AI ethics and core principles guiding AI and ML development, expanding on key issues such as bias, fairness, and transparency. This course will then explore the current landscape of AI regulation and legislation, examining the roles of governments and international organizations in shaping and enforcing these regulations. Students will discuss the challenges and opportunities associated with AI governance, gaining insights into how regulatory frameworks can both address ethical concerns and foster innovation.

Fall	DSIO2130	S01	13405	Arranged		(N. Marda)
Spr	DSIO2130	S01	23323	Arranged		(N. Marda)

Data Fluency

The Certificate in Data Fluency provides a formal pathway for undergraduates in concentrations other than applied mathematics, computational biology, computer science, math, and statistics (see details below) who wish to gain fluency and facility with the tools of data science. The driving intellectual question motivating certificate students is how we can infer meaning from data whilst avoiding false predictions. The required experiential learning component provides you with the opportunity to apply your data-science skills in applied settings, engage in research that uses data science, teach data science as an undergraduate teaching assistant, or undertake an internship that has a substantive data-science component.

As with all undergraduate certificates, the certificate has the following requirements:

Students may not earn more than one certificate and may only have one declared concentration.
Students must be enrolled in or have completed at least two courses toward the certificate at the time they declare in ASK.
No more than one course may count toward your concentration and the certificate.
Students may declare a certificate in ASK only once an approved concentration is on file, and must declare no later than the last day of classes of the antepenultimate (typically the sixth) semester, in order to facilitate planning for the capstone or other experiential learning opportunity.
Students must submit a proposal for their experiential learning opportunity by the end of the sixth semester.

Excluded Concentrations: Applied Mathematics, Computational Biology, Computational Neuroscience, Computer Science, Mathematics, Statistics, and Social Analysis and Research. This includes joint concentrations in these areas; for example, Applied Mathematics-Economics is also excluded. According to the certificate guidelines, a student’s concentration and certificate cannot have substantial overlap.

For more information on the Certificate in Data Fluency, please visit the Data Science Institute website.

Certificate Requirements

Certificate Requirements

Core Courses:
DATA 0080	Data, Ethics and Society	1
CSCI 0111	Computing Foundations: Data	1
or CSCI 0150	Introduction to Object-Oriented Programming and Computer Science
or CSCI 0170	Computer Science: An Integrated Introduction
or CSCI 0190	Accelerated Introduction to Computer Science
or CPSY 0950	Introduction to programming
DATA 0200	Data Science Fluency	1
Elective Course: ¹		1
Select one follow-up Applied Math, Biostatistics, Computer Science or domain-specific course with a significant data component from the following list (or another course with approval from the certificate advisor). Students should be aware of potential prerequisites for these courses, which can be found at Courses@Brown.
ANTH 1201	Introduction to Geographic Information Systems and Spatial Analysis
APMA 1650	Introduction to Probability and Statistics with Calculus
BIOL 0495	Statistical Analysis of Biological Data
BIOL 1435	Computational Methods for Studying Demographic History with Molecular Data
BIOL 1535	Survey of Health Informatics
BIOL 1555	Methods in Informatics and Data Science for Health
BIOL 1595	Artificial Intelligence in Health Care
CPSY 0900	Statistical Methods
CPSY 1291	Computational Methods for Mind, Brain and Behavior
CPSY 1580C	Visualizing Information
CPSY 1950	Deep Learning in Brains, Minds and Machines
CSCI 0410	Foundations of AI and Machine Learning
CSCI 1270	Database Management Systems
CSCI 1302	Sociotechnical Approaches to AI and HCI
CSCI 1420	Machine Learning
CSCI 1470	Deep Learning
CSCI 1491	Fairness in Automated Decision Making
CSCI 1951A	Data Science
DATA 1030	Hands-on Data Science
DATA 1050	Data Engineering
DATA 1150	Data Science Fellows ²
DATA 1500	Data Visualization & Narrative
ECON 1620	Introduction to Econometrics
ECON 1630	Mathematical Econometrics I
EDUC 1230	Applied Statistics for Ed Research and Policy Analysis
EEPS 1320	Introduction to Geographic Information Systems for Environmental Applications
EEPS 1330	Global Environmental Remote Sensing
EEPS 1340	Machine Learning for the Earth and Environment
EEPS 1720	Tackling Climate Change with Machine Learning
MATH 1210	Probability
MUSC 1210	Seminar in Electronic Music: Real-Time Systems
SOC 1020	Methods of Social Research
SOC 1100	Introductory Statistics for Social Research
SOC 1340	Principles and Methods of Geographic Information Systems
STAT 1501	Essentials of Data Analysis
STAT 1510	Principles of Biostatistics and Data Analysis
STAT 1560	Using R for Data Analysis
Experiential Learning Component: ³		0-1
The required experiential learning component provides students with the opportunity to apply their data-science skills in their concentration, engage in research that uses data science, teach data science as UTAs, or undertake an internship that has a data-science component. More information can be found at https://dsi.brown.edu/academics/certificate-data-fluency/experiential-learning-component-elc.
Options for fulfilling the requirement include:
1. Participate in a Brown University credit experience (such as an independent study or the Data Science Fellows course).
2. Participate in a non-credit experience (such as a data-related internship, TA-ing for a certificate course, working with the CEDEC on a data-related project). A reflective paper is required for a non-credit option.
Total Credits		4-5

¹: It is best practice to take the elective with or after the other core courses so that students can integrate their data science skills into the advanced elective.
²: Students may complete DATA 1150 and the concurrent Data Science Fellows project to fulfill both the elective and experiential components of the certificate. Students interested in DATA 1150 should pay close attention to the prerequisites and the application deadline for the course.
³: Students must submit a proposal for their experiential component by the end of the sixth semester.

Data Science
Data Science Policy, Governance & Society

Data Science

Master of Science in Data Science

The Data Science Institute (DSI) at Brown offers a master's program (ScM) that prepares students from a wide range of disciplinary backgrounds for distinctive careers in Data Science. With connections to departments across campus, in particular Brown's Division of Applied Mathematics and Department of Computer Science, the master's program offers a unique and rigorous education for people building careers in data science. The program is designed to provide a fundamental understanding of the methods and algorithms of data science, to be achieved through a study of relevant topics in mathematics, statistics, and computer science, including database engineering, visualization, machine learning, and deep learning. The program also provides experience in important, frontline data-science problems in a variety of fields, and introduces students to ethical and societal considerations surrounding data science and its applications.

The program's course structure, including the capstone experience, ensures that students meet the goals of acquiring and integrating foundational knowledge for data science, applying this understanding in relation to specific problems, and appreciating the broader ramifications of data-driven approaches to human activity.

All students begin the program in September; there is no option for starting in the spring semester. The default program length is 21 months but students may elect to complete the program over 12, 16, 21, or 24 months. In some cases, exceptionally well-prepared students complete their work in nine months.

The curriculum for the Data Science Master's Program consists of nine credits: eight required courses, one of which is the experiential project course, and one elective. The nine credit-units divide as follows:

3 credits in mathematical and statistical foundations
3 credits in data and computational science
1 credit in societal implications and opportunities
1 elective credit to be drawn from a wide range of focused applications or deeper theoretical exploration
1 credit capstone experience.

We also offer an option as a 5-th Year Master's Program if you are an undergraduate at Brown. This allows you to substitute maximally 2 credits with courses you have already taken. 5th-Year students must complete the degree in one year (September - August).

For more information on admission and program requirements, please visit the following website: https://graduateprograms.brown.edu/graduate-program/data-science-scm.

Master of Science in Data Science

For more information about the Master's Program curriculum and when courses are offered, please visit the DSI Master's curriculum page or Courses@Brown.

DATA 1030	Hands-on Data Science	1
DATA 1050	Data Engineering	1
APMA 1690	Computational Probability and Statistics	1
DATA 2020	Statistical Learning	1
CSCI 1470	Deep Learning	1
or CSCI 2470	Deep Learning
CSCI 1491	Fairness in Automated Decision Making	1
DATA 2060	Machine Learning: from Theory to Algorithms	1
or CSCI 1420	Machine Learning
DATA 2050	Data Science Practicum	1
The practicum experience is a hands-on thesis project that entails an in-depth study of a current problem in data science. Students will synthesize their knowledge of probability and statistics, machine learning, and data and computational science. Students will work in teams on projects with Brown faculty members or with external companies. The project will be completed as part of a course that includes additional career-oriented skills development.
One elective:		1
Domain knowledge relevant to individual interest, 1 credit, must be a graduate level course with 4-digit course number starting with a non-0 digit. Most graduate level CSCI and APMA courses qualify. Please contact the DGS if you plan to take a course from a different department.
Total Credits		9

Data Science Policy, Governance & Society

Master of Science in Data Science: Policy, Governance & Society

The acceleration of artificial intelligence, machine learning, and advanced analytics is rapidly reshaping industries and societies across the globe. With this immense potential for transformation comes the need to develop robust governance frameworks that ensure data and AI systems are deployed responsibly, ethically, and legally. Brown’s online Master’s in Data Science: Policy, Governance & Society program prepares students to be leaders and stewards of their organization's data assets and analytics capabilities.

This fully online program brings a world-class education to a globally diverse student body while maintaining the same academic rigor and excellence of an Ivy League University. A flexible format delivered asynchronously gives working professionals global access to graduate education from one of the country’s leading universities.

The curriculum for the online ScM in Data Science consists of eight required courses.

For more information about the online Master's Program curriculum and when courses are offered, please visit the department website or Courses@Brown.

DSIO 2000	Technical Foundation for Data Science Success	1
DSIO 2100	Basic AI & Policy Ethics	1
DSIO 2110	Evidence-Driven Policy Making	1
DSIO 2010	Data Engineering in Disguise	1
DSIO 2020	Machine Learning/DL/LLM	1
DSIO 2120	Fairness & Bias	1
DSIO 2130	Advanced Topics of AI Governance	1
DSIO 2030	Applied Learning Experience	1

Brown University

Data Science Institute

Data Fluency

Certificate Requirements

Data Science

Master of Science in Data Science

Master of Science in Data Science

Data Science Policy, Governance & Society

Master of Science in Data Science: Policy, Governance & Society

Brown University

Resources

Contact