- About Ramapo
- Academics
- Admissions & Aid
- Student Life
- Athletics
- Alumni
- Arts & Community
- Quick Links
- Apply
- Visit
- Give
The Academic Data Science Alliance (ADSA) Data Science Taxonomy represents a comprehensive framework of competencies for Master’s-level data science programs, developed through collaboration with leading academic institutions and federal partners including NSA, DOD, NIH, and NSF.
This nationally recognized taxonomy establishes standardized competencies that ensure graduates possess the critical skills needed in today’s data-driven economy, making it highly valued by employers across industries.
Ramapo College’s Master of Science in Data Science program aligns exceptionally well with this prestigious framework, demonstrating our commitment to providing students with industry-relevant, federally-recognized competencies that will distinguish them in the competitive data science job market. It’s one of the reasons Ramapo’s MSDS has been consistently listed as one of Fortune’s Best Masters degrees in Data Science.
Our Master of Science (MS) in Data Science degree is a 30-credit program with course work in Python, R, Data Visualization, Database Systems, Machine Learning, Statistics and Mathematical Modeling. Full-time students will complete their degree in 18 months. Courses are delivered as a combination of online, hybrid, and evening in-seat format – you can complete the degree while being on campus just one night a week.
Explore the detailed mappings below to see how each course in our program contributes to building these essential, nationally-recognized data science skills.
Foundations of Analytics: Statistics | |
---|---|
Data Collection DesignMethodical approach to gather observations, measurements and information from different sources |
|
InferenceProcess of using statistics to make conclusions about a population based on a random sample |
|
Modeling (Stochastic)Method of generating sample data and making real-world predictions using statistical models |
|
Multivariate AnalysisStatistical techniques that simultaneously look at three or more variables |
|
Statistical LearningProcess of learning from data using statistical algorithms |
|
Bayesian MethodsTheory based on Bayesian interpretation of probability |
|
Causal inferenceProcess of determining independent effect of a phenomenon |
|
Model uncertaintyLevel of understanding of world representation for mathematical modeling |
|
Experimental designCarrying out research in objective and controlled fashion |
|
SamplingSelection of subset from statistical population to estimate characteristics |
|
Foundations of Analytics: Mathematics | |
Set theory and basic logicFundamental mathematical concepts dealing with collections of objects and logical reasoning |
|
Matrices and basic linear algebraMathematical structures and operations for solving systems of linear equations |
|
Optimization – algorithmMathematical techniques for finding the best solution from all feasible solutions |
|
Probability theoryMathematical framework for analyzing random phenomena and uncertainty |
|
Arithmetic and GeometryBasic mathematical operations and study of shapes, sizes, and properties of space |
|
Graph Theory and NetworksStudy of graphs as mathematical structures used to model pairwise relations |
|
Foundations of Analytics: Data Analytics | |
Exploratory AnalysisApproach to analyzing data sets to summarize their main characteristics |
|
Variable DistributionsDescription of how values of a variable are spread or distributed |
|
Scatter PlotsGraph using Cartesian coordinates to display values for two variables |
|
Correlation AnalysisStatistical method used to evaluate the strength of relationship between variables |
|
Conditional ProbabilityProbability of an event occurring given that another event has occurred |
|
Spatial AnalysisExamining locations, attributes, and relationships of features in spatial data |
|
Data VisualizationRepresentation of data through graphics like charts, plots, infographics |
|
Artificial IntelligenceTechnologies that enable computers to perform advanced functions including analysis |
|
Classical AITraditional artificial intelligence approaches using symbolic reasoning |
|
Modern AI/Data Driven AIContemporary AI approaches based on machine learning and data analysis |
|
Machine LearningSubfield of AI using data and algorithms to learn and improve accuracy over time |
|
Classical MLTraditional machine learning algorithms and statistical methods |
|
Deep LearningMachine learning based on artificial neural networks with multiple processing layers |
|
NLPBranch of AI allowing computers to interpret human language similarly to humans |
|
Uncertainty Quantification/CharacterizationAssessment and representation of uncertainties in computational models |
|
Data MiningPractice of analyzing large databases to generate new information |
|
Foundations of Analytics: Data Modeling | |
Model Development and DeploymentProcess of creating, testing, and implementing predictive models |
|
Model Risks and Mitigation StrategiesIdentification and management of potential model failures |
|
Model analysis and ValidationEvaluation of model performance and reliability |
|
Data VisualizationRepresentation of data through graphics like charts, plots, infographics |
|
Systems and Implementation: Computing and Computer Fundamentals | |
Data StructuresWays of organizing and storing data in computer programs |
|
AlgorithmsStep-by-step procedures for solving computational problems |
|
SimulationsImitation of real-world processes or systems using computational models |
|
Data EngineeringPractice of designing and building systems for collecting and analyzing data |
|
Database DesignProcess of producing detailed data models and database structures |
|
Data Preparation and CleaningProcess of detecting and correcting corrupt or inaccurate records |
|
Records Retention and CurationManagement and preservation of data records over time |
|
Big Data SystemsTechnologies for processing data sets too large for traditional software |
|
Data Security and PrivacyProtection of data from unauthorized access and ensuring privacy |
|
Cloud ComputingDelivery of computing services over the internet |
|
High Performance ComputingUse of parallel processing for running advanced computation programs |
|
Systems and Implementation: Software Development and Maintenance | |
ProgrammingProcess of creating computer programs using programming languages |
|
Collaboration and version controlTools and practices for team software development |
|
Database/data warehousingSystems for storing and managing large amounts of structured data |
|
Data Science Project Design: Users and Impacted Groups | |
Implications of analysis and resultsUnderstanding the broader impact and consequences of data analysis |
|
Defining the user and UX designCreating user-centered design for data products and interfaces |
|
Story-telling with dataCommunicating insights and findings through compelling data narratives |
|
Human-centered designDesign approach that focuses on human needs and experiences |
|
Data Science Project Design: Research Methods | |
Hypothesis developmentFormulating testable predictions based on observations |
|
Defining data-driven questionsCrafting questions that can be answered through data analysis |
|
Computational logicApplication of logical reasoning in computational problem-solving |
|
Data-driven decision makingMaking decisions based on data analysis rather than intuition |
|
Data/research lifecycleComplete process from data collection to research conclusions |
|
Analysis and presentation of decisionsCommunicating analytical findings to support decision-making |
|
Data Science Project Design: Data | |
Data acquisitionProcess of gathering data from various sources |
|
Data governanceManagement of data availability, usability, integrity and security |
|
Data provenance and citationDocumentation of data sources and proper attribution |
|
Data Science Project Design: Open Science by Design | |
Reproducibility, replicability, repeatabilityEnsuring research can be verified and repeated |
|
Interactive computingComputing environment that allows real-time user interaction |
|
Data Science Project Design: Visualization | |
Grammar of graphicsSystem for describing and building statistical graphics |
|
Static and dynamic visualization designCreating both fixed and interactive data visualizations |
|
Data Science In Practice: Responsible Practices | |
Relevant domain knowledge for effective decision-makingUnderstanding the specific field or industry context |
|
Legal considerationsUnderstanding legal requirements and constraints in data use |
|
Data privacyProtection of personal and sensitive information in datasets |
|
Data and product/system security and resilienceEnsuring robust protection against threats |
|
Data and product/system governanceOversight and management of data systems |
|
Research integrityAdherence to ethical principles in research conduct |
|
Assessment, monitoring, and management of risksSystematic approach to identifying and controlling risks |
|
Understanding and uncovering biasIdentifying and addressing systematic errors in data and analysis |
|
Interpretability and ExplainabilityMaking complex models understandable to humans |
|
Human impacts of designConsidering how design decisions affect people and communities |
|
Responsible data collectionEthical approaches to gathering data from individuals and communities |
|
Understanding impacted communitiesRecognizing how data work affects different groups of people |
|
Data Science In Practice: Effective Collaboration | |
Working with stakeholdersCollaborating effectively with various project participants |
|
Working with domain expertsPartnering with subject matter experts |
|
Project managementPlanning, executing, and controlling project activities |
|
Infrastructure cost and benefitsEvaluating financial and operational aspects of technology infrastructure |
|
Participatory research / stakeholder engagementIncluding community members in research processes |
|
Data Science In Practice: Communication | |
Technical writing skillsCommunicating complex technical information clearly |
|
Communication (oral) and presentation skillsEffectively presenting information to audiences |
|
DocumentationCreating clear and comprehensive project documentation |
|
Categories: Data Science, MSDS
The first event for this year’s DMC Lecture series features a Ramapo Computer Science alum – Save the date!
Tuesday September 9th, 2025
ASB 429 @ 5:30pm
Artificial intelligence is changing how we work, moving us from predictable processes to systems that operate on probabilities and uncertainty. This talk explores what this shift means for students entering the workforce. Devinder will examine how humans and AI collaborate in practice, what skills are becoming essential, and how to make responsible decisions when working with uncertain technologies. Students will learn practical strategies for thriving in careers where adaptability and critical evaluation of AI outputs are key professional skills.
Speaker Bio
Devinder Sodhi is a Ramapo College Computer Science graduate (2016), who currently shapes AI curriculum development at DataCamp and leads technology communities at Frontier Tower and AICamp SF. His engineering background spans diverse industries, including jet engine testing systems at Qt, surgical robotics at Johnson & Johnson, and medical imaging platforms at Canfield Scientific. Sodhi focuses on practical AI implementation and education, teaching professionals how to effectively integrate AI tools into their workflows while maintaining critical oversight of probabilistic outputs.
Categories: Data Science, Lecture Series, MSCS, MSDS
Date/Time: Wednesday, 4/23/25, 5-7 pm
Location: Ramapo College, Trustees Pavilion
Join us for the Center for Data, Mathematical, and Computational Sciences (DMC) Fair on April 23th, 2025.
Following the keynote speech, attendees can view posters showcasing Ramapo students’ research projects in Data, Mathematical, and Computational Sciences. Attendees will be able to network with Ramapo students and faculty as well as industry professionals. Awards will be given for the best posters and refreshments will be served.
In this presentation, Michael Geraghty, Chief Information Security Officer (CISO) for the State of New Jersey will bridge the gap between academic data science concepts and their real-world applications in the field of cybersecurity. Drawing from extensive industry experience, Mr. Geraghty will demonstrate how data science techniques are applied in threat detection, risk assessment, and incident response in enterprise environments. Through case studies and practical examples, attendees will gain insights into the challenges and opportunities of applying data science in a cybersecurity context. By the end of the session, attendees will have a clearer understanding of how their data science skills can be applied to solve real-world cybersecurity problems, as well as the career paths available in this rapidly evolving field. The talk will conclude with a Q&A session.
Michael Geraghty is the State of New Jersey’s Chief Information Security Officer (CISO) and Director of the New Jersey Cybersecurity and Communications Integration Cell (NJCCIC). In these roles Director Geraghty is responsible for the development and execution of the State’s cybersecurity strategy. He is responsible for leading and coordinating New Jersey’s cybersecurity efforts while building resiliency throughout the State and has direct responsibility for all aspects of statewide cybersecurity operations; governance, risk and compliance; and incident response.
Mr. Geraghty is an accomplished cybersecurity executive with a history of building innovative and model programs in private and public sector enterprises including roles as CISO of the Hudson’s Bay Company, Chief Information Officer of the National and International Centers for Missing and Exploited Children, Vice President of High Technology Investigations at Prudential Financial, and Network Intrusion Detection Manager, Lucent Technologies/Bell Labs. Mr. Geraghty began his career with the New Jersey State Police, where he served 12 years and led the formation and development of its High Technology Crimes Investigations Unit.
He has provided expert testimony before the United States Congress and in federal, state, and international courts on computer crime investigations and forensics. Geraghty is a past president of the Northeast Chapter of the High Technology Crimes Investigation Association and has held leadership roles in the National Strategic Policy Council on Cyber and Electronic Crime.
Categories: Data Science, Lecture Series, Mathematics, MSCS, MSDS
AI implementation in Finance operations
Speaker: Larry Shapiro from PwC
When: Monday 11/11 at 6:30pm – Room ASB 136
Topic: AI implementation in Finance operations: Larry Shapiro leads the Global Oracle ERP implementation at PwC including application of AI to finance operations and engagement management. Larry’s talk will resemble a real life case study on how leaders in large global enterprises embrace AI.
Some of the topics he will cover include:
Leadership Concerns:
Ethical Considerations:
Strategic Imperatives:
Implementation Challenges:
Please contact Jane Riff ( jriff@ramapo.edu ) for more information.
Categories: Data Science, Lecture Series, Mathematics, MSCS, MSDS
Bjarne Stroustrup, inventor of the C++ language and a pioneer in the field of Computer Science, is coming to Ramapo College on March 13th at 3:30 pm. Join us to learn about the past, present, and future of the language, and Dr. Stroustrup’s personal insights on its impact on the field of Computer Science . Dr. Stroustrup is currently a Professor of Computer Science at Columbia University in New York, NY.
All are welcome! Food will be available, along with a raffle for Google earbuds – courtesy of the Computer Science Club and co-sponsors. Check back here for room information as we get closer to the event.
Categories: Data Science, Lecture Series, MSCS, MSDS
The Fall 2023 Thesis Information session was held November 6th, if you missed it – here’s the link to the slides from the presentation. Reach out to Dr. Frees or Dr. Beecher for more information!
For graduate students looking to complete their Thesis in Spring 2024, you should be getting started with your thesis proposal – or at least starting to bounce ideas off potential thesis advisors!
For more information, take a look at the Thesis Handbook and Thesis Archive page as well.
Tuesday, April 4, 5:00 PM – 6:30 PM – SC-158 (Alumni Lounges)
Please join us for a talk by MSDS Alum Keith Osani, VP of Global Technology Data & Analytics at Sony Music!
This session will discuss the type of data available to music companies from digital platforms such as Spotify, Apple Music, Amazon, YouTube, TikTok and more. We will also explore the increasing volumes of data and the data engineering required to handle it all. Insight will also be given into the tools that data analysts are using to access the data and what types of analysis they are performing.
Categories: Data Science, Lecture Series, MSDS
Thursday, March 30, 2023 7-8pm – ASB 323
Please join us for a DMC Lecture Series event: Decoding Data Science: A secret recipe for success, given by Dr. Kanad Basu, Data Science Leader & Applied Mathematician!
Abstract of talk:
Data Science has completely transformed all major business sectors such as IT, Banking, Healthcare, Entertainment, Sports & much more. In order to make better & smarter decisions, businesses all over the world have long gathered & analyzed data. The global data science platform market was worth US $ 3.93 billion in 2019 and US $ 4.89 billion in 2020. From 2020 to 2027, it is predicted that the Data Science market will increase at a compound annual rate of 26.9 percent. There has never been a better time to study analytics & data science and contribute to this exciting field.
In this talk Kanad will share his transition journey, lessons picked up along the way, the skills one must develop for being a successful data scientist and how to prepare & market yourself for the job market.
About the author:
Kanad Basu, an interdisciplinary scientist who has 18+ years of analytics and data science experience spanning across academia and industry. Kanad holds the prestigious “Professor of Practice” position at Thunderbird School of Global Management, Arizona State University. He also led a market research data science team at Medallia that focuses on building a consumer behavioral intelligence and benchmarking platform that combines millions of consumer activities to allow businesses to understand where, how, and why consumers spend their time and money. Before Medallia, he led the computer vision data science team at Covisus where he and his team implemented state of the art computer vision algorithms to keep our nation safe by securing supply chains, protecting global compliance for pharmaceutical and medical industries and aiding brands in the prevention of product tampering.
Kanad holds a PhD in Applied & Computational Mathematics, has multiple publications, patents & book chapters around big data analytics and statistical applications in the field of Mathematical Biology & Mathematical Finance. Kanad is a frequent speaker at national & international conferences in AI & Data Science.
Categories: Data Science, Lecture Series, MSCS, MSDS
Tuesday, February 21, 2023 – Via WebEx
Please join us for a DMC Lecture Series event: Maximum Likelihood Estimation for Discretely Observed Multivariate Vasicek Processes, given by Dr. Michael Pokojovy, Assistant Professor, Department of Mathematical Sciences, PhD Program in Data Science, PhD Program in Computational Science, The University of Texas at El Paso (UTEP)!
This talk will discuss analyzing multiple zero-coupon bonds simultaneously to forecast future short rate dynamics. Those forecasts can play important roles in risk management, portfolio optimization, and other applications.
Because of low correlation with other asset classes, bonds play major roles in portfolio diversification efforts. A pure discount or a zero-coupon bond is a contract that does not involve intermediate interest payments but is traded at a deep discount, rendering yields at maturity when redeemed at full face value. As investment funds can create robust diversified portfolios with bonds, it is imperative that multiple bonds be analyzed simultaneously. The classical Vasicek model studies individual zero-coupon bonds and assumes the instantaneous interest rate follows a mean reverting process. In this talk, we consider an extension of the original Vasicek model to multiple zero-coupon bonds. The resulting coupled model is given by a stochastic differential equation driven by a p-dimensional white noise process. Given a set of observations over an equispaced time grid, our goal is to calibrate the system and forecast future short rate dynamics. Those forecasts can play important roles in risk management, portfolio optimization, and other applications. This is joint work with Ebenezer Nkum and Thomas M. Fullerton (UTEP).
Michael Pokojovy is an Assistant Professor of Data Science and Statistics. He holds a Ph.D. and a Dipl.-Math. degree in Mathematical Sciences, both from the University of Konstanz, Germany. Prior to his current appointment at UTEP, Dr. Pokojovy has held several postdoctoral positions in Europe and the US. His research interests include Statistical & Machine Learning, Big Data Analytics, Scientific Computing, etc. In addition to numerous theoretical and methodological developments, he has a track record of collaborative research in statistical process control, quantitative finance, engineering, biomedical sciences, rational mechanics, etc. He has authored and co-authored a number of publications in a variety of professional journals. Dr. Pokojovy has also mentored and advised numerous undergraduate, graduate and doctoral students as well as postdoctoral scholars who are now successfully pursuing various exciting careers in industry or academia.
This talk was supported by a grant from the Ramapo College Foundation.
Categories: Data Science, Lecture Series, MSCS, MSDS
Thursday, January 26, 2023 – 7-8pm – Location ASB 323
Please join us for a DMC Lecture Series event on Data Storytelling: The Secret to Delivering Business Impact with Analytics, given by Ganes Kesari, the Co-founder and Chief Decision Scientist at Gramener!
Did you know that over 50% of analytics projects fail due to bad data storytelling? Organizations invest heavily in hiring data science experts, buying expensive licenses, and setting up analytics processes. However, without the right data visualization, all this effort will go to waste. Stories have the power to engage people and inspire them to action. This session will introduce you to the concept of data storytelling. It will reveal the 4 steps to mastering visual storytelling with data using exciting industry examples.
Takeaways:
Ganes Kesari is the Co-founder and Chief Decision Scientist at Gramener , a data science company that helps organizations present data insights as stories. He advises executives on data-driven leadership and helps organizations adopt a data culture. Ganes is a TEDx speaker and Forbes Contributor. Find his latest work here: gkesari.com.
This talk was supported by a grant from the Ramapo College Foundation.
Categories: Data Science, Lecture Series, MSCS, MSDS
Copyright ©2025 Ramapo College Of New Jersey. Statements And Policies. Contact Webmaster.