Work by Sinclert Perez, Quynh M. Nguyen, and Kyle Cranmer.

This entry is a part of the NYU Center for Data Science blog’s recurring guest editorial series. Sinclert Perez is a CDS research software engineer. Quynh M. Nguyen recently graduated from NYU. He completed his PhD research in the Applied Math Laboratory at NYU’s Courant Institute of Mathematical Sciences Kyle Cranmer is a CDS professor of data science and professor of physics at the NYU College of Arts & Science.

Sinclert Perez (left), Quynh M. Nguyen (center), Kyle Cranmer (right)

Introduction

English speakers in the same local area tend to use the same word for a certain concept (i.e. “coke”…


We have arrived, once again, at graduation season here at NYU and this graduation season is incredibly special for us at CDS. We are celebrating our first two PhD graduates and our first graduates from our Data Science undergraduate major! It makes us so happy to see these programs deliver their first degrees to such amazing students.

It is a major achievement to complete the requirements for a degree of any level. This is why we want to not only congratulate our students for graduating but also persevering during an incredibly trying time. At CDS, we have always known our…


Anton Strezhnev, CDS Moore-Sloan Fellow, has recently co-authored “Leases as Forms” alongside University of Pennsylvania Carey Law School professor David A. Hoffman. The paper, which is the first large-scale descriptive study of residential leases, was featured in MarketWatch this past March.

CDS Moore-Sloan Fellow Anton Strezhnev

The team’s research is based on a dataset of ∼170,000 residential leases filed in support of over ∼200,000 Philadelphia eviction proceedings that occurred from 2005 through 2019. The team found that these leases are quite likely to include terms that are unenforceable and generally contain worse rules for tenants over time. They argue that much of this trend has…


Our PhD program was established almost four years ago in the Fall of 2017. During those four years two students have worked tirelessly in their studies and their research. This Spring, those students will be our first graduates from the Center for Data Science to be awarded a PhD. It is with the utmost excitement that we congratulate both Leslie Huang and Vlad Kobzar on their amazing achievement!

Leslie has been working on interdisciplinary research, advised by CDS faculty Arthur Spirling, that focuses on developing natural language processing methods to study political discourse. We spoke to Leslie about her time…


We’re excited to once again bring you our Data Science for Everyone course over the summer. The course’s summer session 1 will take place from May 24 thru July 5. Registration is currently open, and the application deadline is May 27 for NYU students and May 12 for non-NYU students.

So what is Data Science for Everyone all about? The course was first offered in the spring 2019 semester. Initially tailored for a smaller classroom setting, the original cohort was just 31 students. As it turned out, demand was high with the course having a sizable waiting list even that…


The openclean Open-Source Data Cleaning Library: A unified framework for data wrangling

Joint work by Heiko Müller, Sonia Castelo, Munaf Qazi, and Juliana Freire

This entry is a part of the NYU Center for Data Science blog’s recurring guest editorial series. Heiko Müller is a CDS research engineer. Sonia Castelo and Munaf Qazi are research engineers at NYU Tandon School of Engineering. Juliana Freire is a CDS joint faculty member and Professor of Engineering at NYU Tandon School of Engineering.

Data preparation is still a major bottleneck for many data science projects. A frequently cited survey in 2016 found that…


At CDS, we’re proud of every contribution made by our students and faculty to their respective fields. When those contributions are recognized outside of our community it is cause for celebration. This is especially the case for the recent awarding of the Samsung Ho-Am Prize to CDS faculty member Kyunghyun Cho.

For those unfamiliar with the honor, the Samsung Ho-Am Prize was established in 1990 by Kun-Hee Lee, the Chairman of Samsung at the time, to “create a new corporate culture that continues the noble spirit of public service espoused by Byung-chull Lee, founder of Samsung.” …


It’s been more than a full year since the beginning of the SARS-CoV-2 pandemic and we’ve all had to find ways to go on with our lives in the safest way possible. Life at CDS is no exception. Over the past year we’ve implemented a hybrid instruction model that integrated online delivery (like last summer’s asynchronous Data Science for Everyone) with socially distanced in-person lectures.

One of our flagship courses, Intro to Data Science, has had great success in the socially distanced format. …


Last week, Kyle Cranmer, CDS professor of data science and professor of physics at the NYU College of Arts & Science gave a talk at AISTATS 2021, the 24th International Conference on Artificial Intelligence and Statistics. Kyle was joined by Emmanuel Candès, the Barnum-Simons chair in mathematics and statistics and professor of mathematics, of statistics, and of electrical engineering at Stanford University as well as Bin Yu, professor of of statistics and electrical engineering and computer sciences at UC Berkeley.

Professor Kyle Cranmer

The conference was held virtually from Tuesday, April 13 to Thursday, April 15, 2021. Presented by the Society of AI…


CDS, in collaboration with our partner Capital One, jointly held the Capital One/CDS CURP Information Session on April 9, 2021 via Zoom. CURP is the CDS Undergraduate Research Program in partnership with the National Society of Black Physicists (NSBP). The event was hosted by Capital One machine learning engineers Anirudh Sridhar and Rachana Balasubramanian and facilitated by CDS staff members Riya Sarkar, research scientist and Loraine Nascimento, head of external relations & career development. In attendance were the students of the CDS Capital One CURP Spring 2021 Cohort.

Anirudh and Rachana spent much of the session providing valuable information about…

NYU Center for Data Science

Official account of the Center for Data Science at NYU, home of the Undergraduate, Master’s, and Ph.D. programs in Data Science.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store