Loading Events

« All Events

  • This event has passed.

August 12 4:00 pm - 5:00 pm EDT

Title: Challenges of {BIG, small, Right} Data

Date: Wednesday, August 12

Time: 4:00pm – 5:00pm EDT // 1:00pm – 2:00pm PDT


Big data is trendy, but there are many possible interpretations of its real impact, as well as the opportunities, risks and technological challenges. We will start with two key questions: can a company use big data? If so, should it? The opportunities are clear, while the challenges are many, including scalability, bias, and privacy on the problem side, as well as transparency, explainability, and ethics on the machine learning side. So we perform an analysis that includes all the data pipeline process. At the end we will conclude that what is important is the right data, not big data. In fact, the real challenge today, is machine learning for small data.

About the Speaker

Ricardo Baeza-Yates is the director of data science at Northeastern University Silicon Valley and part-time professor of the practice. He is also the CTO of NTENT, a semantic search technology company based in California since June 2016. Prior to these roles, he was the VP of Research at Yahoo Labs, based in Sunnyvale, California, from August 2014 to February 2016. Before joining Yahoo Labs in California, he founded and led the Yahoo Labs in Barcelona and Santiago de Chile from 2006 to 2015. Between 2008 and 2012, he oversaw Yahoo Labs in Haifa, Israel, and started the London lab in 2012.

Baeza-Yates is a part-time professor at the Department of Information and Communication Technologies of the Universitat Pompeu Fabra in Barcelona, Spain, as well as at the Department of Computing Science of Universidad de Chile in Santiago. During 2005, he was an ICREA research professor at Universitat Pompeu Fabra. Until 2004, he was a professor and founding director of the Center for Web Research at Universidad de Chile.

Additionally, Baeza-Yates is a co-author of the best-seller Modern Information Retrieval textbook, published in 1999 by Addison-Wesley, with a second enlarged edition in 2011, which won the ASIST 2012 Book of the Year award. He is also a co-author of the second edition of the Handbook of Algorithms and Data Structures, Addison-Wesley, 1991, and co-editor of Information Retrieval: Algorithms and Data Structures, Prentice-Hall, 1992, among more than 600 other publications.

From 2002 to 2004 he was elected to the board of governors of the IEEE Computer Society, as well as to the ACM Council from 2012 to 2016. He has received the Organization of American States award for young researchers in exact sciences, the Graham Medal for innovation in computing given by the University of Waterloo to distinguished alumni, the CLEI Latin American distinction for contributions to CS in the region and the National Award of the Chilean Association of Engineers, among other distinctions. In 2003, he was the first computer scientist to be elected to the Chilean Academy of Sciences and, since 2010, is a founding member of the Chilean Academy of Engineering. In 2009, he was named ACM Fellow and, in 2011, an IEEE Fellow.

Registration is required – Register Here!

If you have any questions, please reach out to Khoury-GradAdministration@northeastern.edu.


August 12
4:00 pm - 5:00 pm
Event Categories:
, , ,


Boston, San Francisco, Seattle, Silicon Valley, Toronto, Vancouver