Skip to main content

Byron Wallace

Assistant Professor

Director - BS in Data Science Program


Office Location

177 Huntington Avenue
2208 – 177
Boston, MA 02115

Mailing Address

Northeastern University
ATTN: Byron Wallace, 2210-177
Boston, MA 02115

Research Interests

  • Data mining
  • Machine learning
  • Natural language processing


  • PhD in Computer Science, Tufts University
  • BSCS, University of Massachusetts at Amherst


Byron Wallace is an assistant professor in the Khoury College of Computer Sciences at Northeastern University. He earned his PhD from Tufts University in 2012, at which point he joined Brown University as research faculty. He joins Northeastern from the University of Texas at Austin, where he was an assistant professor in the School of Information from 2014-2016.

Byron’s research areas include artificial intelligence, data science, machine learning, natural language processing, and information retrieval, with emphasis on applications in health informatics. Byron is a member of the applied machine learning group and the Data Science and Analytics Lab at Northeastern.

Much of Byron’s work has concerned developing machine learning and natural language processing methods that make synthesizing the vast biomedical evidence-base more efficient. He also works on core machine learning and natural language processing methods. Some of his recent work concerns Convolutional Neural Network (CNN) architectures for text. And he has recently been developing hybrid, interactive human/machine learning systems that aim to robustly combine human and machine intelligence.

Byron’s work has been supported by grants from the Army Research Office (ARO), the National Institutes of Health (NIH), and the National Science Foundation (NSF). He won the Tufts University 2012 Outstanding Graduate Researcher award and his thesis work was recognized as The Runner Up for the 2013 ACM Special Interest Group on Knowledge Discovery and Data Mining (SIG KDD) Dissertation Award. He recently co-authored the winning submission for the Health Care Data Analytics Challenge at the 2015 IEEE International Conference on Healthcare Informatics.


Leyden, Massachusetts


Outside of research, I do a fair amount of reading (both fiction and non-fiction). I’m a consistent (if mediocre) runner, and I like to ski. I also enjoy coffee and craft beer.

What’s one problem you’d like to solve with your research/work?

I’d hope that my work contributes to allowing us to realize better healthcare by processing and making sense of the vast amounts of health-related data that currently exists in unstructured formats like text. For example, one of my major points of focus has been building machine learning systems that enable researchers to make sense of the torrential volume of biomedical evidence now being published in biomedical journals. A lot of my work on methods in machine learning and natural language processing is therefore motivated by this aim.

More broadly, I hope my research contributes to continued progress toward teaching machines to make sense of language and text.

What aspect of what you do is most interesting?

Working closely with domain experts on interdisciplinary problems is hugely interesting to me, because it exposes a whole new set of questions and perspectives. I find that it also leads to novel work that could not be pursued within a single, narrowly defined discipline.

On a similar note, I personally find it most compelling when real-world problems directly motivate interesting methodological work (as opposed to developing new models or methods that address theoretical or abstract problems). I therefore tend to approach research from the perspective of addressing practical challenges. This comes with its own set of challenges, but I find it very rewarding because it means the research has immediate impact.