Publications
Research Publications
2023
- Using topic-noise models to generate domain-specific topics across data sources (January 2023)
- Measuring Candidate Ideology from Congressional Tweets and Websites (February 2023)
- The Civil Justice Data Gap (February 2023)
2022
- Misinformation About COVID-19 and Venezuelan Migration: Trends in Twitter Conversation During a Pandemic (January 2022)
- Students or Mechanical Turk: Who are the more reliable social media data labelers? (January 2022)
- Inferring #MeToo Experience Tweets using Classic and Neural Models (January 2022)
- Environmental change and human mobility: Opportunities and challenges of big data (April 2022)
- A Guided Topic-Noise Model for Short Texts (April 2022)
- Dynamic Topic-Noise Models for Social Media (May 2022)
- Civil Court Data at the Local Level: Interviews and Insights from Four Locations (May 2022)
- Traditional and context-specific spam detection in low resource settings (June 2022)
- PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter (June 2022)
- Parenting online: analyzing information provided by parenting-focused Twitter accounts (June 2022)
- The Diabetes Prevention Gap And Opportunities To Increase Participation In Effective Interventions (June 2022)
- Data Commons Models (June 2022)
- Population health science as a unifying foundation for translational clinical and public health research (June 2022)
- Delteil’s Les Écœurés: Gilets Jaunes and the Limits of the Noir (July 2022)
- DeMis: Data-efficient Misinformation Detection using Reinforcement Learning (July 2022)
- Privacy Preserving Technologies in Education (July 2022)
- Do Shallow Rental Subsidies Promote Housing Stability? Evidence on Costs and Effects from DC’s Flexible Program (July 2022)
- Assessing Social Media Data as a Resource for Firearm Research: Analysis of Tweets Pertaining to Firearm Deaths (August 2022)
- Do Shallow Rental Subsidies Promote Housing Stability? Evidence on Costs and Effects from D.C.’s Flexible Rent Program (September 2022)
- The Evolution of Topic Modeling (November 2022)
- Trends and Race/Ethnic Disparities in Diabetes-Related Hospital Use in Medicaid Enrollees: Analyses of Serial Cross-sectional State Data, 2008–2017 (November 2022)
- Death, Inequality, and the Pandemic in the Nation’s Capital (December 2022)
2021
- Rethinking Payment for Prevention in Healthcare (January 2021)
- A Comparative Analysis of Classic and Deep Learning Models for Inferring Gender and Age of Twitter Users (January 2021)
- textPrep: A Text Preprocessing Toolkit for Topic Modeling on Social Media Data (January 2021)
- A Big-Data Approach to Contemporary French Politics (February 2021)
- Text Analytic Research Portals: Supporting Large-Scale Social Science Research (March 2021)
- Science Research Using Social Media Data (March 2021)
- Data Acquisition, Sampling, and Data Preparation Considerations for Quantitative Social Science Research Using Social Media Data (March 2021)
- Analyzing the impact of missing values and selection bias on fairness (May 2021)
- Sharing Sensitive Department of Education Data Across Organizational Boundaries Using Secure Multiparty Computation (May 2021)
- #BlackLivesMatter: From the Protest to Policy (June 2021)
- Knowledge Enhanced Masked Language Model for Stance Detection (June 2021)
- #BlackLivesMatter—Getting from Contemporary Social Movements to Structural Change (June 2021)
- Modeling Considerations for Quantitative Social Science Research Using Social Media Data (June 2021)
- Estimating risk factor progression equations for the UKPDS Outcomes Model 2 (August 2021)
- DC Flexible Rent Subsidy Program: Findings from the Program’s First Year (August 2021)
- Migration Misinformation in Spanish-language Tweets during a Pandemic (October 2021)
- Age Inference Using A Hierarchical Attention Neural Network (October 2021)
- Topic-Noise Models: Modeling Topic and Noise Distributions in Social Media Post Collections (December 2021)
- Research note: Lies and presidential debates: How political misinformation spread across media streams during the 2020 election (December 2021)
2020
- Model Data Use Agreements: A Practical Guide (January 2020)
- Words that matter: How the news and social media shaped the 2016 Presidential campaign (May 2020)
- Co-occurrence of diabetes and depression in the U.S. (June 2020)
- Identifying Meaningful Indirect Indicators of Migration for Different Conflicts (August 2020)
- Percolation-Based Topic Modeling for Tweets (August 2020)
- Information Exposure From Relational Background Knowledge on Social Media (October 2020)
- Understanding high- and low-quality URL Sharing on COVID-19 Twitter streams (November 2020)
- Social Media Data - Our Ethical Conundrum (December 2020)
2019
- #MeToo as Catalyst: A Glimpse into 21st Century Activism (January 2019)
- #Diversity: conversations on Twitter about women and Black men in medicine (January 2019)
- Linking Survey and Administrative Data to Measure Income, Inequality, and Mobility (January 2019)
- Worst Case Scenarios in Sharing Large-scale Sensitive Data (January 2019)
- Postsecondary Data Infrastructure: What is Possible Today (June 2019)
- Blending Noisy Social Media Signals with Traditional Movement Variables to Predict Forced Migration (July 2019)
- Big data and early warning of displacement (September 2019)
- Exploring the Relationship Between Conversation Using #MeToo and University Harassment Policies (October 2019)
2018
- Data analytics and displacement: Using big data to forecast mass movement of people (January 2018)
- A Temporal Topic Model for Noisy Mediums (June 2018)
- Developing, Validating, and Obtaining Stakeholder Buy-In for Criteria for Applying Social Science to Policymaking (November 2018)
- Detecting and Using Buzz from Newspapers to Understand Patterns of Movement (December 2018)
2017
- Using Network Flows to Identify Users Sharing Extremist Content on Social Media (April 2017)
- Location-based Event Detection Using Geotagged Semantic Graphs (August 2017)
- Using semantic graphs to detect overlapping target events and story lines from newspaper articles (November 2017)
- Understanding the impact of sampling and noise on detecting events using Twitter; Modeling Social Preferences Based on Social Interactions (December 2017)
- EOS: A multilingual text archive of international newspaper & blog articles (December 2017)
2016
- Identification of Extremism on Twitter (August 2016)
- Overlapping Target Event and Storyline Detection of Online Newspaper Articles (October 2016)
White Papers
2020
- A first look at COVID-19 information and misinformation sharing on Twitter (March 2020)
- Data blending: Haven’t we been doing this for years? (April 2020)
- Study Designs for Quantitative Social Science Research Using Social Media (July 2020)
- Measurement Considerations for Quantitative Social Science Research Using Social Media Data (December 2020)