AI-Accelerated News Media
My research interests lie in the realization of AI-accelerated news media. This involves creating new experiences and improving operational efficiency in all stages of news production. In particular, we focus on exploring opportunities and challenges for pre-trained models in news media. Furthermore, I am involved in leveraging data science competitions as a means to promote the democratization of data science.
Building and Applying Information Technology Such as Pre-trained Models
There are many opportunities to apply information technology within the business processes of news media. As in-house engineers, we collaborate closely with editors. Some of our work is published externally in the form of press releases and research papers.
- Domain-specific pre-trained models Press release
- Improving operational efficiency:
- News summarization Journal of NLP
- Entity linking system Journal of NLP
- Creating new experiences:
- Service & User analysis:
- Effects of delivery formats IC2S2 2026
- Semantic shift analysis IC2S2 2023
- Reading time estimation BigData 2022 Industrial & Government Track
- Revisiting news recommendation NLDB 2025
Quantifying memorization of LLMs
The memorization of training data by large language models (LLMs) is recognized as a significant challenge related to copyright, security, and sound evaluation. We are conducting empirical experiments to quantify memorization, primarily using Japanese newspaper articles. We have also published survey papers and released a library for quantification.
- Survey paper ACL 2023 Workshop Transactions of JSAI
- Benchmarking membership inference attacks ACL 2026 Demo
- Experiments on Japanese newspaper INLG 2024 ACL 2025 Workshop
- Monitoring time-series performance degradation AACL-IJCNLP 2022 Journal of NLP
Promoting data science competitions
Data science competitions, such as those on Kaggle, are a powerful tool for the social implementation and democratization of information technology. Through publishing books and organizing competitions, we are dedicated to promoting this approach.
Reference
You can see Publications for more details.