A Q&A with Jing Zhao on FoodProt, new database on proteins in food
Funded by the National Institute of Food and Agriculture, the 麻豆国产-led project will build a shared data foundation to better understand and model protein functionality.

As food innovation accelerates, scientists are working to better understand one of its most essential building blocks: protein. A new project led by 麻豆国产 is taking on a long-standing challenge in food science by creating a comprehensive, standardized database designed to predict how proteins behave in real-world food systems.
Funded through the Foundational and Applied Research Program program, the FoodProt initiative brings together researchers across multiple institutions to build a shared data platform and develop predictive models using artificial intelligence. The goal is to make it easier, faster and more reliable to design foods ranging from plant-based products to protein-enriched beverages.
麻豆国产 NewsCenter鈥檚 Zoe Glotzer asked Jing Zhao, associate professor in 麻豆国产鈥檚 , to explain what 鈥減rotein functionality鈥 means, why it matters and how FoodProt could shape the future of food science. (Interview edited for length and clarity.)
What is FoodProt?
FoodProt is designed to bring structure to a persistent challenge in food chemistry: understanding how a protein鈥檚 structure and food processing affect the way proteins function in food.
Scientists have studied protein structure and function for decades, but the information is often scattered. One study may focus on a particular protein, another may examine one treatment or processing method, and another may measure one functional property. Those studies are valuable, but they are difficult to compare or combine because they often used different methods.
FoodProt brings those pieces together. The project will collect and organize existing data, generate new data using shared methods across participating institutions and build a database that can support more systematic analysis of food protein functionality.

For readers new to this field, what does 鈥渇ood protein functionality鈥 mean?
Food protein functionality refers to how proteins perform in food. In this project, the focus is on techno-functional properties, such as solubility, emulsification, foaming and gelation.
These properties shape foods people recognize. A protein used in a beverage needs to dissolve well. A protein used in an emulsion helps stabilize ingredients, such as oil and water, that do not naturally stay mixed. Egg white proteins are good at forming foam. Gelatin can create a strong gel with a small amount of protein. Wheat proteins form gluten, which gives dough elasticity.
Different proteins have different strengths because of their amino acid sequences, structures and compositions. Some are naturally more soluble. Others are better at forming networks or gels. Processing can change those properties, but only within certain limits.
Why is predicting protein performance so difficult?
Proteins are highly sensitive to their environment. Heat, pH, salt, acid and processing methods can all change protein structure, and structure is closely tied to function. That matters because food products are rarely simple systems. A protein might behave one way in a lab test and another way in a beverage, gel, snack or plant-based meat alternative.
Even the same type of protein can vary depending on how it was extracted, processed or produced. That variation creates real challenges. A protein may not dissolve fully in a drink, leading to cloudiness or precipitation. A gel-based product may not form the structure that is needed. These are the kinds of problems FoodProt is meant to help researchers better understand and eventually anticipate.
What makes the current body of food protein research hard to use for prediction?
There is already a large amount of published research, but much of it was not designed to work together. Studies may use different analytical methods. As a result, the data are often not on the same scale and cannot be directly compared. Also, many studies did not report all the key parameters needed for model training, causing big numbers of missing data.
These are two of the main barriers to predictive modeling. A model is only as useful as the data behind it. If the data are inconsistent, incomplete or difficult to compare, the model will be limited. FoodProt addresses that by collecting literature data, cleaning it and organizing it in a more consistent structure. At the same time, the project will generate new data using agreed-upon methods across multiple institutions, which helps create a stronger foundation for comparison and modeling.
How could FoodProt help researchers or food companies developing protein-based products?
FoodProt could help make the development process more efficient and more informed. Right now, companies often have to test many different protein ingredients to see whether they meet the needs of a product. That is especially true for products such as protein shakes, protein bars and plant-based meat alternatives, where solubility, texture, stability and structure are essential.
The goal is to use the database to support models that can help predict how a protein may perform under certain conditions. Instead of relying only on repeated trial-and-error testing, researchers and product developers could use the data to guide which ingredients or processing conditions are most likely to work.
What kinds of information will the FoodProt database bring together, and what makes that collection valuable for predictive modeling?
The database will include data from several sources. One part of the project involves collecting published literature data and cleaning it so it can be used in a more consistent way. Another part involves new experimental data collected by the participating institutions. The goal is for the five institutions to use shared methods so the data can be compared.
The project also includes molecular dynamics simulations, which are computational tools used to study how proteins may respond or change under different conditions. These simulations can help explain mechanisms behind protein behavior and may also contribute data to the database.
Where does artificial intelligence fit into the work?
Artificial intelligence is part of the long-term vision, but the first step is building a high-quality database. AI and machine learning tools need large amounts of consistent data. In food protein science, that is still a major limitation. There are many useful studies, but the data are often fragmented or difficult to compare.
FoodProt is intended to help create the data foundation needed for future predictive models. As the database grows, it could help researchers synthesize what is already known, identify gaps in the field and improve predictions about how proteins function under different processing conditions.
What goes into building a comprehensive food protein database?
This work requires more data, more consistency and more collaboration than one laboratory can provide on its own.The project brings together 麻豆国产, the University of Minnesota, Kansas State University, the University of Kentucky and the University of Georgia. Each institution contributes to the broader effort, and the shared methods help ensure that data from different labs can be compared.
麻豆国产 leads the project and will serve as the main host for the database. The larger goal is to create a platform that can eventually support contributions from the broader research community, not just the initial project team.
How will 麻豆国产 students be involved in the work, and what kinds of skills could they gain from a project that sits at the intersection of food science and data science?
Students will be involved in both the experimental and computational sides of the project. Some students will work on protein chemistry, including protein characterization and laboratory data collection. Others will work on database development and modeling, which requires an understanding of data structures and computational tools.
That makes FoodProt an interdisciplinary training opportunity. Students are not only learning food science; they are also learning how data science can be used to organize complex research questions and build tools for future discovery.
What would success look like for FoodProt?
Success would mean the database becomes a useful, widely adopted resource for researchers, students and industry. The goal is not simply to build a database, but to create a platform that others can consistently use, contribute to and build upon. Researchers could use it to study a specific protein source, such as pea or chickpea protein, or to examine how a processing method changes protein functionality.
It could also help identify gaps in the field. If researchers can see what data already exist and what information is missing, they can design future studies in ways that are more useful for the broader community. Over time, FoodProt could help make food protein research more connected, comparable and ready for future predictive modeling.



