Can machine learning improve trade modelling?

This is the question that Lucien Chaffa, Martin Trépanier and Thierry Warin seek to answer in their latest research. Amidst a flurry of trade policy changes in recent years, trade modelling has become increasingly important for understanding the economic outcomes of shifting policy.

In this regard, the gravity model of trade provides a theoretically intuitive means of explaining bilateral trade while also being empirically robust. It predicts that the volume of trade between two countries is positively related to their economic size and inversely related to the trade costs between them¹. In simple terms, large economies that are geographically close tend to trade more with each other, akin to the gravitational pull between celestial objects.

The model incorporates variables to capture trade costs, such as shared land borders (which reduce transport costs) and common languages (which reduce information costs). Continuous variables such as economic size and distance are typically expressed in logarithmic form, which allows their coefficients to be interpreted as elasticities, i.e., the percentage change in trade associated with a one percent change in the variable.
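As a minimal sketch of the elasticity interpretation, the snippet below generates trade flows from a stylized gravity relationship and recovers the distance elasticity as the slope of a log-log regression. The functional form, the constant, the assumed elasticity of -1, and all numbers are illustrative assumptions rather than values from the study.

```python
import math

def gravity_flow(gdp_i, gdp_j, distance, const=1.0, dist_elasticity=-1.0):
    """Stylized gravity prediction: const * gdp_i * gdp_j * distance**dist_elasticity."""
    return const * gdp_i * gdp_j * distance ** dist_elasticity

def ols_slope(x, y):
    """Slope of a simple least-squares regression of y on x."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var = sum((a - mean_x) ** 2 for a in x)
    return cov / var

# Hypothetical distances (km) for one fixed pair of economy sizes.
distances = [100.0, 200.0, 400.0, 800.0, 1600.0]
flows = [gravity_flow(50.0, 20.0, d) for d in distances]

# Regress log(trade) on log(distance); the slope is the distance elasticity.
log_dist = [math.log(d) for d in distances]
log_flow = [math.log(f) for f in flows]
elasticity = ols_slope(log_dist, log_flow)
print(round(elasticity, 6))
```

Because the synthetic data contain no noise, the slope recovers the assumed elasticity: a one percent increase in distance goes with a one percent fall in trade.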

The issue

As more variables are added to a gravity model, it becomes harder to isolate the unique contribution of each in explaining trade flows. Standard gravity models also assume that relationships are linear, which means they may not capture more complex interactions between variables, such as how the effect of geographic distance varies with the mode of transport used.

Because the logarithm of zero is undefined, log-linear estimation requires dropping observations with zero trade values, which distorts the overall picture. It biases the model toward existing trade relationships and underestimates the barriers that prevent trade from happening in the first place. While methods like Poisson Pseudo Maximum Likelihood (PPML) can address this issue, they may not fully capture the real-world relationships between variables.
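The effect of discarding zero flows can be illustrated with a deliberately simple, intercept-only comparison. The flows below are hypothetical, and the two "models" contain only a constant, so this is a sketch of the mechanism rather than the estimators used in the study.

```python
import math

# Hypothetical bilateral flows (any currency unit); zeros mean no recorded trade.
flows = [0.0, 0.0, 120.0, 0.0, 45.0, 300.0, 0.0, 15.0]

# log(0) is undefined, so a log-linear model can only use the positive flows.
positive = [f for f in flows if f > 0]
dropped_share = 1 - len(positive) / len(flows)

# Intercept-only log-linear fit: the fitted level is the geometric mean
# of the positive flows, so the zero-trade pairs never enter the estimate.
log_linear_level = math.exp(sum(math.log(f) for f in positive) / len(positive))

# Intercept-only Poisson pseudo-likelihood fit: the first-order condition
# sum(y - exp(b)) = 0 gives exp(b) = arithmetic mean of ALL flows, zeros included.
ppml_level = sum(flows) / len(flows)

print(f"share of pairs dropped by the log model: {dropped_share:.2f}")
print(f"log-linear fitted level (positives only): {log_linear_level:.1f}")
print(f"PPML fitted level (all pairs): {ppml_level:.1f}")
```

In this toy sample the log-linear fit ignores half of the country pairs and reports a higher typical flow than the Poisson fit, which keeps the zero-trade pairs in the estimate.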

Research and findings

Given the constraints of conventional estimation methods, this study investigates whether machine learning (ML) based methods are better at explaining bilateral trade flows.

The study uses a comprehensive dataset on trade among ten Canadian provinces and fifty U.S. states to compare traditional and ML-based approaches applied to two versions of the gravity equation. Two sets of estimations are conducted, one that includes zero trade values and one that excludes them.

The comparison between the approaches evaluates the predictive accuracy, robustness to zero trade values, and interpretability of results. This makes it possible to assess the strengths and limitations of each estimator.

Performance Comparison of Estimation Approaches
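Gravity Equation	Trade Values	Estimation Approach	Estimation Model	RMSE	MAE	R²
1	Non-zero flows only	Traditional	OLS	1.87	1.29	0.70
1	Non-zero flows only	Traditional	PPML	4228.45	895.38	0.17
1	Non-zero flows only	Traditional	GPML	3053.94	596.24	0.65
1	Non-zero flows only	Traditional	NBPML	3340.50	636.39	0.65
1	Non-zero flows only	Machine Learning	Random Forest	1.49	1.05	0.81
1	Non-zero flows only	Machine Learning	XGBoost	1.33	0.94	0.85
1	Non-zero flows only	Machine Learning	Neural Network	1.83	1.20	0.73
1	All flows	Traditional	PPML	3974.63	930.28	0.22
1	All flows	Traditional	NBPML	4661.11	852.79	0.82
1	All flows	Machine Learning	Random Forest	2678.07	689.71	0.53
1	All flows	Machine Learning	XGBoost	2190.64	748.74	0.68
1	All flows	Machine Learning	Neural Network	3120.31	864.29	0.40
2	Non-zero flows only	Traditional	OLS	1.47	1.05	0.83
2	Non-zero flows only	Traditional	PPML	4271.08	1032.07	0.20
2	Non-zero flows only	Traditional	GPML	84051.74	8377.11	0.53
2	Non-zero flows only	Traditional	NBPML	49478.33	4955.34	0.53
2	Non-zero flows only	Machine Learning	Random Forest	1.32	0.91	0.86
2	Non-zero flows only	Machine Learning	XGBoost	1.27	0.88	0.87
2	Non-zero flows only	Machine Learning	Neural Network	1.33	0.91	0.86
2	All flows	Traditional	PPML	3972.85	1072.20	0.18
2	All flows	Traditional	NBPML	30867.10	5462.12	0.68
2	All flows	Machine Learning	Random Forest	2624.04	730.89	0.54
2	All flows	Machine Learning	XGBoost	2169.56	714.91	0.70
2	All flows	Machine Learning	Neural Network	3033.20	869.84	0.43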

Note: Lower RMSE and MAE values indicate higher predictive accuracy, while a higher R² reflects a better model fit.
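For readers unfamiliar with these metrics, the following sketch shows how they are commonly computed. The observed and predicted values are hypothetical, and R² is taken here as one minus the ratio of residual to total sum of squares, which is an assumption: the study may define it differently (for example, as a squared correlation).

```python
import math

def rmse(actual, predicted):
    """Root mean squared error: penalizes large misses more heavily."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

def mae(actual, predicted):
    """Mean absolute error: the average size of a miss."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def r_squared(actual, predicted):
    """Share of the variation in the observed values explained by the predictions."""
    mean_a = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_a) ** 2 for a in actual)
    return 1 - ss_res / ss_tot

# Hypothetical observed and predicted trade flows.
observed = [10.0, 20.0, 30.0, 40.0]
fitted = [12.0, 18.0, 33.0, 39.0]

print(f"RMSE = {rmse(observed, fitted):.3f}")
print(f"MAE  = {mae(observed, fitted):.3f}")
print(f"R2   = {r_squared(observed, fitted):.3f}")
```

RMSE and MAE are expressed in the units of the variable being predicted, so they are only comparable across models that predict the same quantity on the same scale.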

As shown in the table above, ML-based approaches generally outperform traditional methods in predicting trade flows, demonstrating a stronger ability to capture complex patterns: XGBoost records the lowest RMSE in all four configurations. This holds true even when zero trade values are included, though the fit of the ML models declines under both versions of the gravity equation.

This suggests that ML-based approaches are effective at capturing nonlinearities and intricate interactions without imposing a functional form. However, this advantage diminishes when interpretability is required for analysis and policy use.

Key takeaways

This study finds that while ML-based approaches outperform traditional models like PPML, their limited interpretability reduces their usefulness for policy design. PPML remains reliable even when handling zero trade values and offers clearer insights into how variables affect trade flows. However, ongoing advancements in ML algorithms are likely to improve the interpretability of their results in the near future.

For policymakers who need robust forecasting tools, ML-based approaches can complement traditional models, especially in projecting the impact of changes to trade policies or regional trade agreements. These results highlight the importance of integrating ML tools thoughtfully into trade policy analysis to balance predictive power with interpretability.

Read the working paper: https://doi.org/10.54932/BFKY4995

To cite:

Chaffa, L., Trépanier, M., & Warin, T. (2025). Beyond PPML: Exploring Machine Learning Alternatives for Gravity Model Estimation in International Trade (2025s-14, Working Papers, CIRANO). https://doi.org/10.54932/BFKY4995

¹ Piermartini, R., & Teh, R. (2005). Demystifying modelling methods for trade policy (WTO Discussion Paper 10). World Trade Organization (WTO). https://hdl.handle.net/10419/107045