1 Introduction

The impacts of digitalization and artificial intelligence (AI) technologies on labor markets are multifaceted. Workers performing predominantly work activities that can be automated are at risk of being displaced by digital machines. However, occupations combining activities that cannot be automated with those that can are likely to be transformed. Workers in these occupations may benefit from working closely with new digital technologies rather than being displaced by machines (Acemoglu and Restrepo 2018; Lane and Saint-Martin 2021).

Prior research has investigated the impact of new digital technologies on occupations primarily in the United States (Frey and Osborne 2017; Brynjolfsson et al. 2018; Felten et al. 2019; Acemoglu et al. 2020; Fossen and Sorgner 2021, 2022) and in some cases in other developed countries (Arntz et al. 2016, 2017; Georgieff and Hyee 2021). These papers develop measures of the impact of digitalization on occupations in these countries and proceed by testing effects on wages and unemployment (Felten et al. 2019; Fossen and Sorgner 2022). Few papers in the literature investigate the impacts of digitalization in developing countries. Carbonero et al. (2020) evaluate the impacts of robotization on employment in supply chains in developing countries. Aly (2022) looks at various digitalization indices in developing countries and their associations with macroeconomic variables including employment. Although many developing countries, including some of the world’s poorest, are already using basic AI technologies, for instance, in smart farming, credit scoring and targeted advertising, advanced AI technologies are not yet widely adopted there. Yet, there exists a substantial potential for adoption of such technologies to leapfrog traditional development models (IFC 2020; Soto 2020). The use of digital technologies has accelerated in developing and even the poorest countries, not least due to lockdown measures that governments implemented during the COVID-19 crisis. In the service sector in Lao PDR, for example, the lockdowns have led many enterprises to switch to digital processes (Homsombath 2020). Similar efforts were made in the education sectors in which many activities were held online. These developments may have been a trigger for further digitalization efforts in the near future. Research applying occupation-level data for the United States to other countries typically points to a substantial risk of job destruction and an imminent job crisis, especially when analyzing developing countries (Balliester and Elsheiki 2018).

There are several issues that need to be considered when analyzing the impacts of digitalization in the context of developing countries. Applying the occupational digitalization scores computed for the United States in the context of developing countries might lead to significantly biased results, since the occupational tasks in developing countries might differ considerably from the occupational tasks of a similarly coded occupation in the United States (Arntz et al. 2017).Footnote 1 Alternatively, reproducing approaches that assign AI impact scores to occupations based on extensive surveys of AI experts (Frey and Osborne 2017; Brynjolfsson et al. 2018) in developing countries would be very costly. Several studies have therefore relied on correction procedures. In particular, Arntz et al. (2016, 2017) adjust occupation-level computerization risk calculated for the US occupations (Frey and Osborne 2017) by regressing them on individual- and job-specific characteristics from the OECD’s Survey of Adult Skills (PIAAC) or other national surveys available in the US and the country of interest. Then they use the estimated coefficients to make predictions of computerization risk for individual jobs and occupations in other countries. While the correction procedure partly accounts for peculiarities of national labor markets, it has several drawbacks. First, before the regression can be estimated, the occupational codes used in O*NET (6-digit level of SOC) must be translated to the occupational codes in PIAAC (ISCO) using a crosswalk, and the latter codes are only available at the imprecise 2-digit level. Arntz et al. (2016, 2017) use a multiple imputation method to deal with this issue. Second, the approach starts with digitalization scores at the occupation level, whereas we suggest starting with scores directly attributed to the much finer level of detailed work activities to enhance accuracy and precision. Third, predictions from a regression have a lower variance than the original data, which is likely to be reflected in the results.

In this paper, we rely on the main advantage of previous cross-country adjustment methods, namely the use of individual-level survey data, but aim to overcome the drawbacks of prior approaches mentioned above. We develop a methodology that allows translating existing scores of AI impacts, most of which were developed using data for the U.S., to the contexts of other countries at the level of work activities. Our method allows comparing AI impacts on workers in countries at vastly different levels of development, including low-income and least-developed economies.

In a nutshell, we propose to use individual-level surveys of workers’ skills, such as the World Bank's Skills Measurement Program (STEP) for developing countries or PIAAC (for OECD countries). We use the state-of-the-art method SBERT to assess semantic similarities between textual descriptions of detailed work activities (DWA) from the O*NET occupational database for the US,Footnote 2 for which AI impact scores are available, and the textual descriptions of workers’ skills elicited in surveys available for developing countries, in particular the World Bank’s STEP Skills Measurement Program. We then use the matrix of relatedness to translate the AI impact scores to the level of individual workers’ skills in a given country. In this way, an additional advantage of our method is that it supports different levels of analysis of AI impact on labor markets: at the individual level distinguishing by workers’ characteristics such as age or gender, at the skill level, or at the occupation level.

We illustrate the method using the cases of two neighboring Asian countries: Lao PDR, a least developed country according to the United Nations classification,Footnote 3 and urban areas in Viet Nam, a developing country that has transformed from one of the poorest countries in the 1980s into a lower middle-income country today. Among the digitalization measures available, we choose the suitability of work activities for machine learning as reported by Brynjolfsson et al. (2018).

The picture that emerges from our approach is insightful and shows that the impact of AI on individual workers is more heterogeneous in urban Viet Nam than in Lao PDR. While most respondents in urban Viet Nam are moderately affected, a significant number of workers are at high risk of being displaced by digital technologies; in Lao PDR, the impact is more evenly distributed. The most common occupation reported by STEP respondents in Lao PDR, subsistence crop farming, has a comparably low suitability for machine learning, presumably due to the importance of non-routine manual tasks in this occupation. The most common occupations in urban Viet Nam are more suitable for machine learning, in particular the occupations of shop salespersons and textile machine operators, but also of crop growers (according to the tasks they perform in Viet Nam). At the same time, workers in these occupations perform a relatively large variety of tasks in Viet Nam, some of which cannot be automated; this makes it likely that these occupations will be transformed rather than completely automated.

It should be noted that these results only make an assessment regarding the impact of machine learning on jobs, not about the overall risk of automation due to other types of technologies, such as non-AI software and robots. Non-digital mechanization, for instance, might affect occupations such as subsistence crop farming in Lao PDR more immediately than digitalization and AI.

We also compare results obtained with the proposed method to the results from a naïve approach when the AI impact scores are transferred from the United States to Lao PDR and Viet Nam at the level of occupations. In comparison to our proposed method based on semantic textual similarity matching, the naïve approach seems to produce too much noise to derive meaningful insights.

2 Data and methodology

2.1 AI impact measures

Several measures of AI impacts on occupations in the United States have been suggested by recent literature. To illustrate our method, among the available digitalization measures that we will briefly discuss below, we choose the suitability of work activities for machine learning (ML) provided by Brynjolfsson et al. (2018). The main reason for this choice is that this measure is available at the very detailed level of work activities, while other measures are usually available at the less disaggregated level of workers’ abilities, work tasks or occupations.

Brynjolfsson and Mitchell (2017) identify eight key criteria that specify conditions under which ML techniques can be employed as substitutes or complements to human labor.Footnote 4 The authors emphasize that these criteria are developed solely on the basis of technical feasibility, and that other factors, such as the elasticity of labor supply, price and income elasticities, determine the economic feasibility of implementation of ML applications. Brynjolfsson et al. (2018) create a rubric of 23 questions that aim at estimating the degree to which a detailed work activity (DWA) as defined in the O*NET database (compiled by the US Department of Labor) falls under the eight above criteria, and hence, is “suitable for machine learning” (SML). Corresponding to the eight criteria, this rubric also only concentrates on technical feasibility, not on the economic, organizational, legal, cultural, and societal factors influencing ML adoption. Based on a survey, the authors evaluate the potential for applying machine learning to the 2,069 DWAs, 18,156 tasks, and 964 occupations in the O*NET database. The authors use Crowdflower, a Human Intelligence Task (HIT) crowdsourcing platform, where each DWA is scored by 7 to 10 respondents with knowledge in the area. Through the 23 questions respondents are asked to evaluate each DWA based on the eight criteria. Brynjolfsson et al. (2018) then aggregate their scores from the DWA level to the task level and further to the occupation level in the United States weighted by importance as recorded in O*NET. The result is an average SML score for each US occupation.

Since the SML scores reported by these authors focus on the possibility of automation of activities currently performed by human workers, the average SML of the work activities performed in an occupation can be interpreted as destructive digitalization in the sense of putting workers at risk of being displaced (see also Fossen and Sorgner 2022). In contrast, the standard deviation of SML scores across work activities performed within an occupation reflects transformative digitalization, because occupations combining activities that can be automated with activities that cannot be automated are likely to be reorganized (Brynjolfsson et al. 2018) and transformed rather than to displace workers. Workers in these occupations are more likely to benefit from their close interaction with new digital technologies than to lose their jobs. The SML scores have the advantage that they are first generated at the level of DWAs in O*NET. These DWAs resemble the skills and work activities elicited in surveys like STEP or PIAAC, which facilitates the translation of these scores to other countries. We elaborate further on the conceptual differences and similarities between the DWAs from O*NET and the skills questions from STEP in Section 2. 3.

Alternative currently available AI impact measures could also be applied within our methodological framework, but some adaption would be necessary. A second option are the AI Occupational Impact (AIOI) scores provided by Felten et al. (2018), potentially as a measure of transformative digitalization (as argued by Fossen and Sorgner 2021, 2022). These scores are constructed at the ability level in O*NET. Although our approach could be suitable to use the AIOI scores in combination with individual-level surveys measuring workers’ abilities, there are only 52 abilities in O*NET, much less than DWAs. Moreover, the textual descriptions of abilities in O*NET seem to be quite dissimilar to the textual descriptions of skills provided in STEP, reflecting different concepts underlying these measures and, therefore, making the AIOI scores less suitable for applying our approach in combination with the STEP surveys.Footnote 5

A third option are the computerization probability scores provided by Frey and Osborne (2017) as a measure of destructive digitalization. However, these probability scores are only available at the occupation level, so one would have to break these down to the level of work activities, implying imprecision. One way to do so could be to regress the computerization probabilities at the occupation level on the nine bottleneck skills from O*NET identified by Frey and Osborne (2017). This would allow the prediction of computerization risk at the occupation level in countries where data on occupations linked to the bottleneck skills are available. Arntz et al. (2016, 2017) pursue a similar approach by regressing the automation probability as provided by Frey and Osborne (2017) on a set of individual job-related characteristics (including tasks and skills) from the PIAAC survey. Yet, the assessment of which tasks are automatable is ultimately derived from the expert opinions assembled by Frey and Osborne (2017) on the occupational level. Alternatively, one would have to resort to the simple approach of transferring the measure to other countries at the occupation level, which does not seem to be accurate, as argued above.

A fourth option is provided by Webb (2020). He develops a measure of exposure of occupations to AI technology by matching descriptions of work tasks in O*NET to the text of patents using text similarity measures. This procedure generates AI exposure scores at the O*NET task level; however, the author currently only provides the data aggregated to the occupation level.

It should be noted that the different measures capture different technologies within digitalization and AI; Fossen and Sorgner (2022) provide a detailed discussion. In particular, machine learning is a subfield of AI from a technological perspective. Therefore, the rankings and relative positions of occupationsare not necessarily expected to be similar when using the different scores. Table 3 in the Appendix shows the mean SML score and its within-occupation standard deviation provided by Brynjolfsson et al. (2018), the computerization probability provided by Frey and Osborne (2017), and the AIOI scores provided by Felten et al. (2018), which were all developed for the United States, for the 10 largest occupations in the United States in terms of employment. Cashiers have the highest SML score among these occupations, and also the highest computerization probability, but a moderate AIOI score. Laborers and freight, stock and material movers (by hand) have the lowest SML score and AIOI score, but a high computerization probability. Therefore, analyses using different scores would be interesting as they would answer different research questions, but they are not suitable as robustness checks.

2.2 Individual-level data on skills in developing countries: STEP survey

The STEP skills measurement program is provided by the World Bank. The goal of the survey is to provide representative individual-level data on the skills of the workforce and the usage of these skills in the individuals’ jobs that can be compared across countries. STEP is based on the adult population aged between 15 and 64 residing in urban municipalitiesFootnote 6 in developing countries and is comparable to the PIAAC survey by the OECD. While the focus of PIAAC is primarily on high-income developed countries, the STEP survey focuses on developing and transition economies. So far, STEP has been administered in two waves, in 2012 and 2013, in 13 countries, including Lao PDR and Viet Nam (surveys in these two countries were conducted in 2012). STEP surveys provide detailed information on individuals’ socio-demographic characteristics (e.g., age, gender, formal education level) and job characteristics.

The STEP survey specifically targets the measurement of skills of the workforce, broadly defined as “abilities to do certain things”. STEP distinguishes three types of skills: cognitive skills (e.g., reading and writing proficiency), socio-emotional skills (referring to social and emotional behaviors, personality, and attitudes), and job-relevant (technical) skills (see Pierre et al. 2014, for more details). For the purpose of our study, we use a subsection of STEP questions that attempt to measure cognitive skills and job-relevant skills through self-reported information on respondents’ use of these skills in work-related activities (see Table 4 in the Appendix). These questions therefore link the relevant skills to typical work activities. These activities in the STEP questions resemble direct work activities (DWA) as defined in O*NET. We call these 44 activities “STEP skills” throughout the paper, even though, strictly speaking, these questions mostly relate to certain activities that are supposed to reveal information about underlying skills of the respondents in the three categories mentioned above (cognitive skills, socio-emotional skills, and job-relevant skills). We exclude respondents from the sample who did not work during the last 12 months before the interview because they are not asked about their work-related skills.

2.3 Matching O*NET work activities to skills in STEP

A major challenge regards matching the descriptions of work activities in O*NET, for which we have AI impact scores such as the SML scores, to skills in STEP. Even at the level of abilities, which is more aggregated than the level of DWAs, a manual approach seems infeasible. For example, there are 52 O*NET abilities and 44 skills in STEP, so a translation matrix would require determining 2,288 weighting scores. Furthermore, this approach would be entirely subjective.

Alternatively, one could conduct a new expert survey specific to a country of interest, similar to the approach of Brynjolfsson et al. (2018) or Frey and Osborne (2017), to produce new digitalization scores instead of using the existing scores developed for the United States. Although we consider this approach as a possible avenue for further research, a disadvantage is that it requires substantial resources (e.g., conducting a survey and collecting expert judgments), and it would be limited to a single country or region.

In this paper, we suggest and illustrate a third approach. We directly match 2,069 detailed work activities (DWAs) in O*NET to the 44 STEP skills creating a matrix of relatedness. The PIAAC survey could also be used instead of the STEP to target a different set of countries. O*NET uses its “Content Model” as its conceptual foundation and provides clear definitions for abilities (“enduring attributes of the individual that influence performance”), for skills (“developed or acquired attributes of an individual that may be related to work performance”), and for detailed work activities (“specific work activities that are performed across a small to moderate number of occupations within a job family”). The O*NET model defines a set of generic skills, for example, basic skills like “active listening”, “mathematics”, or cross-functional skills like “social skills” or “technical skills”, which can be further broken down into a total of 35 more detailed skills. Workers then need some of these 35 skills to successfully carry out tasks or activities in their occupations. These activities are described in detail as 2069 DWAs, which are then linked to the 1014 U.S. occupations. As we explained more elaborately in the previous section, the STEP survey collects a wide range of variables including questions about performed activities at work. It does not provide a detailed typology and rather asks the interviewee about actual activities he or she has performed recently (which may allow to draw conclusions on the skills of the surveyed person). The 44 “STEP skills” from the utilized questions resemble more the DWAs than the generic skills in O*NET. Thus, our approach works better at the DWA level than the abilities or skills level. This has affected the choice of the AI impact measure that we use to illustrate our method: since the SML scores of Brynjolfsson et al. (2018) are available at the work activities level, the application of our approach at the work activities level using the SML scores is straightforward.

To find semantic similarities between the textual descriptions of O*NET work activities and the STEP skill measures, we apply automated semantic textual similarity matching techniques (SBERT). By using SBERT, we avoid a manual assignment of similarity as discussed above. The main advantages of this approach are the following: it is systematic rather than subjective; it is automated; there is no need to conduct new surveys; and the same method can be used with different data sources such as STEP and PIAAC for many countries. As our method is based on activities performed within the occupation, it has the additional advantage that occupations not included in the original set of occupations with AI impact scores can be examined as well, including new or reorganized occupations.

2.4 A new method based on semantic textual similarity matching using SBERT

In this section, we describe our method in detailed steps. The first step involves processing the textual descriptions of the DWAs in O*NET and the descriptions of the skills used by employed STEP respondents in their main job. The latter are the questions from the STEP questionnaire that aim at assessing the skills of employed respondents (see Table 4 in the Appendix). We combine the textual descriptions to a single string vector. Then we preprocess the string data stored in this vector. This includes removal of accents, consecutive whitespaces, substitutions of various text characters (e.g., “- “, “,” and “.”), and text conversion to lowercase. In the next step, word (semantic) embeddings are created for both DWAs and STEP questions using the Sentence-BERT (SBERT) method (Reimers & Gurevych 2019).Footnote 7 The model we apply is provided by MS Marco, which is pre-trained with real user search queries from the Bing search engine, a corpus that consists of 8.8 million passages.

In the second step, we create a similarity matrix that contains cosine measures of similarityFootnote 8 between all documents in the sample using the semantic word embeddings created in the previous step. These similarity measures account for semantic similarity between the textual descriptions of 2069 DWAs from O*NET and 44 STEP questions.Footnote 9 O*NET also provides broader, less occupation-specific activity descriptions in a hierarchy. General work activities (GWAs) are the broadest category, followed by intermediate work activities (IWAs), and DWAs are the finest categories. In addition to the first cosine similarity matrix using the DWAs, we create a second cosine similarity matrix using the GWAs to add more information on the nature of each work activity. For example, consider the DWA “Prepare forms or applications.” We improve similarity matching results by adding information that this DWA belongs to the broader GWA category “Documenting/Recording Information”. This way we distinguish this DWA clearly from the DWA “Position construction forms or molds”, which also contains the word “form”, but belongs to the different GWA category “Handling and moving objects”. Our final similarity measure is built as the average between the two similarity measures: between STEP skills and DWAs on the one hand and STEP skills and GWAs on the other hand. The overall patterns of results are not very sensitive to the choice of whether the similarity scores of the DWAs are averaged with any similarity scores of higher-level categories: with the GWAs as done here, with the IWAs, with both the GWAs and IWAs, or with none of these higher-level categories. After this second step, we have for each of the 44 STEP skills 2069 similarity scores that link the particular STEP skill to the DWAs.

In the third step, we use these final similarity measures as weights to create SML scores at the level of STEP skills. We do so by calculating for each of the 44 STEP skills a weighted average of the SML scores at the O*NET DWA activity level:

$${SML}_{skill}=\sum_{activities}\left({similarity}_{activity, skill}{ SML}_{activity}\right)/\sum_{activities}\left({similarity}_{activity, skill}\right)$$
(1)

While larger SML scores signify better suitability of the skills for machine learning, the units of the original SML scores provided by Brynjolfsson et al. (2018) do not have a direct interpretation. Therefore, we standardize the SML scores at this level of STEP skills, with each skill receiving the same weight. Table 4 in the Appendix shows the standardized SML score for each of the 44 skills in STEP. For example, ‘using databases’ and ‘searching for information on the internet’ are the skills most suitable for machine learning, as indicated by the largest SML scores, which seems very plausible. In contrast, ‘physically demanding work’ has the lowest SML score. An example for physically demanding work from the STEP questionnaire is ‘construction’ and one for physically not demanding work is ‘sitting at a desk answering a phone’; it is plausible that the latter task is much more suitable for machine learning (an example would be automated call centers using AI) than the former.

Fourth, we merge the SML scores calculated at the level of STEP skills with the individual-level STEP surveys for Lao PDR and Viet Nam conducted in 2012, the latest available year for both countries. There are three types of questions in STEP that are used to measure the skills respondents use in their jobs: yes/no questions about whether a certain skill is relevant in one’s job (e.g., if a job requires reading books); cards questions that measure on a 10-point Likert scale the extent to which a particular job characteristic is relevant for one’s main job (e.g., the extent to which a job is physically demanding); and frequency questions that measure (on a 4- or 5-point Likert scale) the time that a person dedicates to a particular skill or task in his or her main job. In order to make the responses to the different types of questions comparable, we normalize them such that the responses can take values within an interval between 0 and 1. Now we use the normalized individual responses to create a score capturing the SML of the skills each individual uses in his or her job. More precisely, we create an SML score for each individual i averaged over the skills and weighted by the normalized individual responses to the questions on the usage of these skills. This is our measure of labor-displacing (destructive) AI technology at the level of the individuals’ jobs:

$${SML}_{i}=\sum_{skills}\left({usage}_{i, skill}{ SML}_{skill}\right)/\sum_{skills}\left({usage}_{i, skill}\right)$$
(2)

Fifth, we create mean SML scores at the occupation level and the within-occupation standard deviation of the SML scores. We follow the method by Brynjolfsson et al. (2018) as closely as possible. These authors start with SML scores for each DWA in O*NET, then they aggregate them to a broader level of tasks and then to the level of occupations by building weighted averages (they call this mSML). In addition, they calculate the standard deviation of SML across tasks within each occupation (sdmSML). Both mSML and sdmSML are weighted by the importance of the tasks in the occupation as provided in O*NET. Since detailed occupation databases like O*NET are unavailable for most countries, including Lao PDR and Viet Nam, we use the STEP survey to derive the task composition of occupations in these countries. To do so, we calculate the average of the usage of each skill, obtained from questions in STEP, over individuals i in each occupation occ in a country:

$${usage}_{occ,skill}={\overline{{usage }_{i,skill}}}_{{occ}_{i}=occ}$$
(3)

Then we create an SML score for each occupation as the average SML score over the skills, weighted by the average usage of the skills in the occupation. This is our measure of labor-displacing (destructive) AI technology at the level of occupations:

$${mSML}_{occ}=\sum_{skills}\left({usage}_{occ,skill} {SML}_{skill}\right)/\sum_{skills}\left({usage}_{occ,skill}\right)$$
(4)

Finally, we calculate the standard deviation of the SML scores across the skills in each occupation, weighted by the average usage of the skills in the occupation in the country (\({usage}_{occ, skill}\)):

$${sdmSML}_{occ}=\sigma \left({SML}_{skill}\right)$$
(5)

A large standard deviation of the SML scores within an occupations indicates that an occupation combines work activities that are suitable for machine learning with work activities that are not suitable for machine learning. This suggests that human workers will still be needed in the occupation but could closely collaborate with AI technologies in reorganized occupations (Brynjolfsson et al. 2018). Therefore, we interpret this measure as transformative AI technology at the level of occupations.

3 Results for Lao PDR and Viet Nam

To better understand the occupational structure of the labor markets in Lao PDR and Viet Nam, we first provide descriptive statistics based on data from the Labour Force Surveys (LFS) provided by the International Labour Organization (Table 1). Since the STEP data is available for both urban and rural areas in Lao PDR, but only for urban areas in Viet Nam, we show corresponding statistics in this table; Table 5 in the Appendix shows the occupational structure for both urban and rural areas in Viet Nam. According to the LFS, 44% of the workers in Lao PDR and 32% in Viet Nam reside in urban areas. Referring to Table 1, 40% of the workers in Lao PDR work in agricultural occupations. In Viet Nam, the largest occupations in urban areas are service and sales workers (28%) followed by elementary occupations and professionals (15% each). Elementary occupations comprise many different simple tasks, like door-to-door sale, cleaning and home care activities, simple farming tasks and steering animal-drawn vehicles. According to the information provided by the LFS, we do not observe a significant gender employment gap at the aggregate level, although large differences occur at the occupational level. In particular, women are under-represented among plant and machine operators and assemblers in both countries, among technicians and elementary occupations in Lao PDR and among managers and craft workers in Viet Nam.

Table 1 Occupational Structure in Lao PDR and Viet Nam

Next, we present the measures of destructive and transformative digitalization estimated for Lao PDR and urban Viet Nam following our proposed methodology. Figure 1 shows the kernel density of the SML of the skills reported by individuals in the STEP survey (\({SML}_{i}\)) in both countries. For Viet Nam, there is a bimodal distribution: Most respondents have a mix of skills that is moderately suitable for machine learning, which shows that these individuals are at moderate risk of being displaced by digital machines. However, a significant number of individuals also exhibit skills that are highly suitable for machine learning. This points toward the fact that the labor market in urban Viet Nam is more heterogeneous than in Lao PDR in terms of susceptibility of individual workers’ jobs to labor-displacing machine learning technologies.

Fig. 1
figure 1

Suitability for Machine Learning of Individual Jobs in Lao PDR (Left) and Viet Nam (Right)

Overall, the mean SML score across workers is -0.619 in Lao PDR and -0.398 in Viet Nam, indicating that workers in urban Viet Nam are more affected by machine learning on average (see Table 2); the difference is significant at the 1% level. The scores are comparable across countries because they were standardized at the level of skills in STEP, which are the same in both countries. The fact that the scores are negative in both countries indicates that workers in both countries use skills that are less suitable for machine learning than the average across the skills elicited in the STEP survey; in Lao PDR, the average SML score is 62% of a standard deviation away from the average across STEP skills.

Table 2 Individual-level SML Scores by Country, Gender and Age

The comparison between urban Viet Nam and Lao PDR shows that the level of development is not necessarily an indicator for the suitability of jobs for machine learning. Lao PDR is less developed than Viet Nam with the largest share of workers in Lao PDR engaged in the agricultural sector as crop growers, subsistence crop farmers or animal producers. These are all occupations that have relatively low standardized SML scores, ranging from -0.147 to -0.570, and may not be easily replaced by digital machines. In urban Viet Nam, the largest share of workers are service workers, for example, street and market salespersons (-0.048), shop salespersons (0.075), or finance professionals (0.201), who all have considerably higher standardized SML scores. Informality also tends to be higher in the previous job categories in Lao PDR, which might correlate negatively with SML scores.

Our method also allows us to disaggregate by demographic characteristics such as gender or age. Figure 2 suggests that in both Lao PDR and urban Viet Nam, women use skills in their jobs that are somewhat more suitable for machine learning than men. However, the mean difference between genders is insignificant in both countries (Table 2). An interesting observation from the figure is that heterogeneity of AI impacts on occupations in urban Viet Nam is not specific to male or female workers.

Fig. 2
figure 2

SML of Individual Jobs in Lao PDR (Left) and Viet Nam (Right) by Gender

Several results emerge when the data are disaggregated by workers in different age cohorts (Fig. 3). In Lao PDR, workers in the youngest age cohort (less than 25 years old) use skills in their jobs that are less suitable for machine learning than older cohorts. This is different in urban Viet Nam, where high SML scores are most concentrated among individuals between the ages of 25 and 35.Footnote 10 The differences in SML scores between age cohorts are significant at the 1% level in both countries (Table 2).

Fig. 3
figure 3

SML of Individual Jobs in Lao PDR (Left) and Viet Nam (Right) by Age

Next, we aggregate the SML scores at the occupation level (mSML) in Lao PDR and Viet Nam. This tilts the distribution more to higher SML scores in urban Viet Nam (Fig. 4). The mass of individuals with moderate SML scores we saw in Fig. 1 seems to be concentrated in a few occupations, such that more of the mass of occupations is concentrated at higher SML scores.

Fig. 4
figure 4

SML of Occupations in Lao PDR (Left) and Viet Nam (Right)

Aggregation at the occupation level enables us to not only estimate the mean SML score (mSML) of the skills used in an occupation, but also the standard deviation of the SML scores (sdmSML) of the skills used within an occupation. As argued above, if the skills used in an occupation can be automated on average, workers are at risk of displacement, so mSML is a measure of destructive digitalization. However, if some skills used in an occupation can be automated whereas others cannot, resulting in a high sdmSML score, the occupation will be likely transformed (Brynjolfsson et al. 2018) and workers may benefit from increased productivity. Thus, sdmSML is a measure of transformative digitalization. Transformative digitalization may also be an indicator for required training or re-training on the job or within an occupation, which may have to be supported or enabled by policy makers and employers. Destructive digitalization or displacement risk of workers may require different policy responses such as re-training to different occupations, measures to support job creation in different sectors, or income support to allow workers to make transitions to other jobs. Figure 5 shows that the distribution of sdmSML is shifted toward higher scores in Viet Nam in comparison to Lao PDR, which reveals that more occupations in urban Viet Nam are likely to be transformed or reorganized due to AI than occupations in Lao PDR.

Fig. 5
figure 5

Within-occ. Standard Deviation of SML in Lao PDR (Left) and Viet Nam (Right)

To visualize the effects of machine learning technologies on occupations in both countries, we map how much occupations in Lao PDR and Viet Nam are affected by destructive (mSML) and transformative (sdmSML) digitalization. We depict each occupation in Lao PDR (Fig. 6) and in urban Viet Nam (Fig. 7) as a bubble on a two-dimensional pane. Each bubble represents one occupation, and the size of the bubbles reflects the relative number of workers in the occupation in Lao PDR and urban Viet Nam, respectively, based on the LFS for both countries.Footnote 11 In Figure 11 in the Appendix, the size of the bubbles reflects the relative number of workers in the occupation based on the LFS for both urban and rural areas in Viet Nam. This makes the bubble sizes more directly comparable to those for Lao PDR; however, the underlying STEP data for Viet Nam only covers urban areas.

Fig. 6
figure 6

Destructive and Transformative Digitalization in Lao PDR. Notes: Each bubble represents an occupation in Lao PDR. mSML denotes the mean suitability for machine learning of skills used in an occupation (standardized at the level of STEP skills) and is a measure for destructive digitalization. sdmSML denotes the standard deviation of the SML of skills used within each occupation and is a measure of transformative digitalization. The size of the bubbles represents employment in the occupations based on the 2017 Labour Force Survey for Lao PDR. The largest occupations are labeled

Fig. 7
figure 7

Destructive and Transformative Digitalization in Viet Nam. Notes: Each bubble represents an occupation in Viet Nam. mSML denotes the mean suitability for machine learning of skills used in an occupation (standardized at the level of STEP skills) and is a measure for destructive digitalization. sdmSML denotes the standard deviation of the SML of skills used within each occupation and is a measure of transformative digitalization. The size of the bubbles represents employment in urban areas in the occupations based on the 2016 Labour Force Survey for Viet Nam. The largest occupations are labeled

We observe a tendency for significant labor market transformation in both countries: Occupations in the northeast corner are characterized by high transformative and destructive digitalization technologies, which puts pressure on workers to adapt, combined with a risk of partial displacement (‘machine terrain’). On the contrary, occupations close to the southwest corner show low SML scores and a low standard deviation in SML scores. These occupations can be considered to be in ‘human terrain’, with little expected impact from AI. Few occupations are present in the northwest corner, which represent ‘rising stars’ occupations, with limited risk of destruction and high potential for transformation. Similarly, very few occupations are placed in the southeast corner of ‘collapsing occupations’ with high risk of destruction and little potential for transformation involving human workers (see Fossen and Sorgner 2019, for the characterization of the four sectors in the US context).

By comparing the two countries, we note that the same occupation can have very different mSML and sdmSML scores in different countries because of different work activities workers perform. This indicates that our method has a valuable discriminating power among different pools of workers in different country contexts.

Many occupations in Viet Nam that are important in terms of employment are more suitable for machine learning than many important occupations in Lao PDR and, therefore, they are potentially subject to destructive digitalization. At the same time, many of these occupations in Viet Nam are subject to transformative digitalization, thus, characterizing these occupations as within ‘machine terrain’ for the near future with high levels of both transformative and destructive digitalization (they are in the upper right corner of the chart). The most common occupations in urban areas in Viet Nam, represented by the largest bubbles in Fig. 7, have relatively high SML scores in Viet Nam due to the activities performed in these occupations there. Among these occupations, the activities of textile machine operators are most suitable for machine learning on average. At the same time, the activities performed within this occupation have the highest standard deviation of SML, which suggests that the occupation will be reorganized, and human workers will still be needed in this occupation in the future to perform some of the activities.

In contrast, in Lao PDR, by far the largest share of STEP respondents work as subsistence crop farmers (large bubble in Fig. 6). The suitability for machine learning is lower in comparison to the above-mentioned occupations due to the manual non-routine tasks performed. In Viet Nam, some occupations are also located in the lower left quadrant, for example building frame workers, characterizing them as within ‘human terrain’ for the near future in this country with low levels of both destructive and transformative digitalization. The results suggest that Viet Nam is currently undergoing a significant shift from traditional occupations to those affected by industrialization and digitalization. In contrast, employment in Lao PDR is still dominated to a large extent by agricultural occupations that lie somewhere in the middle on the scales of both transformative and destructive digitalization. Therefore, workers in Lao PDR are currently less affected by AI, as the labor market there has not yet fully absorbed previous waves of automation.

In a nutshell, the gap between ‘machine terrain’ and ‘human terrain’ occupations is clearly more pronounced in Viet Nam than in Lao PDR. At the same time, none of the two countries have many occupations that must be characterized as ‘collapsing’ occupations, which are strongly affected by labor-displacing AI with little prospect of transformation involving human workers, or that fall into the category of ‘rising stars’ occupations, which have low displacement risk but at the same time a high potential for occupational transformation.

How do the measures of destructive and transformative digitalization for Lao PDR and Viet Nam compare to those for a developed economy? Fig. 8 shows the SML scores and their within-occupation standard deviation across tasks for the United States. These scores are directly provided by Brynjolfsson et al. (2018) for the US, so in contrast to Lao PDR and Viet Nam, no translation was necessary. Brynjolfsson et al. (2018) aggregate the SML scores from tasks to occupations weighted by importance in the O*NET database, which is analogous to our procedure. We standardized the scores for Lao PDR and Viet Nam at the level of skills in the STEP survey, but we cannot do this for the US because the STEP survey is not conducted in the US. Therefore, we show the original scores here. Also note that the figure for the US is based on the SOC classification of occupations (6 digits) used by Brynjolfsson et al. (2018), whereas the figures for Lao PDR and Viet Nam are based on the ISCO-08 classification (3 digits) used in the STEP surveys with less detailed occupations. While the scores cannot be directly compared between the US and the other two countries, the patterns can be compared. Retail or shop salespersons have a relatively high mSML score both in the US and in urban Viet Nam, indicating that these occupations are suitable for machine learning. Laborers doing physically demanding manual work (freight, construction) have relatively low mSML scores in these two countries, indicating that this work is not very suitable for machine learning. The patterns in Lao PDR are more different from those in the US. This may reflect that Viet Nam is closer to the US in terms of economic development, so tasks performed within occupations are more similar to the US in urban Viet Nam than in Lao PDR.

Fig. 8
figure 8

Destructive and Transformative Digitalization in the United States. Notes: Each bubble represents an occupation in the United States. mSML and sdmSML are provided by Brynjolfsson et al. (2018). mSML denotes the mean suitability for machine learning of skills used in an occupation (not standardized), which we interpret as a measure for destructive digitalization. sdmSML denotes the standard deviation of the SML over tasks within each occupation, which we interpret as a measure of transformative digitalization. The size of the bubbles represents employment in the occupations as provided by the Bureau of Labor Statistics (2018) for the US. The largest occupations are labeled

Finally, we compare the SML scores translated from the US to Lao PDR and Viet Nam at the work activities and skills level following our approach to the SML scores simply transferred at the occupation level (naïve approach). The naïve approach requires applying a crosswalk between the SOC occupation codes provided for the SML scores by Brynjolfsson et al. (2018) and the ISCO-08 occupation codes available in the STEP survey. When we use this naïve approach and transfer the SML scores (not standardized) from the US to Lao PDR and Viet Nam at the occupation level (Figs. 9 and 10), the maps show no clear patterns or different patterns between the two countries, despite heterogeneous economic conditions and different organization of occupations.

Fig. 9
figure 9

Transferring SML Scores from the US to Lao PDR at the Occupation Level. Notes: Each dot represents an occupation in Lao PDR. The SML scores were translated from the United States to Lao PDR at the occupation level (naïve approach). The size of the bubbles represents employment in the occupations based on the 2017 Labour Force Survey for Lao PDR

Fig. 10
figure 10

Transferring SML Scores from the US to Viet Nam at the Occupation Level. Notes: Each dot represents an occupation in Viet Nam. The SML scores were translated from the United States to Viet Nam at the occupation level (naïve approach). The size of the bubbles represents employment in urban areas in the occupations based on the 2016 Labour Force Survey for Viet Nam

4 Discussion

We proposed a methodology that allows meaningfully assessing AI impacts on individuals, jobs, and occupations in different countries. So far, the analysis of AI impacts on labor markets in countries other than the United States has been rather limited, particularly so in developing countries. While the implementation of AI technologies is still rather low in developing countries, basic AI technologies are already in use in these countries, and substantial potential for adoption of more advanced AI technologies has been identified (IFC 2020). Pronounced interest in enhancing the implementation rate of AI technologies in developing countries is further driven by the promise of these technologies to help leapfrog development.Footnote 12 Hence, understanding the impacts of AI on labor markets in developing countries, including in least developed countries, is crucial, but is dependent on the availability of appropriate methods. Previous methods that we discussed in this paper do not sufficiently account for the fact that occupations are organized in different ways and comprise different work activities across countries. This has been the main challenge to the study of impacts of digitalization on occupations in various countries.

The novel method we propose in this paper relies on the assessment of the suitability for machine learning of 2,069 detailed work activities that constitute occupations. These detailed work activities are reasonably universal activities that can be considered relevant in all labor markets including those in developing and least developed countries. This highly disaggregated level of analysis allows us to overcome the main challenge described above. In a nutshell, our method is based on the SBERT assessment of semantic similarities between textual descriptions of detailed work activities in the occupational database O*NET in the United States, for which digitalization measures are available, and skills elicited in household surveys available in a wide range of countries, such as STEP or PIAAC. This makes it possible to translate measures of digitalization to other countries at the level of work activities and to compare the impact of digitalization across countries and for various groups of individual workers within countries. This method builds on and advances prior approaches such as that suggested by Arntz et al. (2016, 2017), which starts from occupation-level digitalization scores instead of detailed work activities and relies on a crosswalk to 2-digit-level occupational scores.

We illustrate our approach using the suitability of work activities for machine learning (SML) provided by Brynjolfsson et al. (2018) as the AI impact measure, STEP as the survey of individual skills used at work, and the country cases of Lao PDR, a least developed country, and its neighbor Viet Nam, a developing country. Our methodology allows calculating AI impact scores at the level of individuals rather than at the level of occupations, and it provides less noisy and more insightful results than the naïve approach when digitalization measures are translated to other countries at the occupation level. While the mean of the suitability of work activities for machine learning in an occupation reflects destructive (potentially labor-displacing) AI technology, we also calculate the within-occupational variation of this measure to account for transformative effects of AI technology or the extent to which an occupation can be reorganized rather than replaced by technology.

The main insights from our analysis for Lao PDR and Viet Nam can be summarized as follows. First, we find that a larger share of individuals and occupations in urban areas in Viet Nam are exposed to labor-displacing machine learning technologies than in Lao PDR (where the data covers both urban and rural areas). This observation might reflect the differences in skill use between the two countries but also the fact that Viet Nam has already seen a larger transformation of its labor market through previous waves of mechanization, thus, making implementation of machine learning technologies easier. A significant share of workers in Lao PDR are employed in subsistence crop farming where the immediate implementation of AI technologies is challenging given the current state of technology and human capital in the country. This reduces the threat of rising unemployment due to this specific type of technology, but at the same time casts doubt on the feasibility of leapfrogging the current development path by means of AI technologies in Lao PDR. In Viet Nam, where the potential for labor-displacing automation is greater, policy responses could consist, for instance, in implementing measures to support job creation in less affected sectors or supporting workers in obtaining skills that will allow them to make transitions to jobs in these sectors.

Second, the urban labor market in Viet Nam is pronouncedly more heterogeneous with respect to the impacts of AI on individual workers, as compared to the labor market in Lao PDR. Both countries have a rather high share of workers in occupations that are characterized by high suitability of work activities for machine learning technologies, and, at the same time, have a high potential for re-organization of tasks within occupations. However, in Viet Nam there are some relatively highly populated occupations, such as building frame workers, that mainly consist of work activities that are not very suitable for machine learning technologies. While these occupations can be considered as safe in terms of labor-displacing effects of AI on them, there are not many opportunities for workers employed in these occupations to improve their productivity by means of AI. Thus, policy makers should monitor the aspects of inequality that may be due to unequally distributed opportunities for productive work using AI technologies across occupations.

Third, the results of gender-disaggregated analysis indicate that in both countries female workers are slightly more affected by labor-displacing AI technologies than their male counterparts. This is in line with previous research on the impacts of digital technologies on women in the context of developing countries (e.g., Sorgner 2019). We further show that heterogeneity of AI impact on occupations in urban Viet Nam does not seem to be driven by male or female workers, but that it is a rather general phenomenon in this country. Given that the digital gender gap is particularly pronounced in developing countries (Mariscal et al. 2019), policy makers should design and promote educational programs designed for girls and women, to increase their participation in STEM fields and prevent the aggravation of the digital gender gap.

Fourth, several insights emerge from our analysis disaggregated by workers in different age cohorts. We find substantial differences in both countries regarding the impact of AI technologies on younger workers. In Lao PDR, younger workers appear to be least affected by suitability of their work activities for machine learning technologies, while in urban Viet Nam younger workers seem to be among the most affected by this type of AI technology. This suggests that there are large differences in skill use among young workers in both countries, which deserves a more in-depth analysis given that particularly in Lao PDR the share of young individuals in the population is substantial.

Our analysis is not without limitations. Some limitations can be attributed to the methodology, while others are due to the data used in our analysis. In terms of methodology, we were able to improve earlier methods by significantly disaggregating the level of analysis and breaking it down to the level of detailed work activities. Still, one may wonder in how far the detailed work activities are comparable across countries, given different stages of economic development. We argue that using more than 2,000 detailed work activities is currently the most disaggregated level of analysis used in the literature, which represents an important advantage of our method. The highly disaggregated level of work activities makes them rather universal and applicable in various contexts. Moreover, our methodology is based on the application of semantic similarity matching techniques with textual data. We rely on the state-of-the-art Natural Language Processing technique, namely SBERT, to create semantic word embeddings to be used later for finding similar textual descriptions of work activities and skills. Should more advanced methods become available in the future, the method can be adjusted accordingly.

There are several limitations in terms of data used in the analysis. First, surveys like STEP and PIAAC elicit a rather restricted number of skills, which might lead to imprecise results of similarity matching with work activities, as some of the latter might be relevant for one’s job but corresponding information is missing in the survey. Therefore, household survey programs should ensure to include comprehensive information about skills and tasks that do not miss important areas.

Second, for illustration purposes we used the measure of suitability of work activities for machine learning provided by Brynjolfsson et al. (2018). If other measures, for instance, of other types of AI technologies will be developed in the future that are available at this narrow level of analysis, they can be adopted with our methodology in a straightforward way. In addition, future surveys should also attempt to distinguish between work activities, tasks, and abilities in a more systematic way, because some existing AI measures are available at the level of abilities (e.g., Felten et al. 2018), which were not measured in the STEP survey and therefore could not be analyzed with our method. Moreover, considering the speed at which new AI technologies are being developed to automate tasks hitherto not feasible, a more forward-looking approach could be to translate patent data on AI to identify tasks and skills susceptible to be replaced in the future, similar to the approach undertaken by Webb (2020). In addition, it would be desirable to have measures of technology adoption in addition to the task suitability measures to assess the actual impact of digital technologies on job tasks. The actual impact of machine learning technologies on jobs in developing countries could be diminished by many barriers to automation, such as the availability of a young and relatively cheap labor force, the presence of tariffs on digital goods, a lack of high-quality human capital that is needed to adopt new digital technologies, and a relatively high cost of technology adoption given a high share of SMEs and informal businesses, among others (World Bank 2016).

Third, the STEP surveys for Lao PDR and Viet Nam are only available for the year 2012. It would be very useful to have similar surveys of adult’s skills in developing countries that are more recent, representative and include a sufficient number of respondents to allow for a meaningful analysis of different categories of workers. In addition, the measure of suitability of job tasks to machine learning technologies (Brynjolfsson et al. 2018) that we use in our analysis is slightly more recent than the STEP data. Thus, our results show how the occupations of individuals, captured in the structure that existed in Lao PDR and Viet Nam in 2012, were expected to become suitable for machine learning in subsequent years. If in the relatively short period between the collection of the STEP data and the construction of the SML measure the adoption of machine learning technologies in developing countries already affected the composition of job tasks individuals performed, our estimation would still be relevant because it demonstrates the potential impact of machine learning technologies on the structures that existed in 2012. Availability of more recent data on job tasks in developing countries would allow to estimate the extent to which job tasks have changed over the last decade and to relate these changes to the availability of machine learning technologies. In addition, the STEP survey was mainly conducted in urban areas of developing countries but given a strong urban–rural regional divide in these countries, it would be desirable to have data that also includes respondents residing in rural areas. In this paper, only data for Lao PDR covered population residing both in urban and rural areas.

5 Conclusion

Our proposed methodology opens avenues for future research by allowing the estimation of digitalization impact measures of choice for a wide range of different countries, both developing and developed countries. While our illustrative example focuses on SML scores, the STEP survey and the cases of Lao PDR and Viet Nam, other digitalization measures, other surveys such as PIAAC, and other countries should be investigated in the future. The full value of our approach will become visible when applying it to various countries, because the methodology allows using the same digitalization measures across countries, which makes the results comparable. This research will inform policymakers about challenges and opportunities that new digital technologies deliver to different labor markets outside of the United States in a more targeted and precise way than current approaches do. Comparing the impact of digitalization between developed and developing countries will allow adjusting economic development strategies in a timely manner. Future research will also be able to apply our methodology to regions within countries as far as representative surveys with sufficient sample sizes are available. This research will reveal regional digital divides due to digitalization and AI and allow policymakers to develop mitigating and enabling labor market policies such as targeted training programs.

6 Appendix

Table 3

Table 3 Measures of AI Impact on the Largest Occupations in the United States

Table 4

Table 4 STEP Questions to Measure Workers’ Skills and SML Scores

Table 5

Table 5 Occupational Structure in Viet Nam (Urban and Rural Areas)

Figure 11

Fig. 11
figure 11

Destructive and Transformative Digitalization in Viet Nam (Urban and Rural Areas). Notes: Each bubble represents an occupation in Viet Nam. mSML denotes the mean suitability for machine learning of skills used in an occupation (standardized at the level of STEP skills) and is a measure for destructive digitalization. sdmSML denotes the standard deviation of the SML of skills used within each occupation and is a measure of transformative digitalization. The size of the bubbles represents employment in the occupations based on the 2016 Labour Force Survey for Viet Nam (both urban and rural areas). The largest occupations are labeled