From commonly prescribed drugs for gout through to depression, there’s growing evidence that Asians are at a higher risk of side effects from many medications due to their genetic makeup.
In a new study, recently published in Clinical Translational Science, UBC medical student Cody Lo and collaborators at Stanford reveal just how prevalent and serious some of the side effects can be.
Lo, the study’s lead author, shares why a one-size-fits-all approach to prescribing medications is risky and how a growing interest and demand for genetic testing is changing the face of medicine.
What are some of the side effects?
Asians – who make up more than 60 percent of the world’s population – are at increased risk for a wide range of side effects from a variety of common medications used for everything from gout to hormone replacement therapy and depression.
Our review, which examined existing literature in the field of pharmacogenomics (the study of how genes affect a person’s response to drugs), in combination with guidelines and statistics from the Food and Drug Administration (FDA) and the Clinical Pharmacogenetics Implementation Consortium (CPIC), summarizes some of these side effects and how they are affected by genetics.
The side effects, which ranged from benign reactions, such as reduced drug efficacy, to liver injury and severe skin reactions, are serious enough to warrant changes in dosing and – in some cases – medication choice altogether.
Why are Asians at higher risk for adverse reactions?
Asians are more likely to have certain genetic variants compared to other ethnicities that put them at increased risk for these side effects.
One genetic variant, associated with a severe skin reaction from carbamazepine, an anti-convulsant drug, is more than 150 times more common in Asian populations. It is so much more common, in fact, that the FDA has issued black box warnings not to use this drug in Asian populations.
Asians are also nearly seven times more likely to have a genetic variant that puts them at risk of a severe rash when taking allopurinol, a common medication for gout.
In the majority of cases that we reviewed, the adverse response to medications were due to Asians more commonly being carriers of a particular gene, known as the HLA-B*15:02 allele.
Do all Asians face the same risk?
It’s important to remember that different ethnic groups we colloquially refer to as “Asian” can have quite different genetic backgrounds.
Studies can range from investigating specific populations such as “Han Chinese” to much broader populations like “South Asian.”
In this review we referred to Asians broadly so that we were able to include a wide cross section of studies.
One of the enduring challenges facing pharmacogenomics is balancing an appreciation for granularity of differences between related populations and effectively summarizing the evidence.
Research groups have suggested standardized ways of classifying more specific ethnic groups moving forward and I anticipate future studies will be able to further delineate differences between these populations.
How is interest and demand for genetic testing changing medicine?
Increased interest and demand for genetic testing is already having an impact on clinical practice.
At Stanford, they actually have a clinic where patients who have experienced serious side effects to medications can be tested for these variants or can bring in their own genetic test results for interpretation.
They are a bit of a unique example, but here in Canada, we’re also seeing a growing interest from patients in learning how their genes affect their response to drugs.
For doctors, recognizing that certain effects may be more prevalent in Asian populations can be very helpful in their daily practice. Part of the reason why this review was conducted was to give doctors an evidence-based resource to help inform their clinical decision making.
How did the opportunity to collaborate with researchers at Stanford come about?
UBC’s medical program provides students dedicated time to pursue scholarly activities as part of a unique course, known as Flexible and Enhanced Learning, or FLEX. This course enabled me to take my experience from UBC and spend one of my blocks at Stanford, working with Dr. Russ Altman, a world-renowned researcher in the field of personalized medicine.
I also worked with senior author Dr. Latha Palaniappan, founder of the Stanford Center for Asian Health Research and Education, and an international team of pharmacists to synthesize the current data.
Before starting medical school, I became very excited by the idea and field of personalized medicine through working with UBC faculty members Dr. Liam Brunham and Dr. Bruce Carleton. Now, as a physician-in-training, I find the idea of someday using genetic data to guide my clinical decision making incredibly rewarding—this is the future of medicine.
In the era of precision medicine, pharmacogenetics and pharmacogenomics have emerged as fields which seek to optimize drug therapy toward patients based on their individual genetic characteristics.1
Drug metabolism and dosage can vary widely between different individuals, so an individual’s genetic profile can be crucial for providing optimal drug therapy. A number of genes have already been identified as part of US Food and Drug Administration (FDA) guidelines for clinical care.2
Among these, the cytochrome P450 (CYP450) genes have been identified as some of the most important for the delivery of personalized drug therapy.3
The CYP450 enzymes have been estimated to mediate the phase I metabolism of 70–80% of all clinically used drugs.3
Several CYP450 genes have been labeled by the FDA as actionable pharmacogenes which are present on drug labels.4
For example, the CYP2C9 gene has been implicated in the metabolism of warfarin and fluvastatin, among other drugs, while CYP2C19 has been linked to the metabolism of clopidogrel.5,6
Alleles in these genes are linked to different phenotypes which can affect the metabolism and therefore dosing of certain drugs. In CYP2C9, two single-nucleotide polymorphisms (SNPs), which are commonly referred to as the CYP2C9 *2 and *3 alleles, have been linked to the warfarin “poor metabolizer” phenotype.5 Similar alleles and corresponding SNPs have been identified for other CYP450 genes.3
Understanding the interaction between these genes and clinically used drugs in order to provide guidelines for providers is the primary target for pharmacogenetics research.
In the current era of precision medicine, the field of pharmacogenetics is becoming increasingly important in guiding clinical decisions.52 However, currently, pharmacogenetics testing in the clinical setting is not widespread for a variety of reasons.
Results of tests can have long turnaround times which can limit their clinical utility.53 In addition, these genetic tests may be prohibitively expensive to conduct or may not even be available in areas with less developed infrastructure or in low-resource countries.54
In case of emergency or acute care settings, individual pharmacogenetic profile may not be readily available though it was shown to be very useful for medication regimen optimization,55 especially in patients with polypharmacy.56
In such cases, use of self-reported racial identity or geographical ancestry may be instrumental in implementing clinical decision support for initial pharmacogenetic-driven medication prescription.57
As technology and medical infrastructure evolve, the ideal of standard genotyping for every patient may come to fruition in the near future. In the meantime, it is necessary to investigate other, more accessible biomarkers for their utility in guiding clinical decision-making.58
For this reason, race and ethnicity have become the subject of intense research for use as potential biomarkers which can serve as adjuncts to genotyping tests in the clinical setting.59
A number of drugs have already been discovered to have different responses in populations of different racial composition.60,61 For example, in warfarin dosing, the average effective dose needed has been found to vary based on race, so recent studies have proposed using ethnicity as a variable in warfarin dosing algorithms.62–64 Several antihypertensive drugs have also been implicated to have different responses based on race.
In a large clinical trial comparing losartan, an Angiotensin Receptor Blocker, to atenolol, a β-blocker, a black population was found to have better clinical outcomes when treated with atenolol, while non-black populations were found to have better clinical outcomes when treated with losartan.64,65
Clinical trials for the antihypertensive drug BiDil found that the drug was effective only in black populations, leading to the drug receiving FDA approval for use in black populations only.66–68
Although the literature has identified these patterns in differing drug responses between populations of different racial backgrounds, the mechanism for these differences is still poorly understood.
For this reason, pharmacogenetics research on race and ethnicity has sought to provide one potential explanation for these phenomena.
A proposed mechanism for these racial differences in drug response is the discovery of patterns of pharmacogenetic allele distribution amongst different ethnic and racial groups. Studies have found that some minor alleles are extremely rare (<1% prevalence) in some ethnic groups but can be found at a relatively common rate in others.69–72
For example, the CYP2C9 *2 allele has a prevalence of less than 1% in Chinese populations but can reach a prevalence of up to 19% in certain European populations.73–75 As a result, a significant amount of research has gone into further understanding the differences in pharmacogene expression between different races and ethnicities.
Nonetheless, research into race and ethnicity remain controversial issues, with some questioning its utility in clinical practice.76 In particular with regard to genetics, the traditional understanding and classification of race has been called into question.
The traditional ideology of race as distinct, discrete categories such as “White” and “Black” is now regarded by many as a social construction, rather than a biological one.77,78 Genetic differences within populations are complicated and often fall on a spectrum, particularly in populations such as African-Americans in the US, who have experienced varying degrees of admixture across generations.79–81
Nonetheless, in lieu of widespread genotyping for the reasons described above, race and ethnicity remain one of the few readily accessible insights into a patient’s possible genetic risk factors in the clinical setting. For this reason, it is important to refine the usage of race and ethnicity as biomarkers in order to effectively serve the goals of precision medicine.
A major obstacle in producing translatable research regarding race and ethnicity is the lack of consensus on racial and ethnic categories to use and how to classify individuals of different ethnicities.82 Some countries establish national guidelines for how to classify race and ethnicity in medical research; even so, these guidelines can vary widely between countries.
In the US, the National Institutes of Health (NIH) have defined several broad racial and ethnic categories for researchers to use in their studies.83 These categories include terms referring to major continental populations such as “White”, “Black”, “Asian”, etc.64 However, a high degree of ethnic heterogeneity exists within each of these main categories.
For example, the category “Asian” can be further divided into the subcategories “East Asian” and “South Asian”.82
This creates difficulty in ethnicity-based pharmacogenetics research, as the categories of race and ethnicity used vary widely between different researchers.82,84 This inconsistency in specificity of race and ethnic classification can prevent the results from being translated into the clinic accurately.
We are concerned that lack of specificity in racial and ethnic categorization can cause overgeneralization of results which may break down when examining certain ethnic minorities within the broader racial category. In addition, the use of overly broad racial categories and the lack of standardization in race and ethnicity-based research run counter to the aims of precision medicine.
In this study, we aimed to examine the different racial and ethnic categories used in pharmacogenetic studies and how inconsistency in categorization could have clinical consequences. In order to do so, we conducted a literature review of pharmacogenetics studies published between May 2015 and July 2018.
Using this method, we (i) categorized the different racial and ethnic categories used in pharmacogenetics studies, (ii) examined how racial and ethnic category use may vary between studies published by authors from different countries, and (iii) examined the genetic heterogeneity present within broad continental racial categories for certain CYP2C9, CYP2C19, and CYP2D6 alleles.
Methods
Analysis of number of ethnic categories
We searched the PubMed database using the following Boolean search term: (pharmacogenetics OR pharmacogenomics) AND (race OR ethnicity) AND (cyp2c9 OR cyp2D6 OR cyp2c19). We filtered the results to show results only from May 2015 to July 2018 to focus on the scope of our analysis. Using this search term, we found 88 initial results. We then used the following inclusion criteria for studies to be included:
- The study had to analyze the frequency of CYP2C9, CYP2C19, or CYP2D6 alleles within a human population.
- The study had to stratify their study population using racial or ethnic categories or specify the race or ethnicity of the population being studied if there is only one category.
- The racial or ethnic categories examined in the papers had to fall under at least one of the 3 following NIH defined racial categories: White, Black, or Asian as defined by the “Racial and Ethnic Categories and Definitions for NIH Diversity Programs and for Other Reporting Purposes” NIH guideline.83 Thus, studies exclusively examining racial or ethnic groups such as Hispanics or American Indians were excluded to be addressed in further reviews.
- Case reports, case series, and expert opinions were excluded. All other study designs which fit the other criteria, including systematic reviews and meta-analyses, were included.
After analyzing the search results by study title and abstract, we found 60 papers to include in our initial review. We then examined the citations of these papers and added an additional 43 papers to our analysis using the same inclusion criteria.
For the papers found in the citations, we allowed papers published before May 2015 to be included. We did this so that all racial and ethnic categories drawn from cited papers could be compared to the racial and ethnic categories used in the original source. In total, 103 papers were included for this analysis. As the terms “White” and “Caucasian” had been used interchangeably in literature, articles with both terms were included in the final analysis and both terms have been used in this report.
We then classified these papers by country of origin of the 1st author and examined the ethnic categories they used. An ethnic category was defined as any label used to stratify the populations being studied by geographic, cultural, linguistic, or genetic characteristics for which allele prevalence data was present or was otherwise used as a comparison variable. This included ethnic categories which were cited from other papers.
Ethnic categories mentioned outside of the Results or Methods sections were excluded in order to restrict the study to terminology which was included in the primary analysis of the study. We classified each category as Caucasian, Black, or Asian using the NIH definitions noted above. We limited our analysis to these three racial categories as they are the most highly investigated in the literature.
Although Hispanic populations represent an important racial minority, particularly in the US, they remain heavily underrepresented in medico-scientific study and were thus excluded from this review.85 Ethnic categories which could not be precisely classified to just one of the three above categories such as “Other” or “North American” were excluded.
The full details of each study included, such as the countries of origin and the ethnic categories used, can be found in Table S1. We then examined the number of unique race/ethnicity categories used for each of the three major NIH racial categories being analyzed in this study by different countries. In order to exclude countries which had limited representation, we only included countries which had four or more papers within our sample for this analysis.
Using the information contained within the papers as well as supplemental literature, we created three charts, one for each NIH category, which show the variety of different ethnic categories used in the papers as well as how they can be categorized relative to each other. The charts for “Asian” ethnic categories can be found in Figure 1, “Caucasian” in Figure 2 and “Black” in Figure 3.
Not all ethnic categories were included in the charts; if an accurate categorization for an ethnic category could not be found or if the categorization lacked a consensus in the literature, the category was excluded from the chart. In addition, a few were excluded for clarity and ease of visualization. Racial and ethnic identity is a sensitive and controversial issue; therefore, these charts are not intended to be definitive classifications of these ethnic categories. They are intended for the visualization of the complexity of ethnic categorization in pharmacogenetics literature and only reflect how these ethnic categories were classified in the literature we reviewed. A full list of ethnic categories included in this study can be found in Table S2.

Chart of ethnic categories which fall under the NIH term of “Asian”.
Note: Data from references 7, 8, 10, 12, 14, 16, 28, 37, 41, 42, 48, 51, 71–73, 87–90.

Figure 2
Chart of ethnic categories which fall under the NIH term of “Caucasian”.
Note: Data from references 9, 11, 13, 18, 20–26, 28, 34, 37, 42–45, 70, 75, 87, 91–94, 128.
Analysis of racial/ethnic differences in cytochrome P450 allele prevalence
For our analysis, we chose to analyze the CYP2C9 *2 and *3 alleles, the CYP2C19 *2, *3, and *17 alleles, and the CYP2D6 *4 and *17 alleles. These alleles were chosen because they are classified as actionable pharmacogenes by the Phamacogenomics Knowledgebase (PharmGKB) with Level 1A evidence and have a large amount of literature examining their prevalence in different populations.86 The allele frequency data for several broad ethnic groups was first extracted from the Clinical Pharmacogenetics Implementation Consortium (CPIC) website.87 We then compared this data to allele frequency data extracted from the papers included in our racial category analysis but with the following additional inclusion criteria:
- The population sample had to be healthy volunteers or otherwise described as members of the general population.
- The population sample had to include at least 20 members.
- Study populations which did not stratify their population further than the CPIC racial categories (eg divided their population into Caucasian and African-American only) were not included.
The study populations were assumed to be unrelated. The allele frequencies were found using a variety of genotyping methodologies, including PCR and whole genome sequencing. The allele frequency was extracted or calculated from the data in the paper, and the ethnic category was used as defined by the original study. We summarized the ethnic groups whose allele frequency varied from their broader CPIC ethnic groups in Table 2. The full table of all allele frequency data across all groups can be found in Table S3.
Table 2
Allele frequency data for ethnic categories which showed a significant deviation from CPIC categories
Ethnicity | CYP2C9 allele frequency (%) | CYP2C19 allele frequency (%) | CYP2D6 Allele Frequency (%) | References | ||||
---|---|---|---|---|---|---|---|---|
Asian | *2 | *3 | *2 | *3 | *17 | *4 | *17 | |
CPIC East Asian | 0.1 | 3.4 | 29.0 | 8.3 | 1.3 | 0.7 | <0.1 | 87 |
Hui Chinese | 4.6 | 49.4 | 5.2 | 88 | ||||
Han Chinese | 38.6 | 5.2 | 88 | |||||
North Asian | 18.5 | 4.4 | 70 | |||||
Orang Asli | 5.7 | 6.5 | 4.8 | 90 | ||||
CPIC South Asian | 10.7 | 10.2 | 34.3 | 0.9 | 17.3 | 7.9 | 0.1 | 87 |
Sinhalese | 10.2 | 5.1 | 44.7 | 0 | 89 | |||
Tamils | 10.6 | 2.8 | 38.2 | 1.4 | 89 | |||
Moors | 9.2 | 2.0 | 31.0 | 0 | 89 | |||
Caucasian | ||||||||
CPIC Caucasian (European and North American) | 12.6 | 7.1 | 14.6 | 0.6 | 21.3 | 18.1 | 0.3 | 87 |
Laks | 19.5 | 91 | ||||||
Avars | 15.5 | 91 | ||||||
Dargins | 16.0 | 91 | ||||||
Dutch | 11 | 20 | 13.3–14 | 0.0–0.2 | 20 | 18.4 | 0 | 75,128 |
South African Caucasian | 10 | 36 | 13 | 1 | 18 | 3 | 75 | |
Polish | 11 | 1 | 15 | 0 | 36 | 0 | 75 | |
Czech | 19 | 7 | 8 | 0 | 29 | 0 | 75 | |
Slovakian | 7 | 4 | 19 | 2 | 33 | 75 | ||
Mediterannean-South European | 12.5 | 0.1 | 42.0 | 2.0 | 70 | |||
Faroese | 8.8 | 5.3 | 18.8 | 0 | 15.4 | 33.4 | 93 | |
Cypriot | 19 | 7 | 21 | 4 | 11 | 40 | 75 | |
German | 12 | 5 | 17 | 0 | 21 | 10 | 75 | |
CPIC Middle Eastern | 13.2 | 9.3 | 13.2 | 2.3 | 21.6 | 7.8 | 1.6 | 87 |
Yemenite Jewish | 5.1 | 8.1 | 92 | |||||
Black | ||||||||
CPIC African American | 2.4 | 1.0 | 14.5 | 0.7 | 15.4 | 3.3 | 20.0 | 87 |
CPIC African | 2.3 | 1.2 | 18.3 | 0.3 | 20 | 6.4 | 18.1 | 87 |
Ethiopian | 7.7–14.7 | 6.0–12.4 | 13.6 | 1.8 | 94,129 | |||
Igbo | 29 | 0 | 8 | 14–22 | 95,130 | |||
North African | 8.7 | 0.2 | 70 | |||||
Kikuyu | 16 | 0 | 1 | 33 | 95 | |||
Zimbabwean | 13.1 | 0 | 2 | 34 | 96 | |||
South African Xhosa | 13 | 1 | 19 | 3 | 75 | |||
Black Brazilian | 7 | 3 | 16 | 0 | 20 | 10 | 11 | 97 |
Notes: The bold values represent allele frequencies which differ from the CPIC aggregated values for that broader ethnic category by an absolute magnitude of at least 5%. The indicator (*) is used as a standard notation for alleles of CYP450 genes.Go to:
Results
Number of ethnic categories used
In total, we found 66 different ethnic categories used which would be classified as “Caucasian” under NIH guidelines, 49 different ethnic categories for “Asian”, and 62 different categories for “Black”. Our analysis included papers published by authors from 31 different countries. The analysis for the individual number of categories for 8 countries which were selected using the criteria specified above can be found in Table 1.
Table 1
Number of racial and ethnic categories used for eight selected countries
Country | Number of categories used for Caucasian | Number of categories used for Asian | Number of categories used for Black |
---|---|---|---|
China | 5 | 11 | 2 |
Japan | 17 | 6 | 9 |
Malaysia | 3 | 24 | 2 |
Russia | 16 | 4 | 0 |
Singapore | 1 | 14 | 1 |
South Africa | 6 | 5 | 18 |
Sweden | 4 | 3 | 9 |
USA | 11 | 3 | 28 |
Total amongst all studies | 66 | 47 | 62 |
A high degree of heterogeneity was found in the number of ethnic categories used for each major NIH category between different countries. For categories falling under “Caucasian”, Japan had the highest number of different categories at 17, while Singapore had the fewest at 1. For categories falling under “Asian”, Malaysia had the highest number of different categories at 24, while Sweden and the US had the fewest at 3. Finally, for categories falling under “Black”, the US had the highest number of different categories at 28, while Russia had 0 categories due to having no studies involving blacks in our sample.
The categorization relationships between the ethnic categories for “Asian”, “Caucasian”, and “Black” can be seen in Figures 1–3, respectively. The classification of different ethnic categories appears to be highly heterogeneous. Ethnic categories were defined by race, skin color, continent, region, religion, nationality, language, and several other methods.
Ethnic differences in CYP2C9, CYP2C19, and CYP2D6 allele prevalence between Asians
The allele prevalence for selected ethnic groups whose frequencies varied from their broader CPIC racial group’s allele frequency can be found in Table 2. The allele prevalence for all ethnic groups examined in this study can be found in Table S3. The CPIC defined two major groups which could be considered as “Asian” under NIH guidelines: East Asians and South/Central Asians.
For CYP2C9 alleles, East Asians have a much lower prevalence of the *2 allele (0.1%) compared to South/Central Asians (10.7%) as well as the *3 allele (3.4% in East Asians compared to 10.2% in South/Central Asians) in the CPIC data.87 The Hui minority of China (4.6%) had a much higher prevalence of the *2 allele compared to other East Asians.88 Three ethnic populations in Sri Lanka, the Sinhalese, Tamil, and Moors, had similar *2 allele frequency (10.2%, 10.6%, and 9.2%, respectively) as South/Central Asians but lower *3 allele frequency (5.1% for Sinhalese, 2.8% for Tamil, and 2.0% for Moors) which was more comparable to East Asians.89
For CYP2C19 alleles, East Asians had similar prevalence of the *2 allele as South/Central Asians (29.0% to 34.3%, respectively), but higher prevalence of the *3 allele (8.3% to 0.9%, respectively) and lower prevalence of the *17 allele (1.3% to 17.3%, respectively) in the CPIC data.87 Both a Han Chinese population and a Hui Chinese population showed a higher prevalence of the *2 allele than other East Asians (38.6% and 49.4%, respectively).88 A Sinhalese population also showed a higher prevalence of the *2 allele (44.7%).89 A North Asian population and an Orang Asli indigenous population of Malaysia showed lower *2 allele prevalence (18.5% and 5.7%, respectively) than either East Asians or South/Central Asians.70,90
For CYP2D6 alleles, the *4 allele was less prevalent in East Asians compared to South/Central Asians (0.65% to 7.87%, respectively) and the *17 allele was rare in both (<1% for both) in the CPIC data.87 All populations were within a few percentage points of these categories for these alleles.
Ethnic differences in CYP2C9, CYP2C19, and CYP2D6 allele prevalence between Caucasians
The CPIC defined two major groups which could be considered “White” or “Caucasian” under NIH guidelines: Caucasian (European and North American) and Middle Eastern. For the CYP2C9 *2 and *3 alleles, these two populations have a fairly similar prevalence using the CPIC data (12.6% for Caucasians and 13.2% for Middle Easterners for the *2 allele; 7.1% for Caucasians and 9.3% for Middle Easterners for the *3 allele).87 A few ethnic categories showed notable differences from these CYP2C9 allele frequencies. The Lak, Avar, and Dargin ethnic minorities of Russia showed a higher prevalence of the CYP2C9 *3 allele (19.5%, 15.5%, and 16%, respectively), while for the CYP2C9 *2 allele a Czech population (19%) and a Cypriot population (19%) showed higher frequency.75,91 Both a Slovakian population and a Yemenite Jewish population had a lower frequency of the CYP2C9 *2 allele (7% and 5.1%, respectively).70,75,92 A Dutch population and a South African Caucasian population showed a higher prevalence of the *3 allele (20% and 36%, respectively), while a Polish population (1%) showed a lower prevalence.75
For CYP2C19 alleles, the prevalence was also similar between Caucasians and Middle Easterners in the CPIC data (14.6% for Caucasians and 13.2% for Middle Easterners for the *2 allele, 0.6% for Caucasians and 2.3% for Middle Easterners for the *3 allele, and 21.3% for Caucasians and 21.6% for Middle Easterners for the *17 allele).87 A Cypriot population showed a higher frequency of the *2 allele (21%) and a lower frequency of the *17 allele (11%).75 A Czech population showed lower frequency of the *2 allele (8%) but higher frequency of the *17 allele (29%).75 A Polish population, a Slovakian population, and a Mediterranean-South European population also showed a higher frequency of the *17 allele (36%, 33%, and 42%, respectively).70,75
For CYP2D6 alleles, Caucasians in the CPIC data had a higher prevalence of the *4 allele compared to Middle Easterners (18.1% to 7.8%, respectively), but a similar prevalence of the *17 allele (from 0.31% to 1.58%, respectively) in the CPIC data.87 A Faroese population showed a higher prevalence of the *4 allele (33.4%) than both CPIC Caucasians and Middle Easterners.93 The *17 allele was very rare in most Caucasian populations; however, it was common in a German (10%) and Cypriot (40%) population.75
Ethnic differences in CYP2C9, CYP2C19, and CYP2D6 allele prevalence between Blacks
The CPIC defined two major groups which could be considered “Black” under NIH guidelines, African-Americans and Africans. For the CYP2C9 *2 and *3 alleles, the allele frequency between these two populations was almost identical (2.4% prevalence of the *2 allele for Africans and 2.3% for African Americans, 1.0% prevalence of the *3 allele for Africans and 1.2% prevalence for African Americans) in the CPIC data.87 Ethiopians showed a higher prevalence of both alleles (7.7–14.7% prevalence of the *2 allele, 6–12.4% prevalence of the *3 allele) than either population.94
For CYP2C19 alleles, Africans and African Americans also showed similar prevalence for the *2 allele (14.5% for Africans and 18.3% for African Americans), the *3 allele (0.7% for Africans and 0.3% for African Americans), and the *17 allele (15.4% for Africans and 20% for African Americans) in the CPIC data.87 An Igbo population showed a higher prevalence of the *2 allele (29%) than either group.95 A North African population showed a lower prevalence of the *2 allele (8.7%) than either group.70
For CYP2D6 alleles, Africans showed slightly lower prevalence of the *4 allele than compared to African Americans (from 3.3% to 6.4%, respectively) but similar prevalence of the *17 allele (19.9% for Africans, 18.1% for African Americans) in the CPIC data.87 A Kikuyu population showed higher prevalence of the *17 allele (33%) than either population, as did a Zimbabwean population (34%).95,96 A South African Xhosa population showed a much lower prevalence of the *17 allele (3%) than either group, while a black Brazilian population showed a somewhat lower prevalence (11%).75,97Go to:
Discussion
We found a significant amount of heterogeneity in racial and ethnic classification categories across different countries. Among the eight countries we analyzed, the number of categories used for each major NIH racial category ranged from 1 to 17 for Asian ethnic groups, from 3 to 24 in Caucasian ethnic groups, and from 0 to 28 for black ethnic groups. The relationship between the ethnic categories used was highly complex, with many groups which could be considered overlapping as well as some groups which were hard to categorize into any of the major groups, such as the indigenous tribes of Malaysia.
This large range in number of ethnic categories used across different countries supports the idea that currently ethnic and racial classification in pharmacogenetics studies is highly inconsistent. The lack of consensus on how to classify certain ethnicities into broader racial categories can potentially create difficulty in accurately translating the results to a generate clinical guidelines. In addition, the use of broad racial categories has difficulty in accurately classifying ethnic minorities, such as indigenous tribes, whose genetic background can differ from the majority population within their region.98
We also identified several ethnic categories which have different allele frequencies for actionable CYP450 genes when compared to other ethnic groups within that NIH racial category. These groups often included ethnic minorities in major countries, such as the Hui minority of China or the indigenous tribes of Malaysia. For each NIH racial category and CYP450 gene we analyzed, we found at least one ethnic group whose allele frequency differed from the overall race category.
This suggests that the inconsistency in racial and ethnic categorization could have clinical consequences when ethnicity is used for pharmacogenetic guidelines. Pharmacogene allele distribution across the NIH racial categories is not heterogeneous, so making assumptions on an individual’s genotype based on their ethnicity may not be accurate. Based on our analysis, this appears to be particularly relevant for ethnic minority groups within the NIH racial categories.
The use of the NIH-defined categories of race for pharmacogenetic guidelines and pharmacogenetics research in its current form may be of limited utility and potentially even damaging in the clinical setting. The genetics of race is complicated and still poorly understood; thus, further research into the relationship between race and polymorphism prevalence may be needed in order to improve the precision of race and ethnicity-centered pharmacogenetic guidelines.
Race and ethnicity continue to be highly investigated topics for medical research. As clinical care continues to trend toward precision medicine, race and ethnicity continue to be investigated as potential biomarkers for providing more individualized care in lieu of genotyping information.
This is because public health and epidemiology research has found dramatic differences in health outcomes between ethnicities. For example, studies have found individuals of African descent to be at a significantly higher risk for prostate cancer than Caucasians.99,100 Such findings have spurred a renewed intensity for genetic research on minority populations in order to find genetic causes for disparate health outcomes.
Not surprisingly, the field of pharmacogenetics has sought to make use of race and ethnicity to stratify patients by genetic risk as well. Several clinical guidelines for drug therapy based on race have already been established through clinical trials.66 Perhaps, the most famous example of these is the antihypertensive medication BiDil, which was demonstrated to be effective only in African Americans and thus approved by the FDA for treatment of African Americans only.66–68 Other guidelines include the aforementioned usage of race in warfarin dosing algorithms and FDA recommendations for genetic testing in Asian patients using carbamazepine due to risk of Steven–Johnson syndrome.62,63,101,102 However, despite the current usage of race in the clinic, the clinical utility of race as a genetic marker remains highly contested.
The history of race in medicine has been a complicated and controversial one. In years past, race and genetics research had been used as a means to justify racial discrimination and a method to generate a hierarchy of races.103,104 Many current conceptions of race and racial classification originate from social and political ideas rather than having a scientific basis.103 The current most commonly used racial categories in the US are remnants of the works of 20th-century anthropologist Carleton S. Coon, who divided humans into five major races: Caucasoids, Congoid, Mongoloid, Capoid, and Australoids.105,106
The use of these categories continued to propagate for several decades, often for discriminatory purposes, until the model eventually fell out of favor.106 However, the general racial categories still remain, as can be seen in the NIH racial classification guidelines. Specifically, the 3 major continental populations from which the entire human population is thought to derive (namely, European/Caucasian, African, and Asian ancestry) are still widely used for study sample categorization as Caucasian, Black and Asian in significant number of pharmacogenetic articles. Although race-focused genetics research is no longer centered around discriminatory studies, the scientific validity of the current, most widely used classifications of race and ethnicity is still being questioned.107
Examining the genetic structure of human populations reveals inconsistencies in many of the traditionally used racial and ethnic categories. One of the concerns regarding the use of broad racial classifications is the effect of admixture on certain populations. Although African immigrants and African Americans with generations of residence in North America are both classified as “black” or even sometimes “African American” under NIH standards, their genetic ancestry can vary.79,80 European ancestry contributes a significant portion of genetic ancestry for African Americans, a number which also varies widely within the African American population in the US.81 For this reason, categorizing all individuals of African descent as “blacks” or “African Americans” may not accurately capture their genetic background.
Another group for which admixture presents difficulty in racial classification is the Hispanic/Latino population. Although they are typically treated as a homogeneous group, Hispanic/Latino populations actually represent a spectrum of admixture between Europeans, Africans, and Native Americans, with widely differing genetic contributions from each ancestral group between countries.108,109
As a result, some genetic studies now find it more accurate to refer to these groups as admixed Americans instead.110 Studies have also attempted to further stratify “Hispanic” populations into their country of origin in order to generate more precise identifiers of risk factors, including identification of genetic risk factors which differ between Hispanic populations.111–113 These distinctions represent examples of more precise racial and ethnic classification which can improve the accuracy of race and ethnicity-based clinical guidelines.
Due to the increased understanding of the genetic mechanisms underlying race, efforts have been made to generate more precise labels of the race and ethnicity of population samples.
Most notably, the 1000 Genomes project now provides extensive documentation of the ethnicity and geographical origin of their population samples.114 Studies are also attempting to more accurately capture the differences in genetic structure within broad racial categories, such as East Asians and Africans.115,116
These findings can potentially further stratify our currently used racial categories into more precise classifications. Such steps represent positive measures in improving the precision of the use of race and ethnicity in precision medicine and limiting overgeneralization of race characteristics in both research and clinical settings.
There are some limitations to our analysis. Our analysis does not factor in sample sizes or include statistical analysis of differences in allele frequencies; however, that is not the primary aim of our study. In addition, our analysis does not include Hispanics, who remain a socially important and historically understudied ethnic population.
Our study is certainly not comprehensive of all racial and ethnic categories currently in use, nor is it intended to be. Instead, we hope to paint a picture of the current heterogeneity present in race and ethnicity classification in pharmacogenetic studies, as well as the limitations of broad racial categories. The literature supports this notion, as genetic studies have increasingly trended away from broad categories such as “White” and “Black” and instead focus on the underlying ancestry of individuals and populations.114–116 As our understanding of population genetics increases, these categories could become increasingly outdated. The usage of such broad hierarchies in classifying the genetic makeup of individuals is potentially detrimental to the aims of precision medicine.
Currently, the role of race and ethnicity in genetics research is at an important crossroads. By and large, broad classifications such as black and white have been found to be inadequate from a biological standpoint, a result supported by our review. However, the role of race as a categorical variable for both clinical and research settings remains unclear. Powerful new genotyping technologies such as genome-wide association studies allow researchers to group populations into clusters based on genetic expression, rather than social or geographic characteristics.78,117
Genetics researchers have expressed enthusiasm that as these technologies become more accessible and widespread in the future, population clustering based on gene expression can become the new standard for population stratification in genetics research.118 However, there still remains a barrier in translating these findings from a research setting to the clinical setting due to the expense and limited availability of these technologies.
Establishing more precise guidelines for categorical racial classification may be necessary as a bridging step between the current classification methods and the precise genotyping technology of the future.
To our knowledge, our review is the first study which systematically categorizes the racial and ethnic labels used in pharmacogenetics research and relates them to the most commonly used continental race categories promulgated by NIH.
Previous studies have examined current practices of race in both the research and clinical settings and come to similar conclusions as ours. There is a growing recognition that use of ethnicity as a proxy of ancestry, which itself is being used as a proxy for genotype, may result in subpar predictions in precision medicine applications.66
Recent reviews recognized interethnic differences in drug response and their potential usefulness for implementing public health policies, designing and interpreting clinical trials, and identifying patients with higher probability of a particular drug metabolism phenotype.119,120 However, it is advocated that focus should be shifted from populations, ethnicities and races to inherent genetic individuality for effective medication optimization of individual patients.121,122
A panel of “ancestry-informative” markers identifying the geographic region of individual’s ancestors is being proposed as potential enhancement of traditional ethnic categories reflecting a complex mixture of social, cultural and geographic influences.123,124
Tang et al reported in 2005 comparison of genetic data from over 3,500 individuals who identified themselves as belonging to one of the major racial or ethnic groups in US (Caucasian, Black, Asian, and Hispanic) and showed near-perfect correspondence of genetic analysis with self-reported categories.125
However, another review examined self-reported race in 2015 in clinical settings, and its accuracy compared to genomic analysis of ancestry. This review found that the practice of self-reported race could be leading to a large number of false positives or false negatives due to its inaccuracy in capturing an individual’s genetic ancestry.126
A 2008 study which surveyed physicians on their attitudes regarding race and genetics in clinical medicine found that many physicians considered race to be a “complex and poorly defined subject” but still considered it an important variable in clinical decision-making. However, many of the surveyed physicians felt that the connection between race and genetics was tenuous and that a major reason for that was the biological inaccuracy of traditional, social race classification systems.127
Based on our findings, we recommend further investigation into the differences in pharmacogene allele frequency between ethnic minority populations using precise classification of their ethnic background and geographical ancestry. Such research can improve our understanding of how race and ethnicity affect pharmacogenetic factors, as well as provide more precise guidelines for clinical care.
We also recommend the establishment of more precise guidelines for race and ethnicity classification in clinical research. An example of this is how the 1000 Genomes project defines their populations.114 Their population categories use specific ethnic categories as well as the geographical origin of the population.
It is possible that there is a limited number of alleles whose frequency is comparable across various ethnic subgroups of a particular continental race category. However, as it was shown in this review, frequency of a large number of alleles may vary significantly within the major continental race categories depending on a particular geographic ancestry. Evidence-based consensus is necessary for meaningful use of self-identified race categories as well as for geographical ancestry nomenclature in pharmacogenetics.
Introduction of common taxonomy of geographical ancestry which integrates specific subpopulations of particular countries and is accepted by the entire scientific community for precision medicine research and practice is warranted in order to facilitate reproducible pharmacogenetic research and clinical implementation of its results. Consistent classification and specificity in defining ethnic populations can improve the precision and generalizability of pharmacogenetics research.
More information: Cody Lo et al. Pharmacogenomics in Asian subpopulations and impacts on commonly prescribed medications, Clinical and Translational Science (2020). DOI: 10.1111/cts.12771