Open access peer-reviewed chapter

Application of Data Science Approaches to Investigate Autoimmune Thyroid Disease in Precision Medicine

Written By

Ayodeji Folorunsho Ajayi, Emmanuel Tayo Adebayo, Iyanuoluwa Oluwadunsi Adebayo, Olubunmi Simeon Oyekunle, Victor Oluwaseyi Amos, Segun Emmanuel Bamidele and Goodness Olusayo Olatinwo

Submitted: August 27th, 2021Reviewed: October 14th, 2021Published: November 18th, 2021

DOI: 10.5772/intechopen.101220

Chapter metrics overview

86 Chapter Downloads

View Full Metrics


In recent times, the application of artificial intelligence in facilitating, capturing, and restructuring Big data has transformed the accuracy of diagnosis and treatment of diseases, a field known as precision medicine. Big data has been established in various domains of medicine for example, artificial intelligence has found its way into immunology termed as immunoinformatics. There is evidence that precision medicine tools have made an effort to accurately detect, profile, and suggest treatment regimens for thyroid dysfunction using Big data such as imaging and genetic sequences. In addition, the accumulation of data on polymorphisms, autoimmune thyroid disease, and genetic data related to environmental factors has occurred over time resulting in drastic development of clinical autoimmune thyroid disease study. This review emphasized how genetic data plays a vital role in diagnosing and treating diseases related to autoimmune thyroid disease like Graves’ disease, subtle subclinical thyroid dysfunctions, Hashimoto’s thyroiditis, and hypothyroid autoimmune thyroiditis. Furthermore, connotation between environmental and endocrine risk factors in the etiology of the disease in genetically susceptible individuals were discussed. Thus, endocrinologists’ potential hurdles in cancer and thyroid nodules field include unreliable biomarkers, lack of distinct therapeutic alternatives due to genetic difference. Precision medicine data may improve their diagnostic and therapeutic capabilities using artificial intelligence.


  • artificial intelligence
  • autoimmune disease
  • Big data
  • Graves’ disease
  • precision medicine
  • thyroid disease

1. Introduction

A breakthrough was launched for the field of personalized medicine when the president of the United States of America announced precision medicine in January 2015, presenting it for review and implementation by all healthcare professionals [1]. Since then, molecular characterization of patients which are more precise has been developed in the area which includes an increasing number of ‘omics’: (proteomics, genomics, transcriptomics, lipidomics, metabolomics and epigenomics), integration of genomic data, the rapid exchange of knowledge among researchers, bioinformatics which involves the retrieval and analysis of data stored in the large databases, and the growing world of Big data and artificial intelligence [1, 2]. These factors are introduced to drive clinicians towards diagnosis, follow-up and therapeutic decisions in precision medicine [2].

Data science applies the use of machine learning algorithms to audio, video, images, text, and numbers to develop artificial intelligence (AI) systems which are used in data processing and preparation of analysis, optimization and construction of integral models, which is further used in the combination of certain algorithm and consequently produce insights that analysts can translate to add value to existing knowledge [3].

One of the principal challenges in clinical endocrine practice is thyroid disease management. During the last years, continuous progress has been experienced in medical science. Also, some factors have improved our knowledge of this field from arithmetical to geometrical proportions. Some of the lists of these factors include accurate clinical assessment, understanding inter or intracellular reactions, and the environment’s influence on this reaction [2]. Most fields of science have undergone a big data revolution. The use of data science in personalized medicine is important for treating variability in autoimmune disorders, especially in patients with the presence of varying autoimmune diseases [4, 5]. Studies have also shown how data like the electronic health records (EHRs) initially designed to facilitate patients registration has been used as a tool in predicting thyroid diseases, as seen in some reports that link the EHRs data to extant genotypes to identify new gene locus like forkhead box E1 (FOXE1), which is associated with autoimmune thyroid diseases [6, 7, 8].

Genomic data is an important data in precision medicine. Therefore, most thyroid diseases such as autoimmune thyroiditis are known to have high heritability [8, 9]. Studies have reported high rate of Graves’ disease in monozygotic twins compared to dizygotic twins (in the range of 50–70%, compared with 3–25% respectively). Also, Hemminki and his co-worker reported the familial standardized incidence ratios for Graves’ disease to be 4.49 (for individuals whose parent was affected), 5.04 (for individuals with only a single sibling affected), while 310 (if the individual has two or more siblings affected), and 16.45 in twins [1, 8, 10]. For Hashimoto’s thyroiditis (HT), the sibling risk ratio was found to be 28 and this risk was confirmed in data obtained from Germany [8, 11, 12]. All this evidence shows the association of genetic susceptibility to autoimmune thyroid diseases.

A genome-wide association study (GWAS) of hyperthyroidism was carried out with a sample of 1317 hypothyroidism cases and 5053 controls which was algorithmically determined from five EMRDs (electronic medical record databases), one association was found with near forkhead box E1 (also known as thyroid transcription factor 2 (TTF-2)) [7]. Gene studies have also linked autoimmune hypothyroidism with PTPN22 (protein tyrosine phosphatase, non-receptor type 22), CTLA4 (cytotoxic T lymphocyte antigen 4) and HLA II (human leukocyte antigen class II region) [7, 8]. On the other hand, Graves’ disease has been studied in several genome-wide association studies, with the discovery of many loci [1, 7]. These associations are important in the diagnosis and treatment of autoimmune thyroid diseases.


2. Autoimmune thyroid diseases and data science

2.1 Autoimmune thyroid diseases (AITDs)

Autoimmune thyroid diseases (AITDs) are the most common autoimmune diseases in humans and it is divided based on the grade of lymphocytic infiltration [13]. They are more prevalent in females than males (i.e. they are 5–10 less frequent in men). Graves’ disease which is a disease associated with hyperthyroidism and Hashimoto’s thyroiditis which is also associated with hypothyroidism are the major types of AITDs [13].

2.1.1 Graves diseases (GD)

Graves’ disease is the most common cause of hyperthyroidism, which affects people at any age but most prevalent in adults, the incidence of this disease peaks between 30 and 50 years [14]. It is also characterized by goiter, ophthalmopathy [15].

2.1.2 Hashimoto’s thyroiditis (HT)

HT has now been considered the most common AITD [16], the most common endocrine disorder [17] and also the most common cause of hypothyroidism [18, 19]. It can be divided into primary and secondary forms, the primary form is the most common thyroiditis and the secondary is the more recent description of thyroiditis [20].

2.2 Causes of AITDs

The factors that result in AITDs are genetic factors and environmental factors. Various susceptibility genes like HLA-DR gene locus and non-MHC genes which includes CTLA-4, CD40, PTPN22, CD25, FOXP3, thyroglobulin and TSH receptor genes have been identified and characterized [21]. The major environmental triggers that have been identified are; iodine, selenium, medications, smoking and stress, infection, sex steroids, pregnancy, fetal microchimerism and radiation exposure [22, 23].

The risk of developing Graves’ disease is influenced by genetic factors accounting for up to 80%, while environmental factors account for up to 20% [24, 25, 26]. The mechanisms involved in immune tolerance are destroyed by these environmental factors in genetically predisposed people leading to the onset of the disease [24, 26].

In Hashimoto thyroiditis, genetic and environmental factors also contribute to the development of HT.

2.3 Pathogenesis of AITDs

Many factors play a role in the pathogenesis of AITDs, mostly involving the complex interaction of the genetics and environmental factors, immune system and cytokines [27]. The pathogenesis of AITDs results from either cell-mediated autoimmune and endocrine autoimmunity [26]. Thyroid peroxidase antibodies are potent marker of AITDs [27]. Its levels associated with the expression of MHC on thrococytes and with a degree of infiltration by lymphocytes may sensitize and trigger the synthesis of autoantibodies [28]. They are involved in both the immune system and directly targeting the thyroid follicular cells [27]. Their presence has been identified within inflammatory and thyroid follicular cells [29]. Cytokines enhance inflammatory responses by stimulating both B and T lymphocytes, resulting in antibody production and damage to the thyroid tissue by apoptosis in particular HT [30]. In addition, T cells subtypes have also been recently discovered to play a role in the pathogenesis of AITDs [31, 32, 33].

In Graves’ disease, pathogenesis is a complex process, it involves the TRAbs which are antibodies against the thyroid-stimulating receptors [34]. TSH receptor antibodies (TRAb) mimics the function of TSH and it causes the disease by binding to the TSH receptor thereby stimulating or inhibiting thyroid cells in producing thyroid hormones (T3 and T4) [35]. The TRAbs binding to the TSH receptors leads to continuous and uncontrolled thyroid stimulation associated with the synthesis of thyroid hormone in excess and thyroid hypertrophy [35].

In Hashimoto thyroiditis, the pathogenic mechanism involves the contribution of cellular immunity in the form of the defect in the suppressor T cells as well as regulatory T cells, follicular helper T cells, cytotoxicity and apoptosis and humoral immunity in the form of TPO/TG antibodies and immunoglobin subclass, sodium iodide symporter (NIS) and pendrin antibodies, thyroid-stimulating hormone receptor (TSHR) antibodies and also the role of cytokines and DNA fragments and micro RNA [36]. All these have been observed to play an important role in the pathogenesis of HT.S.

2.4 Management of AITDs

The recent landmark in the management of HT disease and GD disease will be discussed as it is the major form of AITDs.

2.4.1 Hashimoto’s thyroiditis

Since it discovery, various understanding has been made about this condition. It has been reviewed that a grading system might be a better method of classifying hypothyroidism due to the continuous change that is observed in the serum level of TSH and free thyroxine (T4) than differentiating it into clinical and subclinical forms [37]. With this consideration, it becomes difficult to determine a starting point for thyroid hormone therapy supplementation which is ideal enough. A randomized trial (TRUST) initiated by the European Commission (2012) aids the understanding of the effects of levothyroxine (LT4) in the treatment of subclinical hypothyroidism [37].

Reoccurrence of symptoms was observed in 5–10% of patients with hypothyroidism despite receiving LT4 treatment and having a normal serum TSH levels [38]. A guideline has been provided by European Thyroid Association (ETA) on the combination therapy of LT4 and LT3 as superior to T4monotherapy and LT4 mono-therapy [38].

2.4.2 Graves’s diseases

Since the inception of GD, it has been treated by antithyroid drugs, radioactive iodine and surgery. Preexisting guidelines were used in the management of GD but recently a detailed guideline has been provided separately for subclinical hyperthyroidism, although they are not supported by randomized clinical trial [39]. Radioiodine is used in the treatment of Grave’s disease [40]. It connects to thyroid autoimmunity through thyroid cell death in which self-antigens are liberated from the thyroid gland following the exposure to the therapy until complete ablation has been achieved [40]. Treatments of GD with antithyroid drugs gives favorable and unfavorable response in patients [40].

With all the recent studies on the management of GD, each management plan is associated with its limitation and a definite plan for the management of GD has not been confirmed. To provide a permanent treatment plan for the disease, researchers are: looking at the aspects of creating a new drug that will d preventing the disease without destroying or removing the thyroid gland and also avoiding the recurrence of the disease. The results of recent in vivo experiments are quite promising [41].

In both diseases, vitamin D has been reviewed to play a significant role in the modulation of the immune system, enhancing the innate immune response while it also exerts an inhibitory action on the adaptive immune system [42].

2.5 General investigation of AITDs

This is based on clinical features and laboratory investigation. The circulating antibodies is a core determinant of AITDs as they are measured against TPO and TG. A negative test excludes AITDs, but a positive test infers AITDs, each type of disease depending on the presence of either antibody. The measurement is done using thyroid receptors assays or bioassays [37].

2.6 Data science approaches to investigate autoimmune diseases

At a time when computer processing power keeps increasing exponentially while networks keep expanding, data available at the same time becomes overwhelming and it becomes imperative to marry the field of data processing and computer so as to take full advantage of the available data as it already exceeds the processing capacity of manual methods and conventional database approach [43]. Data science as a field supports the process of taking data-driven decisions while depending largely on “Big data” storage, engineering and analysis [43]. Therefore thinking data science application in a field implies the intention to gather data, process such data, analyze and utilize such data for the purpose of understanding illness, understanding the reason for such illness (diagnosis), understanding how the illness is progressing (prognosis), understanding the possible endpoint of such illness (prediction) and understanding the intervention that could bring the best out of such situation (treatment/recommendation) [44].

Autoimmune diseases are dangerous or disruptive disease conditions that affect the tissues of the body, which is facilitated by the susceptible genes present in the host and environmental factors where the body’s immune system attacks itself through the presentation and recognition of specific antigens and the response of the target organs [45].

2.6.1 Data science approaches

In an attempt to harness the recent and innovative development taking place with regards to computing infrastructure, methods of data processing and tools for data analysis, the discipline of data science is evolving with serious evolving challenges. Cluster computing and cloud computing are fundamental components of data science that enhance usage of powerful algorithms necessary to access, visualize, interpret, organize, analyze, and rapidly with a reasonable degree of efficiency manage cross-scale big data necessary for enhanced use of artificial intelligence. The availability of big data and the advancement in the field of artificial intelligence has led to the development of various machine learning algorithms, deep learning algorithms and deep neural networks algorithms to process big data considering its high volume and complexity.

2.6.2 Machine learning

One big question that has been raised in the field of computing is the question of how to design and enable computers that are capable of improving automatically through the various experience without explicit instructions and limited human intervention. Such question was answered by the birth of the field of machine learning which stands as one of the most rapidly growing technical fields today which is a point where computer science intersects with statistics and stands as the heart of artificial intelligence and data science [46]. The mechanism of machine learning, a rapidly developing arm of computational algorithms, is to simulate and emulate human reasoning and intelligence by allowing the designed system to learn from the environment. Low cost of computation, online access and availability of data, discovery of new theories and new learning algorithms among other are forces that drives machine learning [46]. Different machine-learning algorithms has been made with the intention to solve various machine learning related problems and use the large variety of data types [47, 48]. Conceptually, what machine-learning algorithms do can be perceived as running through a large selection of the program to select a program of choice and this choice is guided by experience acquired through training and the choice would be a program that optimizes the performance metric. The great range of variation seen in machine-learning algorithms depends in part on the method by which the algorithm represents its candidate programs (e.g., mathematical functions, decision trees, and general programming languages) [47]. The variation is also dependent on the method through which such algorithm search through this list of programs (e.g., optimization algorithms with well-understood convergence guarantees and evolutionary search methods that evaluate successive generations of randomly mutated programs) [47]. Supervised learning stands as the most widely employed method of training machine learning algorithms [47].

2.6.3 Deep learning

Deep learning involves the use of computational models that are made up of multiple layers of processing, which are capable of learning using representations of data with multiple levels of abstraction. Deep learning methods have rapidly and progressively improved technologies available for recognizing and processing speech, recognizing and identifying visual objects, and many other domains. Deep learning has also been useful in fields such as drug discovery and genomics. Conventional machine-learning techniques were limited in their ability to process natural data in their raw form. However, deep learning using multiple levels of abstraction and representation that is obtained by making simple but non-linear modules that can transform the representation at one level (starting with the raw input) into a representation at a higher, slightly more abstract level and with the composition of enough of such transformations, very complex functions can be learned [49].

2.6.4 Deep neural network

Multiple levels of non-linearity in the networks of artificial neurons that makes up deep multi-layer neural networks enables such algorithm to compactly represent functions which are non-linear and highly-varying. Some interesting characteristics of neural network-based systems include the fact that they can learn and adapt while learning because they consist of an architecture of artificial neurons which are wired to form networks that are arranged in layers, has a loss or optimisation function driving the learning process and possess a training algorithm constantly run through changing parameters [50].


3. Application of data science in the treatment of autoimmune thyroid diseases

Data science is known to encompass the preparation of data for analysis, this includes aggregating, cleaning, and manipulating the data to uncover patterns and draw out insights. Exploiting historical clinical datasets to improve future treatment choices has proved beneficial for both patients and physicians [43, 51]. Through machine learning (a branch of artificial intelligence), it is very possible to obtain patterns within patient data, the exploitation of these patterns helps to predict and treat patients in order to improve clinical disease management [52].

Machine learning also features selection algorithms such as Kruskal-Wallis’ analysis, Fisher’s discriminant ratio, and Relief-F. In some research, these algorithms have been used to analyze databases containing clinical features (such as U.S. Surveillance Epidemiology and End Results (SEER) database) from identified thyroid disease patients [51].

Also, the discovery of data mining has been essential in the health care sector as its application have been reported in drug delivery, disease predictions and abnormality detections. Electronic health records have provided access to vast clinical data, the application of data mining techniques has helped transform this data information into valuable knowledge for making health care decisions [53]. Also, data mining algorithms have been used on health record data sets to analyze factors contributing to autoimmune diseases such as those associated with thyroid disease [54].

Although the major autoimmune thyroid disease include Graves’ disease and Hashimoto’s thyroiditis [55], these diseases are different clinically. Genetic data shows that their pathogenesis shares immuno-genetic mechanisms. Some shared susceptibility genes include human leukocyte antigen DR containing arginine at position (β74 HLA-DRβ1-Arg74). Exploring the genetic-epigenetic interactions of autoimmune thyroid pathogenesis is essential to uncover new therapeutic targets [55], this suggests how important genetic datasets are in developing therapeutic targets.

Precision medicine has also been implemented in a therapeutic approach to autoimmune thyroid disease such as Graves’ disease [1]. Therefore, recent therapies are targeting a key co-stimulatory molecule usually expressed on antigen-presenting cells (CD40), due to this, anti-CD40 monoclonal antibody has been developed [56]. Studies on genetic data suggest that genetic polymorphisms in the CD40 gene drive its expression and response to anti-CD40 monoclonal antibody like Iscalimab (also known as CFZ 533), which is a full human IGg1 [56, 57]. Furthermore, studies established that thyroglobulin antibody (TgAb) and thyroid peroxidase antibody (TPOAb) are the most characteristic autoimmune antibodies to Hashimoto’s thyroiditis [58].

The aim of analyzing datasets (such as genomic datasets and electronic health records) in precision medicine of autoimmune thyroid disease is to determine the treatment options, manner of implementation and choice of therapy. Lastly, this section demonstrate that existing medical datasets has been a reliably strength in clinical predictions, thus, it helps medical practitioners to make an informed and optimized treatment decisions. Figure 1 illustrates the steps in the application of data science to treat autoimmune thyroid disease.

Figure 1.

Shows the steps taken in applying data science to treat autoimmune thyroid disease (AITD).

3.1 Biological agents in treatment of Graves’s disease

Biological agents are usually precise for a specified target, a few have subsequently renowned standard target (e.g. rituximab for B-lymphocytes) [59]. Considering specific agents with specific targets is the strategy that aid to achieve cure for this autoimmune disease [60]. Some biological agents involved in novel treatment of Grave’s disease include:

  1. Rituximab (RTX): rituximab is an anti-B cell agent (monoclonal chimeric antibody) that is against the transmembrane protein CD20 on B cells (but not plasma cells) [61]. Intraorbital administration of rituximab has been shown to be effective as opposed to high dose of systemic glucocorti-coids in the treatment of thyroid-related orbitopathy in grave disease [62, 63].

  2. Adalimumab: T-cells expressing IGF-1 receptors are assumed to show a central role in mediating the autoimmune process in severe grave’s disease [64]. Adalimumab is one of the anti-T-cell agents which seems to have efficacy similar to that of infliximab. It is a human monoclonal IgG1 antibody which clings to both soluble and membrane-bound TNF (tumor necrosis factor), it also repairs complement and induces lysis of cells expressing membrane-bound TNF [64, 65].

  3. Intravenous immunoglobulin: strategically using anti-auto-antigen to stimulate the thyroid but not blocking autoantibodies are highly predominant in severe and vigorous thyroid-associated orbitopathy [66]. Therapeutic measures aiming at the autoantibodies may be effective, even though such consideration must be cross-checked in determining if the presence of such autoantibodies is truly causal or a threat [67].


4. Application of data science in the diagnosis of autoimmune thyroid diseases

4.1 Application of data science in the diagnosis of Graves’ disease

The most common cause of autoimmune hyperthyroidism is Graves’ disease, which primarily affects the thyroid gland. In Graves’ disease, the main auto-antigen is the TSH receptor (thyroid-stimulating hormone receptor (TSHR)), expressed primarily in the thyroid and secondarily in adipocytes, fibroblasts, among others sites. It also appears to be closely related to the insulin-like growth factor 1 (IGF-1) receptor [68]. This disorder presents a systemic clinical manifestation that affect vital organs like the heart, liver and eyes. Failure to diagnose this disease on time can predispose thyroid storm, which carries high morbidity and mortality. Therefore, it is imperative to diagnose and manage the disease early in other to prevent severe cardiac complications such as atrial fibrillation, atrial flutter, and high output cardiac failure [69].

Data mining and machine learning have been reported to play an important role in diagnosing diseases, as they provide a vast classification of accurate techniques for the prediction of disease. Patient data collected from healthcare organizations is useful for accessing the risk factors analysis of diseases such as autoimmune thyroid disease. Classification algorithms is one of the most important applications in the data mining field, which can be used to make decisions in many real-world problems [51, 54]. A recent study uses 34 unique clinical data (variables) such as patients’ age at the time of diagnosis and information regarding lymph nodes to build novel classifiers that distinguish patients who probably live for over ten years since diagnosis from those who did not survive at least five years. This report also shows there is 94.5% accuracy in distinguishing patients in terms of prognosis using machine learning [51].

The diagnosis of Graves’ disease begins with a thorough historical and physical examination. The historical examination includes the data recorded from family history for Graves’ disease, while the physical examination includes assessing goiter size by ultrasound [69, 70]. Dr. Cech began the discussion of precision medicine in the domain of thyroid disease, according to him, the use of radioisotopes to treat hyperthyroidism and thyroid cancer is one of the first uses of precision medicine in thyroid disease [71]. Researchers from the field of endocrine practice investigated Graves’ disease retrospectively by collecting data such as disease severity, smoking rate and severity of orbitopathy [70]. Studies have also reported that TSHR antibodies and activated T cells play a major role in the pathogenesis of Graves’ orbitopathy, this role is by activating adipocyte TSHR, retroocular fibroblast and IGF-1 receptors, also plays an important role by initiating a retro-orbital inflammatory environment [68].

Since the advent of precision medicine, its future application in thyroid dysfunction suggests developing new approaches in quantifying, detecting, and analyzing biomedical information. Since the description of Graves’ disease by Robert Graves, it is known that several environmental and epigenetic factors influence the onset of this disease. Also, some susceptibility elements, such as particular genotypes of HLA, CTLA-4, CD40 or thyroglobulin have been identified. Furthermore, recent data has shed more light on how an epigenetic-genetic interaction between a noncoding single nucleotide polymorphism (SNP) (coded within the TSH receptor (TSHR) gene) alters the thymic expression of TSHR, which further triggers Graves’ disease [72, 73, 74].

4.2 Application of data science in the diagnosis of Hashimoto’s thyroiditis (HT)

Hashimoto’s thyroiditis (HT), also known as chronic lymphocytic thyroiditis or chronic autoimmune thyroiditis, is one of the common autoimmune thyroid diseases that can cause an increased tumor vulnerability and raise the chances of developing chronic heart disease diseases especially in individuals with Hashimoto’s thyroiditis [75]. The biochemical markers for Hashimoto’s thyroiditis are thyroid peroxidase and thyroglobulin autoantibodies in the serum, with greater dominance in females than males. The most significant biochemical etiology of this disease is the presence of thyroid autoantibodies (TAbs) in the patients’ serum against two vital thyroid antigens, which are thyroid peroxidase (TPO) and thyroglobulin (TG) [76]. The diagnosis of Hashimoto’s thyroiditis (HT) usually causes many controversies, and sometimes until the late stage of occurrence before proper diagnosis can yield result. The use of data science to predict the presence of this dysfunction is key to modern day precision medicine. Firstly, through epidemiological study of the disease pattern in areas where iodine intake is normal or excessive, considering age factor, pathogenesis of autoimmune thyroiditis in monozygotic twins as compared with dizygotic twins [77].

Diagnosis of Hashimoto’s thyroiditis (HT) is made by examining a diffuse, smooth, firm goiter in a young woman, with strongly positive titers of TG Ab or TPO Ab and a euthyroid or hypothyroid metabolic condition. This disease caused by immunological damage show conditions that are severe and can cause further complications. Reviewed works of autoimmune hypothyroidism in monozygotic twins, shows there is a corresponding rate below 1 which is traceable to environmental factors and thus, making this factors to be etiologically significant [78]. In precision medicine, the study of genomics can be used to diagnose autoimmune thyroid disease, most especially Hashimoto’s thyroiditis. Genotyping analysis to show the genes that are susceptible to environmental factor endocrine disruptors, taking note of the influence of age, weight, sex, timing, and race to show endocrine levels [76].

4.3 Pathogenesis of autoimmune Hashimoto thyroiditis

The presence of TAbs (thyroid autoantibodies) in the patients’ sera is the principal biochemical characteristic of HT disease. The Tabs is against two major antigens which are, thyroid peroxidase (TPO) and thyroglobulin (Tg). The TPO antigen is crucial for thyroid hormone synthesis and they are located on thyrocyte’s apical membrane, while the Tg are large glycoprotein within the follicular cells of the thyroid gland and they serves as storage for thyroid hormones [76, 77, 78].

The principal factor that drives the pathogenesis of HT is the antibodies against TPO (TPOAbs) and Tg (TgAbs) (in immunoglobulin G (IgG) class). Unlike TgAbs, the TPOAbs damage thyroid cells due to its antibody dependent cell cytotoxicity but both shows great affinity for their respective antigens. Furthermore, studies reported that they both have limited role in the pathogenesis of HT but both T-cell cytotoxicity and apoptotic pathway activation influence the disease onset [77, 78]. Although, the TAbs serves as a biomarker for thyroid autoimmunity but TPOAbs are presented in over 90% of HT patients, while 80% of the patients presents TgAbs [77]. Also, T helper cell type 2 (Th2) has been reported to lead to an excessive stimulation of B cells and production of plasmatic cells that produce antibodies against thyroid antigens leading to autoimmune thyroiditis [78].

Table 1 shows some factors that can influence HT [77, 79].

Genetic factorA strong genetic susceptibility has been shown to be associated with the disease incidence, development and severity. Of this genes, CTL antigen-4, Tg, vitamin D receptor, cytokines, TPO and PTPN-22 (Protein Thyrosine Phosphatase nonreceptor-type-22) are the most important
Endogenous factorMost important endogenous factor for this disease are female sex, fetal microchimerism, pregnancy and postpartum period
Environmental factorMost important factors that influence this disease development are drugs, iodine intake, chemicals/toxins and infections
Self-toleranceAltered self-tolerance complemented with increased antigen presentation is a strong cause of HT

Table 1.

Factors that initiates Hashimoto thyroiditis.

4.4 Importance of data science in thyroid diseases

Studies have reported a vast prediction algorithms that help in classifying, monitoring and suggesting treatment regimen for thyroid diseases, therefore the importance of data science is to serve as early approach to diagnosis, prognosis and treatment of thyroid diseases. Below are studies that achieve a high percentage of accuracy with new data approaches to investigate and treat thyroid diseases.

Since proper interpretation of thyroid functional data is an important issue in the classification of thyroid disease [80], thyroid disease dataset from UCI machine learning database has been used in comparative thyroid disease diagnosis. This was attained by using probabilistic, multilayer and learning vector quantization neural networks [81]. Likewise, Polat et al., also make use of dataset from UCI machine learning repository to diagnose thyroid diseases by hybridizing AIRS (artificial immune recognition system) which was first proposed by A. Watkins, with developed Fuzzy weighted pre-processing. The classification obtained from this study is about 85% accurate [80].

Moreover, Ruggeri et al., use data recordings of medical history, assessment of selected autoantibodies profiles and physical examination to delineate clinical patterns in patients with Hashimoto thyroiditis from pediatric/adolescent to adult age. It was found out that there is high prevalence of non-thyroidal autoimmune diseases (NTADs) in HT patients and this is also influenced by the patient’s age [82]. Therefore, NTADs should be watch out for in patients confirmed to be affected by Hashimoto thyroiditis. Hence, exploring clinical dataset with data science has helped in the prognosis of autoimmune thyroid disease.

Some of the recently proposed algorithms with high accuracy are Expert System for Thyroid Disease Diagnosis (ESTDD), this is an expert system that diagnose thyroid diseases via neuro fuzzy rules with about 95% accuracy [54, 83].

In addition, classification based data mining has also played important role in providing significant diagnosis, decision making and proper treatment for thyroid diseases at early stage. Some data mining algorithms have shown a very high accuracy, speed, performance and low cost for treatments [54]. Example of these algorithms that helps to find better treatments for thyroid patients are kNN (k nearest-neighbor), support vector machine, ID3ara and Naïve bayes [54]. Lastly, novel intelligent hybrid decision support system was utilized in the diagnosis of thyroid disorder, the classification analysis made by algorithms were sensitive, specific and high in accuracy (94.7%, 99.7% and 98.5% respectively). It was also reported that this approach can be applied to other deadly diseases [84].

4.5 Challenges in diagnosing and treating autoimmune thyroid disease

Given the ease of diagnose and treatment of thyroid disease, expectations are high on the specific and personalized approach to the diagnosis and treatment of such disease. However, some aspect of the methods of diagnosis and treatment needs improvement to enhance the health of thyroid disease patients. Table 2 discusses few of the challenges that has been identified or associated with the management of thyroid related diseases.

DiagnosisCharacterizing the common and individualized genetic background autoimmune thyroid diseases
Identifying environmental endocrine factors that enhance the development of thyroid diseases
Predicting the chances of anti-thyroid drug side effects in a particular patient
TreatmentDevelopment of a new thyroid gland from stem cells (for hypothyroid patients (Hashimoto’s disease)
Development of blocking molecules for the self-activated TSH-receptor
Development of small molecule targeted at thyroid autoantibodies (or gland antigens) to counteract their activity autoimmune thyroid disease
Precisely tailoring thyroid hormone replacement dose to any patient according to individual needs
Develop precise targeted immune therapy for the autoimmune disease
Use of genomic data to predict the chances of acquiring an autoimmune disease in a patient

Table 2.

Challenges in diagnosing and treating autoimmune thyroid disease [68].


5. Conclusion

Data science has been shown to be a useful tool in preparing, aggregating, cleaning, and manipulating clinical data to uncover disease patterns and draw insights into how the disease can be treated. Also, genomic datasets in databases have been utilized in precision medicine to diagnose and treat patients. These facts show green light for data science usage by medical practitioners and researchers in the near future.


6. Recommendations

It is recommended that data science be incorporated into clinical practice to improve precise targeted immune therapy for autoimmune thyroid diseases. Also, it is recommended that more research be carried out using genomic data to further bolster the precision from these data in the diagnosis and treatment of individual patients.


  1. 1.Galofré JC, Díez JJ, Cooper DS. Thyroid dysfunction in the era of precision medicine. Endocrinología y Nutrición (English Edition). 2016;63(7):354-363
  2. 2.Tumino D, Grani G, Di Stefano M, Di Mauro M, Scutari M, Rago T, et al. Nodular thyroid disease in the era of precision medicine. Frontiers in Endocrinology. 2020;10:907
  3. 3.Ng A, Soo K. Numsense! Data Science for the Layman: No Math Added. Singapore: Annalyn Ng and Kenneth Soo; 2017
  4. 4.Teschendorff AE. Avoiding common pitfalls in machine learning omic data science. Nature Materials. 2018;18(5):422-427. DOI: 10.1038/s41563-018-0241-z
  5. 5.Cho JH, Feldman M. Heterogeneity of autoimmune diseases: Pathophysiologic insights from genetics and implications for new therapies. Nature Materials. 2015;21:730
  6. 6.Denny JC, Bastarache L, Roden DM. Phenome-wide association studies as a tool to advance precision medicine. Annual Review of Genomics and Human Genetics. 2016;17(1):353-373. DOI: 10.1146/annurev-genom-090314-024956
  7. 7.Eriksson N, Tung JY, Kiefer AK, Hinds DA, Francke U, Mountain JL, et al. Novel associations for hypothyroidism include known autoimmune risk loci. PLoS One. 2012;7(4):e34442
  8. 8.Hwangbo Y, Park YJ. Genome-wide association studies of autoimmune thyroid diseases, thyroid function, and thyroid cancer. Endocrinology and Metabolism. 2018;33(2):175-184
  9. 9.Ajayi A, Adebayo O, Adebayo E. Precision Medicine of Autoimmune Diseases. London: IntechOpen; 2020. DOI: 10.5772/intechopen.95248. Available from:
  10. 10.Hemminki K, Li X, Sundquist J, Sundquist K. The epidemiology of Graves’ disease: Evidence of a genetic and an environmental contribution. Journal of Autoimmunity. 2010;34:J307-J313
  11. 11.Villanueva R, Greenberg DA, Davies TF, Tomer Y. Sibling recurrence risk in autoimmune thyroid disease. Thyroid. 2003;13:761-764
  12. 12.Dittmar M, Libich C, Brenzel T, Kahaly GJ. Increased familial clustering of autoimmune thyroid diseases. Hormone and Metabolic Research. 2011;43:200-204
  13. 13.Banga JP, Schott M. Autoimmune thyroid diseases. Hormone and Metabolic Research. 2018;50:837-839
  14. 14.Zimmermann MB, Boelaert K. Iodine deficiency and thyroid disorders. The Lancet Diabetes and Endocrinology. 2015;3:286-295
  15. 15.Bartalena L. Diagnosis and management of Graves disease: A global overview. Nature Reviews. Endocrinology. 2013;9(12):724-734. DOI: 10.1038/nrendo.2013.193
  16. 16.McLeod DS, Cooper DS. The incidence and prevalence of thyroid autoimmunity. Endocrine. 2012;42:252-265
  17. 17.Golden SH, Robinson KA, Saldanha I, Anton B, Ladenson PW. Clinical review: Prevalence and incidence of endocrine and metabolic disorders in the United States: A comprehensive review. The Journal of Clinical Endocrinology and Metabolism. 2009;94:1853-1878
  18. 18.Delemer B, Aubert JP, Nys P, Landron F, Bouee S. An observational study of the initial management of hypothyroidism in France: The ORCHIDEE study. European Journal of Endocrinology. 2012;167:817-823
  19. 19.Vanderpump MP. The epidemiology of thyroid disease. British Medical Bulletin. 2011;99:39-51
  20. 20.Caturegli P, De Remigis A, Rose NR. Hashimoto thyroiditis: Clinical and diagnostic criteria. Autoimmunity Reviews. 2014;13(4-5):391-397. DOI: 10.1016/j.autrev.2014.01.007
  21. 21.Camperio C, Caristi S, Fanelli G, Soligo M, Del Porto P, Piccolella E. Forkhead transcription factor FOXP3 upregulates CD25 expression through cooperation with RelA/NF-kappaB. PLoS One. 2012;7:e48303
  22. 22.Iddah MA, Macharia BN. Autoimmune thyroid disorders. International Scholarly Research Notices. 2013;2013:Article ID 509764, 9 p. DOI: 10.1155/2013/509764
  23. 23.Tomer Y, Menconi F. Interferon induced thyroiditis. Best Practice & Research. Clinical Endocrinology & Metabolism. 2009;23:703-712
  24. 24.Brand OJ, Gough SCL. Genetics of thyroid autoimmunity and the role of the TSHR. Molecular and Cellular Endocrinology. 2010;322:135-143
  25. 25.Morshed SA, Latif R, Davies TF. Delineating the autoimmune mechanisms in Graves’ disease. Immunol. Res. 2012;54:191-203
  26. 26.Morshed SA, Latif R, Davies TF. Delineating the autoimmune mechanisms in graves’ disease. Immunologic Research. 2012;54:191-203
  27. 27.Mikoś H, Mikoś M, Obara-Moszyńska M, Niedziela M. The role of the immune system and cytokines involved in the pathogenesis of autoimmune thyroid disease (AITD). Endokrynologia Polska. 2014;65(2):150-155. DOI: 10.5603/ EP.2014.0021
  28. 28.Caturegli P. Autoimmune hypophysitis: An underestimated disease in search of its autoantigen(s). The Journal of Clinical Endocrinology and Metabolism. 2007;92:2038-2040
  29. 29.Ajjan RA, Weetman AP. Cytokines in thyroid autoimmunity. Autoimmunity. 2003;36:351-359
  30. 30.Palazzo FF, Hammond LJ, Goode AW, et al. Death of the autoimmune thyrocyte: Is it pushed or does it jump? Thyroid. 2000;10:561-572
  31. 31.Ramos-Leví AM, Marazuela M. Pathogenesis of thyroid autoimmune disease: The role of cellular mechanisms. Endocrinology and Metabolism. 2016;31:379-385. DOI: 10.1016/j.endonu.2016.04.003
  32. 32.Antonelli A, Ferrari SM, Corrado A, Di Domenicantonio A, Fallahi P. Autoimmunethyroid disorders. Autoimmunity Reviews. 2015;14:174-180
  33. 33.Weetman AP. Autoimmune thyroid disease: Propagation and progression. European Journal of Endocrinology. 2003;148:1-9
  34. 34.Prabhakar BS, Bahn RS, Smith TJ. Current perspective on the pathogenesis of Graves’ disease and ophthalmopathy. Endocrine Reviews. 2003;24:802-835
  35. 35.Quadbeck B, Hoermann R, Hahn S, et al. Binding, stimulating and blocking TSH receptor antibodies to the thyrotropin receptor as predictors of relapse of Graves’ disease after withdrawal of antithyroid treatment. Hormone and Metabolic Research. 2005;37:745-750
  36. 36.Ajjan RA, Weetman AP. The pathogenesis of Hashimoto’s thyroiditis: Further developments in our understanding. Hormone and Metabolic Research. 2015;47(10):702-710. ISSN: 0018-5043
  37. 37.Hiromatsu Y, Satoh H, Amino N. Hashimoto’s thyroiditis: History and future outlook. Hormones (Athens, Greece). 2013;12:12-18
  38. 38.Canaris GJ, Manowitz NR, Mayor G, Ridgway EC. The Colorado thyroid disease prevalence study. Archives of Internal Medicine. 2000;160:526-534
  39. 39.Biondi B, Bartalena L, Cooper DS, Hegedus L, Laurberg P, Kahaly GJ. The 2015 European Thyroid Association guidelines on diagnosis and treatment of endogenous subclinical hyperthyroidism. European Thyroid Journal. 2015;4:149-163
  40. 40.Smith TJ, Hegedüs L. Graves’ disease. The New England Journal of Medicine. 2016;375(16):1552-1565. DOI: 10.1056/NEJMra1510030
  41. 41.Neumann S, Nir EA, Eliseeva E, Huang W, Marugan J, Xiao J, et al. A selective TSH receptor antagonist inhibits stimulation of thyroid function in female mice. Endocrinology. 2014;155:310-314
  42. 42.Kim D. The role of vitamin D in thyroid diseases. International Journal of Molecular Sciences. 2017;18(9):1949. DOI: 10.3390/ijms18091949
  43. 43.Provost F, Fawcett T. Data science and its relationship to big data and data-driven decision making. Big Data. 2013;1(1):51-59
  44. 44.Van Der Aalst W. Data science in action. In: Process Mining. Berlin, Heidelberg: Springer; 2016. pp. 3-23
  45. 45.Marrack P, Kappler J, Kotzin BL. Autoimmune disease: Why and where it occurs. Nature Medicine. 2001;7(8):899-905
  46. 46.Jordan MI, Mitchell TM. Machine learning: Trends, perspectives, and prospects. Science. 2015;349(6245):255-260
  47. 47.Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York, NY: Springer Science & Business Media; 2009. DOI: 10.1007/978-0-387-84858-7
  48. 48.Murphy KP. Machine Learning: A Probabilistic Perspective. Cambridge, Massachusetts, London, England: The MIT Press; 2012
  49. 49.Yan LC, Yoshua B, Geoffrey H. Deep learning. Nature. 2015;521(7553):436-444
  50. 50.Durstewitz D, Koppe G, Meyer-Lindenberg A. Deep neural networks in psychiatry. Molecular Psychiatry. 2019;24(11):1583-1598
  51. 51.Mourad M, Moubayed S, Dezube A, Mourad Y, Park K, Torreblanca-Zanca A, et al. Machine learning and feature selection applied to SEER data to reliably assess thyroid cancer prognosis. Scientific Reports. 2020;10(1):1-11
  52. 52.Stafford IS, Kellermann M, Mossotto E, Beattie RM, MacArthur BD, Ennis S. A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases. NPJ Digital Medicine. 2020;3(1):1-11
  53. 53.Bhattacharya S, Sengupta S. Application of data mining techniques in autoimmune diseases research and treatment. In: Machine Learning and IoT. Boca Raton: CRC Press; 2018. pp. 113-131. DOI: 10.1201/9781351029940
  54. 54.Begum A, Parkavi A. Prediction of thyroid disease using data mining techniques. In: 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS). Manhattan, New York, U.S: IEEE; 2019:342-345. DOI: 10.1109/icaccs.2019.8728320
  55. 55.Tomer Y. Mechanisms of autoimmune thyroid diseases: From genetics to epigenetics. Annual Review of Pathology: Mechanisms of Disease. 2014;9:147-156
  56. 56.Faustino LC, Kahaly GJ, Frommer L, Concepcion E, Stefan-Lifshitz M, Tomer Y. Precision medicine in Graves’ disease: CD40 gene variants predict clinical response to an anti-CD40 monoclonal antibody. Frontiers in Endocrinology. 2021;12:642
  57. 57.Kahaly GJ, Stan MN, Frommer L, Gergely P, Colin L, Amer A, et al. A novel anti-Cd40 monoclonal antibody, iscalimab, for control of Graves’ hyperthyroidism—A proof-of-concept trial. The Journal of Clinical Endocrinology & Metabolism. 2019;105(3):696-704. DOI: 10.1210/clinem/dgz013
  58. 58.Jia X, Zhai T, Zhang JA. Metformin reduces autoimmune antibody levels in patients with Hashimoto’s thyroiditis: A systematic review and meta-analysis. Autoimmunity. 2020;53(6):353-361. DOI: 10.1080/08916934.2020.1789969
  59. 59.Gillespie EF, Smith TJ, Douglas RS. Thyroid eye disease: Towards an evidence base for treatment in the 21st century. Current Neurology and Neuroscience Reports. 2012;12(3):318-324
  60. 60.Bartley GB, Fatourechi V, Kadrmas EF, Jacobsen SJ, Ilstrup DM, Garrity JA, et al. The incidence of Graves' ophthalmopathy in Olmsted County, Minnesota. American Journal of Ophthalmology. 1995;120(4):511-517
  61. 61.Salvi M, Vannucchi G, Currò N, Campi I, Covelli D, Dazzi D, et al. Efficacy of B-cell targeted therapy with rituximab in patients with active moderate to severe Graves’ orbitopathy: A randomized controlled study. The Journal of Clinical Endocrinology & Metabolism. 2015;100(2):422-431
  62. 62.Salvi M, Vannucchi G, Currò N, Introna M, Rossi S, Bonara P, et al. Small dose of rituximab for graves orbitopathy: New insights into the mechanism of action. Archives of Ophthalmology. 2012;130(1):122-124
  63. 63.Briceño CA, Gupta S, Douglas RS. Advances in the management of thyroid eye disease. International Ophthalmology Clinics. 2013;53(3):93
  64. 64.McCoy AN, Kim DS, Gillespie EF, Atkins SJ, Smith TJ, Douglas RS. Rituximab (Rituxan) therapy for severe thyroid-associated ophthalmopathy diminishes IGF-1R+ T cells. The Journal of Clinical Endocrinology & Metabolism. 2014;99(7):E1294-E1299
  65. 65.Keystone E, Haraoui B. Adalimumab therapy in rheumatoid arthritis. Rheumatic Disease Clinics. 2004;30(2):349-364
  66. 66.Kampmann E, Diana T, Kanitz M, Hoppe D, Kahaly GJ. Thyroid stimulating but not blocking autoantibodies are highly prevalent in severe and active thyroid-associated orbitopathy: A prospective study. International Journal of Endocrinology. 2015;2015:678194
  67. 67.Kahaly G, Pitz S, Müller-Forell W, Hommel G. Randomized trial of intravenous immunoglobulins versus prednisolone in Graves’ ophthalmopathy. Clinical and Experimental Immunology. 1996;106(2):197-202
  68. 68.Davies TF, Burch HB. Clinical features and diagnosis of Graves’ orbitopathy (ophthalmopathy). In: Post TW editor. 2019. Retrieved on 09/11/2021. Available from:
  69. 69.Pokhrel B, Bhusal K. Graves disease [updated 2020 Jul 21]. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2021. Available from:
  70. 70.Rago T, Cantisani V, Ianni F, Chiovato L, Garberoglio R, Durante C, et al. Thyroid ultrasonography reporting: Consensus of Italian Thyroid Association (AIT), Italian Society of Endocrinology (SIE), Italian Society of Ultrasonography in Medicine and Biology (SIUMB) and Ultrasound Chapter of Italian Society of Medical Radiology (SIRM). Journal of Endocrinological Investigation. 2018;41(12):1435-1443
  71. 71.Kahaly GJ, Bartalena L, Hegedüs L, Leenhardt L, Poppe K, Pearce SH. 2018 European Thyroid Association Guideline for the Management of Graves' Hyperthyroidism. European Thyroid Journal. 2018;7(4):167-186
  72. 72.Chiovato L, Magri F. The medical management of Graves disease in the era of precision medicine. Endocrine Practice. 2019;25(1):112-114. DOI: 10.4158/ep-2018-0528
  73. 73.Brix TH, Hegedüs L. Twins as a tool for evaluating the influence of genetic susceptibility in thyroid autoimmunity. Annales d'endocrinologie. 2011;72:103-107
  74. 74.Ando T, Davies TF. Self-recognition and the role of fetal microchimerism. Best Practice & Research. Clinical Endocrinology & Metabolism. 2004;18:197-211. DOI: 10.1016/j.beem.2004.03.002|Medline
  75. 75.Stefan M, Wei C, Lombardi A, Li CW, Concepcion ES, Inabnet WB 3rd, et al. Genetic-epigenetic dysregulation of thymic TSH receptor gene expression triggers thyroid autoimmunity. Proceedings of the National Academy of Sciences of the United States of America. 2014;111:12562-12567. DOI: 10.1073/pnas.1408821111
  76. 76.Zaletel K. Determinants of thyroid autoantibody production in Hashimoto's thyroiditis. Expert Review of Clinical Immunology. 2007;3:217-223
  77. 77.Zaletel K, Gaberscek S. Hashimotos thyroiditis: From genes to the disease. Current Genomics. 2011;12(8):576-588. DOI: 10.2174/138920211798120763
  78. 78.Pyzik A, Grywalska E, Matyjaszek-Matuszek B, Roliński J. Immune disorders in Hashimoto’s thyroiditis: What do we know so far? Journal of Immunology Research. 2015;21(4);523-528. DOI: 10.1155/2015/979167
  79. 79.Akamizu T, Amino N. Hashimoto’s thyroiditis [updated 2017 Jul 17]. In: Feingold KR, Anawalt B, Boyce A, et al., editors. Endotext [Internet]. South Dartmouth (MA):, Inc.; 2000 Available from:
  80. 80.Polat K, Şahan S, Güneş S. A novel hybrid method based on artificial immune recognition system (AIRS) with fuzzy weighted pre-processing for thyroid disease diagnosis. Expert Systems with Applications. 2007;32(4):1141-1147
  81. 81.Temurtas F. A comparative study on thyroid disease diagnosis using neural networks. Expert Systems with Applications. 2009;36(1):944-949
  82. 82.Ruggeri RM, Trimarchi F, Giuffrida G, Certo R, Cama E, Campennì A, et al. Autoimmune comorbidities in Hashimoto’s thyroiditis: Different patterns of association in adulthood and childhood/adolescence. European Journal of Endocrinology. 2017;176(2):133-141. DOI: 10.1530/eje-16-0737
  83. 83.Al-muwaffaq I, Bozkus Z. MLTDD: use of machine learning techniques for diagnosis of thyroid gland disorder. Computer Science and Information Technology. 2016;23:67-3
  84. 84.Ahmad W, Ahmad A, Lu C, Khoso BA, Huang L. A novel hybrid decision support system for thyroid disease forecasting. Soft Computing. 2018;22(16):5377-5383. DOI: 10.1007/s00500-018-3045-9

Written By

Ayodeji Folorunsho Ajayi, Emmanuel Tayo Adebayo, Iyanuoluwa Oluwadunsi Adebayo, Olubunmi Simeon Oyekunle, Victor Oluwaseyi Amos, Segun Emmanuel Bamidele and Goodness Olusayo Olatinwo

Submitted: August 27th, 2021Reviewed: October 14th, 2021Published: November 18th, 2021