|
|
||||||||
CLINICAL STUDY |
1 International NRW Graduate School in Bioinformatics and Genome Research Center of Biotechnology (CeBiTec), Bielefeld University, D-33 615 Bielefeld, Germany, and Hannover Medical School, D-30 625 Hannover, Germany, 2 Molecular Genetics Laboratory Raue, Hentze, D-69 121 Heidelberg, Germany, Departments of 3 Visceral and Transplantation Surgery and 4 Clinical Enocrinology, Hannover Medical School, D-30 623 Hannover, Germany, 5 Applied Neuroinformatics Group, Faculty of Technology, Bielefeld University, D-33 615 Bielefeld, Germany, Departments of 6 Pediatrics and 7 Pharmacology, University of Heidelberg, D-69 120 Heidelberg, Germany and 8 Howard Hughes Medical Institute and Computational Neurobiology Laboratory, The Salk Institute, San Diego, California 92 186, USA
(Correspondence should be addressed to (K Prank; Email: klaus.prank{at}cebitec.uni-bielefeld.de)
| Abstract |
|---|
|
|
|---|
Design and methods: As a model, we used a well-defined set of interconnected data consisting of unstimulated serum levels of steroid intermediates assessed in 54 subjects heterozygous for a mutation of the 21-hydroxylase gene (CYP21B) and in 43 healthy controls.
Results: The genetic alteration was predicted from the pattern of steroid levels with an accuracy of 39% by clinicians and of 64% by linear analysis. In contrast, non-linear analysis, such as self-organizing artificial neural networks, support vector machines, and nearest neighbour classifiers, allowed for higher accuracy up to 83%.
Conclusions: The successful application of these non-linear adaptive methods to capture specific biochemical problems may have generalized implications for biochemical testing in many areas. Nonlinear analytical techniques such as neural networks, support vector machines, and nearest neighbour classifiers may serve as an important adjunct to the decision process of a human investigator not trained in a specific complex clinical or laboratory setting and may aid them to classify the problem more directly.
| Introduction |
|---|
|
|
|---|
hydroxyprogesterone (17-OHP), 21-deoxycortisol (21-DF), dehydroepiandrosterone (DHEA), 17
-hydroxypregnenolone (17-OHPreg) and cortisol (Fig. 1
|
|
| Subjects and methods |
|---|
|
|
|---|
The first group consisted of 54 subjects (age, 2746 years; mean, 34.1 years) with heterozygous mutations in CYP21B. These subjects were relatives of patients with homozygous CYP21B mutations leading to the classical forms of congenital adrenal hyperplasia (CAH). None of the subjects had clinical symptom of CAH or of hyperandrogenism. The geographic origin of most of the subjects with heterozygous mutations in CYP21B was Germany except for 2030% of Turkish origin. The personal and family history of the 43 control subjects (age, 2838 years; mean, 31.5 years) was unremarkable. Blood was sampled between 0800 and 1100 h. The study was approved by the local Committee on Medical Ethics, and all subjects gave their informed written consent.
Determination of steroid profiles and genetic testing
The serum levels of 17-OHP, 21-DF, DHEA, 17-OHPreg and cortisol were determined in both groups under basal (unstimulated) conditions (5, 6). Steroids were measured after extraction and chromatographic purification by radioimmunoassay using specific antibodies (7). The CYP21B gene of both groups was specifically amplified by PCR as described previously (8). The 10 exons, all exonintron junctions, intron 2 and 7 and 400 bp of the promoter region of the CYP21B gene were analyzed on an automated DNA sequencer (LICOR, Lincoln, NE, USA). In a recent study we could demonstrate an exact classification of splice site mutations, the largest group of mutations in this study (9, 10).
Training and testing of classifiers{TX}
In the first step of analysis the four different classification algorithms were applied to the data: the linear discriminant analysis (LDA), the k-nearest-neighbour classifier (k-NN), the neural-gas clustering algorithm and the support vector machine (SVM). The algorithms are explained breifly below. The classifiers applied to the serum levels of the steroid intermediates 17-OHP, 21-DF, DHEA, 17-OHPreg and cortisol. Since it has been reported that the diagnosis of CAH might be based on fewer parameters (11), we tried to predict mutations of CYP21B from the unstimulated levels of 17-OHP and cortisol only applying the same training and testing procedure as on the full data. Each one of the algorithms is applied in a leave-one-out approach. The full data set is split into test and training data cases. In this work, the number of test cases is one. The remaining training data cases are used to train the classifier (LDA, neural gas, SVM) or to classify the test case directly (k-NN). To evaluate one algorithm, it is applied to each test/training data separation and the mean value and standard deviation of correct classification is computed as an indication of accuracy.
Linear discriminant analysis (LDA)
The linear Fisher discriminant was introduced to compute a one-dimensional projection of the data that maximizes the distance of projected class means, in regard to minimizing the result within class covariance. Based on the means and variances of the two classes the linear classification function can be computed explicitly.
k-Nearest-neighbour classifier (k-NN)
Nearest-neighbour algorithms (NNs) are ubiquitous in many research areas such as pattern recognition, machine learning and case-based reasoning (12). The k-NN algorithm is a simple generalization of case-based reasoning, which is k-NN for k = 1. It maintains the set of training cases and classifies the test case according to the most frequent class among the k examples in the training set that are most similar to the test case. In practice, if there is a tie, the class assigned to the smallest distance of k-NN is chosen. Basically, the motivations for such a large diffusion of NN reside in its good performance, which can be obtained in many situations, and its ease of use, as no parameter needs to be tuned and very little experience is required on the part of the user.
Neural-gas clustering
Because of the relatively small number of profiles available, we used self-organizing artificial neural networks (13) instead of the standard multi-layer perceptrons (14). The architecture of the self-organizing artificial neural networks used in this study (neural-gas algorithm) is based on the principle of competitive learning (15, 16) and is a derivative of Kohonens topology-conserving feature map algorithm (17, 18). In contrast to the LDA the neural-gas model is not based on the assumption of a linear relation between the input data and the predicted class or cluster. During the learning process (training) single neurons respond selective to different clusters in the sets of training patterns. After successful training each neuron (or code-book vector) is placed on the center of mass of a class, thereby representing the cases of that cluster. So the neural-gas model will elaborate differences in the profiles of steroid intermediates between healthy controls and subjects with heterozygous mutations of the 21-hydroxylase gene (CYP21B). A classification function was developed on the basis of the trained neural gas which represents the cluster structure. The steroid patterns of subjects with heterozygous mutations of the 21-hydroxylase gene or healthy controls were associated with every neuron. To allow for a stable classification we used 100 differently initialized networks for each training and testing run respectively. A majority vote was used for final classification.
Support vector machines (SVMs)
In recent years, kernel-based methods have been the object of much interest and research in the machine learning community. The success of many kernel methods is based on the concept of combining well-known linear algorithms, such as principal component analysis, with non-linear kernel functions. While the application of these functions allows more powerful non-linear solutions, the kernelized algorithms (19, 20) retain most properties of their linear versions. The most prominent algorithm among these is the support vector machine (SVM) proposed by Vapnik (21) for binary classification. The SVM is gaining popularity due to many attractive features and its superb classification performance, shown in numerous applications. It realizes pattern recognition between two classes by finding a decision function (hyperplane) determined by selected points from the training data, termed support vectors. In general this hyperplane corresponds to a non-linear decision boundary in the input space. While traditional techniques for pattern recognition are based on minimizing the empirical risk (i.e. on the attempt to optimize the performance on the training set), SVMs minimize the structural risk (i.e. the probability of yet-to-be-seen patterns to be classified correctly for an unknown probability distribution of the data). The SVM with a Gaussian kernel is trained and its parameters (width of Gaussian) and C (regularization) are optimized using the LIBSVM1 package, which combines the sequential minimal optimization (SMO) (22, 23) and the SVMLight2 algorithm. In this study, we used a SVM with Gaussian and linear kernels and ten-fold cross-validation
Diagnosis by human investigators
To compare the diagnostic results obtained from the artificial neural networks with the conventional clinical approach, four investigators known for their experience in the field were asked to classify the profiles of basal (unstimulated) levels of serum steroids of all subjects into heterozygous mutations or healthy control.
Statistical analysis
The 95% confidence intervals were calculated for the sensitivity, specificity and accuracy of the neural networks as well as for the diagnosis of each of the experts.
| Results |
|---|
|
|
|---|
|
| Discussion |
|---|
|
|
|---|
In contrast to non-linear analysis, human experts failed to classify the disease from unstimulated steroid levels. The close similarity of their results to the results of linear analysis, suggests that the clinicians were unable to use the subtle non-linear relations between unstimulated steroid parameters for making their diagnostic decision. The predictive results of the neural network approach may be further evaluated in larger groups (28) and with formal testing of the influence of assay noise, interassay variability and the type of assay used. The observation that only two commonly assayed hormonal parameters out of the five initially included in predicting the genotype are sufficient to maintain the predictive power of non-linear analysis is in favour of a rather robust performance of the analytical procedure. However, the objective of our application of machine learning techniques is a screening test for further diagnostic refinement and not to achieve an accuracy close to 100%, which would be a requirement for genetic counselling for instance.
The present study is in line with the successful use of non-linear analytical methods such as neural networks and phase space reconstruction in the diagnosis of myocardial infarction (26), certain forms of epilepsy (27) and cancer based on expression profiling (29, 30). The efficacy of the non-linear approaches rests on their capability to delineate non-linear interactions by progressively learning from examples.
Correct classification of multivariate interconnected biochemical data is the essence of many diagnostic steps in medicine. Non-linear relations are frequent and in many instances no simple threshold levels can be defined. Thus, simple linear threshold analysis fails and the diagnostic accuracy of the experienced clinician is based on the detection of non-linear relations. In heterozygous CYP21B mutations this relation only becomes apparent following augmentation of the relative relations by ACTH stimulation.
Medical experts gather their experience from a large number of cases accumulated over a long time span by learning complex interactions of laboratory parameters, not easily evident. The novel non-linear techniques used in this study are capable of learning such non-linear relations from large data sets of well-defined examples. They are able to reduce the required time span to minutes and might help to reduce the costs of a diagnostic procedure. The successful application of these non-linear adaptive methods to capture specific biochemical problems may have generalized implications for biochemical testing in many areas. Nonlinear analytical techniques such as neural networks, SVMs and nearest-neighbour classifiers may serve as an important adjunct to the decision process of a human investigator not trained in a specific complex clinical or laboratory setting and may help them to classify the problem more directly.
| Acknowledgements |
|---|
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |