Issue |
A&A
Volume 669, January 2023
|
|
---|---|---|
Article Number | A101 | |
Number of page(s) | 19 | |
Section | Planets and planetary systems | |
DOI | https://doi.org/10.1051/0004-6361/202243886 | |
Published online | 17 January 2023 |
Neural network for determining an asteroid mineral composition from reflectance spectra
1
Department of Geosciences and Geography,
PO Box 64,
University of Helsinki,
00014
Helsinki, Finland
e-mail: david.korda@helsinki.fi
2
Department of Physics,
PO Box 64,
University of Helsinki,
00014
Helsinki, Finland
3
Department of Computer Science,
PO Box 68,
University of Helsinki,
00014
Helsinki, Finland
4
Institute of Geology, Czech Academy of Sciences,
Rozvojová 269,
16500
Prague, Czech Republic
Received:
27
April
2022
Accepted:
25
September
2022
Context. Chemical and mineral compositions of asteroids reflect the formation and history of our Solar System. This knowledge is also important for planetary defence and in-space resource utilisation. In the next years, space missions will generate extensive spectral datasets from asteroids or planets with spectra that will need to be processed in real time.
Aims. We aim to develop a fast and robust neural-network-based method for deriving the mineral modal and chemical compositions of silicate materials from their visible and near-infrared spectra. The method should be able to process raw spectra without significant pre-processing.
Methods. We designed a convolutional neural network with two hidden layers for the analysis of the spectra, and trained it using labelled reflectance spectra. For the training, we used a dataset that consisted of reflectance spectra of real silicate samples stored in the RELAB and C-Tape databases, namely olivine, orthopyroxene, clinopyroxene, their mixtures, and olivine-pyroxene-rich meteorites.
Results. We used the model on two datasets. First, we evaluated the model reliability on a test dataset where we compared the model classification with known compositional reference values. The individual classification results are mostly within 10 percentage-point intervals around the correct values. Second, we classified the reflectance spectra of S-complex (Q-type and V-type, also including A-type) asteroids with known Bus–DeMeo taxonomy classes. The predicted mineral chemical composition of S-type and Q-type asteroids agree with the chemical composition of ordinary chondrites. The modal abundances of V-type and A-type asteroids show a dominant contribution of orthopyroxene and olivine, respectively. Additionally, our predictions of the mineral modal composition of S-type and Q-type asteroids show an apparent depletion of olivine related to the attenuation of its diagnostic absorptions with space weathering. This trend is consistent with previous results of the slower pyroxene response to space weathering relative to olivine.
Conclusions. The neural network trained with real silicate samples and their mixtures is applicable for a quantitative mineral evaluation of spectra of asteroids that are rich in dry silicates. The modal abundances and mineral chemistry of common silicates (olivine and pyroxene) can be derived with an accuracy better than 10 percentage points. The classification is fast and has a relatively small computer-memory footprint. Therefore, our code is suitable for processing large spectral datasets in real time.
Key words: meteorites, meteors, meteoroids / minor planets, asteroids: general / methods: numerical / methods: data analysis / techniques: spectroscopic
© The Authors 2023
Open Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.
1 Introduction
Ordinary chondrites are the most common type of meteorites. They originate from the S-complex asteroids and are one of the oldest materials in our Solar System. Their dominant constituents are olivine, orthopyroxene, FeNi metal, and troilite. The Fe content of the silicate minerals encodes information about redox conditions in the early Solar System. The mineralogical and chemical compositions of asteroids are also important in in-space resource utilisation and planetary defence.
Olivine and pyroxene have strong absorption bands in their near-infrared reflectance spectrum. The absorption bands are due to Fe2+ cations in crystalline structure. The three overlapping absorption bands of olivine are located approximately at the 1 µm wavelength. The two strong absorption bands of pyroxene are located approximately at the 1 and 2 µm wavelengths, and one weak band of iron-rich pyroxene is approximately at the 1.25 µm wavelength. The properties of the individual bands change with the chemical composition of olivine and pyroxene, while the overall spectral shape depends on the ratio of the olivine to pyroxene modal abundances or on the grain sizes of the minerals. For this reason, previous authors have been focusing on establishing relations between the properties of the bands and the corresponding composition. Adams (1974) pointed out a correlation between the chemical composition of pyroxene and the position of the minima of the two strongest pyroxene bands. Cloutis et al. (1986) found a relation between the area of the olivine–orthopyroxene mixture absorption bands and the modal composition of the mixture. Because the central wavelength and the area of the absorption bands are sensitive to the modal and chemical compositions of the minerals, it is important to model the bands properly. A model-fitting tool of the reflectance spectra was derived by Sunshine et al. (1990), who introduced modified Gaussians to model the absorption bands in logarithmic reflectance. These methods have been intensively used and tested (e.g. Gaffey 1984; McSween et al. 1991; Fornasier et al. 2003; Kanner et al. 2007; Dunn et al. 2010; Popescu et al. 2012; Han et al. 2020). Therefore, the chemical composition of olivine and pyroxene, and their abundance in mixtures, can in principle be obtained from the reflectance spectra through spectral deconvolution.
The deconvolution methods usually consist of two steps: (1) separation of a spectrum into continuum and absorption bands by fitting, and (2) application of empirical relations between the band parameters of the fit and the chemical composition. However, for each mineral mixture, we need to know a priori which minerals we expect to be present. This results in a different number of fitting parameters for fitting the spectrum in each case. Another limiting factor is that the band depth depends on the grain size, which results in a varying relative contribution of individual bands in the mineral-mixture spectrum. Without a precise fit of the whole spectrum and a correct separation of the bands and the continuum, no precise chemical composition can be derived.
In the past 50 yr, several asteroid classification schemes (taxonomies) were introduced (e.g. Tholen 1984; Bus 1999; DeMeo et al. 2009). The taxonomies are based on observed asteroid spectra of (typically) a few hundred asteroids in a given number of colours (broad-band filters) or in a continuous spectral range (with most of the observations taken from visible up to 1 µm) and are based on spectral features rather than on the features in the asteroid composition space. Therefore, taxonomies can tell us whether the spectrum of the observed asteroid contains absorption bands of specific minerals, but they contain little information about the quantitative mineral abundances or mineral chemical compositions of the asteroid.
For this reason, spectrum deconvolution methods have been applied on asteroid spectra to gain the quasi-quantitative metrics of their mineral and chemical composition. In the next years, spacecraft missions to asteroids or planets (e.g. Hera, BepiColombo, or Psyche) will provide us with high-resolution disk-resolved reflectance spectra that will increase the need for precise spectrum-processing methods even further to determine subtle variations in the surface compositions with little a priori knowledge.
Machine learning is based on flexible models that are trained using observed data. Deep learning (Goodfellow et al. 2016) considers specific machine-learning models, neural networks, that consist of multiple layers of simple computational steps. Neural networks have been successfully used in astronomy (e.g. Angel et al. 1990; Odewahn et al. 1992; Snider et al. 2001; Pearson et al. 2018; Hon et al. 2018; Ni et al. 2021), geophysics and planetary science (e.g. Rigol-Sanchez et al. 2003; Ross et al. 2018; Civilini et al. 2021), or asteroid studies (e.g. Howell et al. 1994; Misra & Bus 2008; León et al. 2010; Lieu et al. 2019; Wallace et al. 2021; Penttilä et al. 2021, 2022) because they combine a high degree of flexibility with efficient processing of new observations. Therefore, they are ideal candidates for the processing of reflectance spectra on the ground and in flight using devices embedded on-board of spacecraft.
The aim of this work is to create a neural-network model to estimate the modal abundances of olivine, orthopyroxene, and clinopyroxene and their chemical compositions based on the recorded spectrum. We evaluate the reliability of error metrics on a subset of test spectra. Subsequently, we apply the method to derive the composition of S*-complex asteroids from their reflectance spectra. Within this work, we use the term S*-complex asteroids to refer to asteroids that are rich in dry silicates such as olivine and pyroxenes. This includes the S, Q, R, O, and V types and their subtypes as defined by DeMeo et al. (2009). Additionally, we include A and Sa types because they are assumed to have an olivine-rich composition.
The Python scripts and datasets including all sample spectra and composition information we used in this study together with metadata can be downloaded from GitHub repository1.
2 Data
Representative reflectance spectra of olivine (OL; 100 samples), orthopyroxene (OPX; 102 samples), clinopyroxene (CPX; 108 samples), laboratory silicate mixtures (137 samples), and meteorites (45 ordinary chondrites, seven HED, five lodranites, four SNC achondrites, and two brachinites; 63 samples) were recovered from the RELAB2 and C-Tape3 databases, or prepared and measured in our laboratory (designated as TK).
The spectra were selected based on the following criteria: (1) the wavelength range must cover 450–2450 nm interval with a step size of 15 nm or smaller to match available asteroid spectra in DeMeo et al. (2009) and Binzel et al. (2019), (2) both mineral modal and elemental compositions should be documented, for instance by X-ray diffraction (XRD), full electron microprobe analysis (EMPA), or at least cation ratios. Alternatively, a reasonable confidence to published composition information should exist, for example through information of the sample locality, (3) the fayalite number of olivine and the ferrosilite number of pyroxene is higher than 3 and 5, respectively, as we found that samples with lower content lack significant diagnostic features, (4) the sample does not contain more than 35 vol% of additional phases that contain strong absorption bands in the visible to near-infrared range (e.g. plagioclase), as we found that a higher content of these phases can have a noticeable effect on the overall spectral shape; spectrally neutral or opaque phases such as metals or sulphides were ignored as they do not alter silicate absorptions, and (5) visual inspection of the spectra shows no signs of terrestrial weathering, contamination, or other artefacts.
The mineral composition must be documented with high precision to match the reflectance spectrum to a specific material composition. For a modal mineral composition, we used data from X-ray diffraction with an uncertainty of about 3 percentage points (pp; a percentage point is the unit for the arithmetic difference of two percentages). When analytic results were available in weight percent, we converted them into volumetric modal abundances. For the conversion, we determined the mineral density of a given composition from the linear interpolation of the corresponding end-member densities. The densities of end-members and molar masses we used are listed in Table A.2. The elemental mineral composition was either gathered from the RELAB database (when available) or from the literature and is typically measured using EMPA with a typical uncertainty of a few pp.
The aim of our model is to detect the composition of olivine-pyroxene-rich mixtures independently of their grain size or spectrally neutral components. This we accomplished during training by including samples with varying grain sizes and forms (e.g. powders or slabs) and normalising modal abundances to olivine–orthopyroxene–clinopyroxene alone, ignoring other spectrally neutral phases. This approach resulted in a high variation of the continuum shape in the training dataset and subsequently in locking our model on diagnostic band shapes and their relative proportions rather than on absolute values (e.g. band depths) or continuum shapes. The general overview of our sample set is presented in Table 1, and the full sample list together with analytical results and the physical state is given in Appendix A and additionally as an Excel spreadsheet in the GitHub repository.
The mineralogical distributions of the data are plotted in Fig. 1. Although our data cover the full range of individual modal abundances, most of our samples are pure minerals or mixtures with compositions similar to those of ordinary chondrites. Compositions of olivine and orthopyroxene have a peak in relatively low-iron contents, but all chemistry combinations are contained in our dataset. In the chemistry of clinopyroxene, individual components also cover most of the solid solution of clinopyroxene, but samples peak at the high calcium and low iron part of the compositional space. This bias is caused by the natural availability of samples in the RELAB database.
The peaks in the mineral chemical compositions can cause biases in the predictions of our model when spectra of samples with compositions very different from the composition of the training samples are evaluated. For these samples, the model may predict a composition that is artificially closer to the peaks. On the other hand, our model was made to be used on S-complex asteroids, whose composition is comparable to that of ordinary chondrites, and all the peaks in the compositions correspond to ordinary chondrites. For this reason, we do not expect significantly biased results for S-complex asteroids.
The reflectance spectra of the samples were interpolated to the wavelength grid from 450 nm to 2450 nm with a resolution of 5 nm. This choice enabled us to use the model directly on asteroid spectra without superfluous interpolation, and it keeps the information of high-resolution laboratory measurements. To reduce the noise level of the re-sampled spectra, we performed a convolution filtering on the spectra. The convolution kernel was a Gaussian with σ=7 nm. To highlight relative differences among the spectra, the denoised spectra were normalised at 550 nm. The processed spectra are plotted in Fig. 2.
Basic information about our dataset.
![]() |
Fig. 1 Mineralogical distributions of the data. |
![]() |
Fig. 2 Denoised and normalised spectra of silicates, their mixtures, and asteroids utilised in this study. |
3 Neural network
We modelled the unknown mapping between the spectra and the composition using a feed-forward convolutional neural network. The network takes the spectrum as input and propagates it through layers of non-linear transformations consisting of individual computational units called neurons, where the parameters of the transformation are learnt from data.
Each layer performs a simple non-linear transformation on its inputs h in the form o = f(Wh + b), where o is the layer output, W is the weights, b is the bias, and f is the non-linear activation function. The trainable parameters of the network are the weights and biases of all layers, whereas the choice of the activation functions and the layer architecture are considered design choices. The first hidden layer takes the spectrum as the input, and the last layer provides the predicted abundances as its output. We built the neural network using the Keras library (Chollet et al. 2015) in Python.
3.1 Model architecture
The neural network takes as inputs the reflectance spectra at given wavelengths. The wavelengths range from 450 nm to 2450 nm with a step of 5 nm, resulting in input vectors of 401 values.
The model architecture was chosen using a validation and random search, following the procedure described in Sect. 3.4. The final network consisted of two convolutional hidden layers and an output layer. A convolutional layer performs a convolution on its input. The layer uses the given number of convolution kernels (filters) with the given size. Each kernel is adapted to a different pattern in data. Both convolutional layers use kernels of width five and consist of 24 and 8 kernels, and the output of the second convolutional layer is flatted as input for the fully connected output layer. Throughout the network, we used the rectified linear unit (ReLU) f(x) = max (0, x) as the activation function, except for the output layer, which used the sigmoid activation f(x) = (1 + exp (−x))−1 with further normalisation described next.
The outputs of our model are modal abundances of olivine, orthopyroxene, and clinopyroxene in volume percent together with their chemistry (fayalite [Fa] and forsterite [Fo] numbers for olivine; ferrosilite [Fs] and enstatite [En] numbers for orthopy-roxene; and ferrosilite, enstatite, and wollastonite [Wo] numbers for clinopyroxene). We normalised the output of the sigmoid function for each group of parameters separately (i.e. separately for the modal abundances and separately for the end-members of each mineral). This guarantees that the outputs are directly related to the modal and chemical compositions of the minerals. Additionally, the mineral modal composition has a meaning of a volume fraction normalised to studied minerals, for example the olivine volume fraction in percent is defined as . Similarly, the mineral chemical composition is normalised to the given end-members, for example Fs of orthopyroxene is defined as
, while Fs of clinopyroxene is
.
3.2 Loss function and accuracy metrics
We assumed K different minerals with Ck chemical end-members. Each sample i has a modal volume fraction wik of the kth mineral, and the mole fraction zikc of the cth end-member.
(1)
which are also the basic properties of our output activation function. The loss function ℒ should reflect the variable modal composition of various minerals and must be small when the predicted modal wpred and chemical zpred compositions are both close to the actual values wtrue and ztrue. We defined the loss function in the form of
(2)
where the colon stands for all possible indices at the given position. The first term controls the predictions of the modal compositions, and the second term fits the chemical compositions of individual minerals. a is the trade-off parameter between the fit precision of the modal and chemical compositions. The function ɡ adds an additional large penalisation if the predicted solution is in a region with non-existent mineral solid solution (e.g. when predicted Wo > 60). If the abundance of the kth mineral is small, its influence on the spectrum is small as well. This is reflected in the second term, where a small modal abundance automatically leads to a small influence of the loss-function value. On the other hand, the predictions of the chemical compositions are potentially very inaccurate in these cases.
Our task is ultimately that of non-linear regression. We therefore used three standard measures of regression accuracy to evaluate the accuracy of the predictions: the root-mean-square error (RMSE), the coefficient of determination (R2), and the spectral angle mapper (SAM). The vector accuracy metrics computed from the test data give us information about the model generalisation (i.e. how successful the model is when analysing new observations). Namely, RMSE gives us an estimation of the errors in the predicted values for each of the value separately, R2 provides the meaning of which part of the variations in the data is explained by the model, and SAM quantifies the similarity of the actual and predicted values in terms of the angle between these two vectors. We note that the metrics do not account for various modal abundances, therefore, a mismatch in a low-abundance mineral can significantly contribute to the final mineral mixture metrics.
3.3 Training and regularisation
Neural networks are trained using iterative gradient descent algorithms, where the gradients of the loss with respect to the weights and biases are obtained using the backpropagation algorithm, a special case of automatic differentiation. We followed standard practices and applied the Keras library to solve the optimisation problem and the Adam optimiser (Kingma & Ba 2014) with an initial learning rate of 0.0005, which was selected together with the model architecture using the validation data.
For the training and evaluation of the model, we split the available data randomly into three parts: training (60%, 306 spectra), validation (20%, 102 spectra), and test (20%, 102 spectra). This was done in a stratified manner to retain the relative numbers of each type of data (columns in Table 1). Only the training data were used to update the weights and biases, whereas the validation data were only used to select the model architecture (see Sect. 3.4) and to determine when to terminate the optimisation. For the final evaluation, we chose the model at the point of the iterative training that performed best on the validation data. The test samples were only used at the very end to evaluate the performance of this final model.
We used two types of regularisation to minimise overfit-ting for the training data: L1 (absolute value) regularisation for the weights and biases to encourage small weights, and dropout regularisation (Srivastava et al. 2014), which randomly sets selected connections between the neurons as zeros during training. The parameters for these regularising elements were selected together with the model architecture. In addition, we used the commonly employed batch normalisation between the layers.
In addition to the olivine-orthopyroxene-clinopyroxene model, we also tried different mineral configurations. These models included an olivine–orthopyroxene–only model or a more complex olivine–orthopyroxene–clinopyroxene-plagioclase model. Compared with the presented baseline model, the additional models resulted in reliability matrices that were similar to or worse than that of the baseline model. These models are described separately in Appendix B.
Hyperparameters and other properties of the neural network.
3.4 Hyperparameters
The exact neural network architecture (including e.g. kernel widths and the number of convolutional filters), the hyperparameters controlling the optimisation routine, and the regularisation techniques and their parameters were selected based on the performance on the validation data using a random search over possible architectures and parameter choices. For each hyperparameter, we chose a range of values and trained over 2500 models with unique randomly chosen hyperparameters, selecting the architecture with the lowest validation RMSE as the final choice. The final parameters and the ranges of possible values are summarised in Table 2. Because we considered fairly small neural networks that are appropriate for the limited number of training samples, the random search was sufficient to discover a good architecture. More advanced architecture search techniques, such as Bayesian optimisation (Kandasamy et al. 2018), could be used to reduce the computational cost of the search process.
4 Results
We trained the neural-network model using a training set of 306 spectra of natural silicate samples and their physical mixtures, including meteorites. The hyperparameters of the model were set based on a model performance on a validation set of 102 spectra.
4.1 Evaluation on the test data
The reliability tests were performed on the test set of 102 spectra and are plotted in Figs. 3–6. In all the plots, the horizontal axis corresponds to the true value and the vertical axis to the predicted value. The diagonal lines delimit the prediction accuracy intervals. Error bars in actual values (horizontal) are considered as the conservative analytical error of 3 pp, and in the predicted values (vertical), they correspond to the RMSE errors given in Table 3. Additionally, in Figs. 4–6, the colour of the points indicates the modal abundance of the mineral. The colour map is the same in all figures. The RMSE, R2, and SAM values for all predicted quantities can be found in the corresponding subplots and in Table 4.
The predicted modal abundances (see Fig. 3) are mostly within the 10 pp error interval. Six distinct outliers significantly worsen the accuracy metrics. Five of them are due to the mismatch between orthopyroxene and clinopyroxene (see the 0, 50, and 100 actual abundances of OPX and CPX in Fig. 3), and one olivine–orthopyroxene-clinopyroxene mixture was predicted to be almost pure olivine because the 2 µm pyroxene band is very shallow. When we computed the RMSE without the orthopyroxene–clinopyroxene outliers, the estimated errors of the silicate modal abundances were 6.5 pp and 3.5 pp for orthopyroxene and clinopyroxene, respectively.
The error estimates (RMSE) for different predicted intervals are listed in Table 3. These errors show us that within the typical composition of ordinary chondrites, our model predictions are mostly better than 10 pp. High errors in some bins are usually due to a single outlier and a small number of points in the bins.
The RMSE metric is very sensitive to outliers and overestimates the real prediction error of orthopyroxene and clinopy-roxene abundances. To better present the errors between the true and the predicted values, we therefore computed the quantiles of the absolute error. They are plotted in Fig. 7 and summarised in Table 5, and they are shown in the density plots in Fig. 8. Table 5 shows that more than 75% of the predictions in each composition separately are within the 10 pp error interval, and more than 85% of the predictions are within the 15 pp error interval. The predicted chemical compositions are very accurate for olivine (Fig. 4) and orthopyroxene (Fig. 5), where 95% of the predictions are within the 10 pp error interval for both. The predictions of the clinopyroxene composition (Fig. 6) are less accurate; only 80% of the predicted Fs, 78% of the predicted En, and 76% of the predicted Wo are within the 10 pp error interval. The precision of the model is limited mostly by the number of samples with intermediate abundances of clinopyrox-ene, by the lack of olivine-clinopyroxene mixtures, and also by the number of samples in general. When we assume a normal distribution for the absolute errors, the 1-σ errors are 2.5, 5.7, and 2.1 pp for the modal compositions ofolivine, orthopyroxene, and clinopyroxene, respectively, and 4.9, 3.3, 5.5, 6.4, and 5.9 pp for fayalite, OPX ferrosilite, CPX ferrosilite, CPX enstatite, and CPX wollastonite, respectively.
4.2 Classification of asteroid spectra
The asteroid dataset consists of 387 reflectance spectra of S*-complex asteroids (DeMeo et al. 2009; Binzel et al. 2019) with assigned Bus–DeMeo taxonomy classes (DeMeo et al. 2009). The data include spectra of 147 S-, three Sa-, 41 Sq-, 17 Sqw-, 31 Sr-, eight Srw-, three Sv-, two Svw-, 57 Sw-, 42 Q-, one Qw-, 26 V-, two Vw-, and seven A-type asteroids. These spectra are in the range from 450 nm to 2450 nm and have a resolution of 10 nm. As in the case of mineral spectra, we resampled the spectra to a resolution of 5 nm, denoised them with the Gaussian kernel, and finally normalised them at 550 nm (see the bottom right panel in Fig. 2).
The S-type asteroids are the most common asteroids in the near-Earth region and are very common in the inner and middle Main Belt (Binzel et al. 2019). Their reflectance spectra are dominated by the 1 and 2 µm bands of olivine and pyroxene. The Q-type asteroids are very similar in composition to the S-type asteroids. The difference is in the slope of the spectrum. The spectra of Q-type asteroids have a neutral slope, while the spectra of the S-type asteroids have a red slope. Reflectance spectra of the V-type asteroids show a narrow 1 µm band and a wide 2 µm band. These suggest a high abundance of pyroxene. The A-type asteroids are dominated by the 1 µm band of olivine and a high red slope of the spectrum. We refer to Fig. 15 of DeMeo et al. (2009) for the shape of reflectance spectra of individual taxonomy classes.
The predictions of the neural-network model for the asteroid spectra show that the S-type asteroids are depleted in olivine compared to Q-type asteroids, but are of the same mineral chemical composition, and both are comparable with the chemical composition of ordinary chondrites. The V-type asteroids are mainly made of orthopyroxene (Fs about 31) and clinopyroxene, and the A-type asteroids are made of almost pure olivine (Fa about 26). Standard deviations indicate that the mineral chemical compositions are mostly homogeneous within the asteroid types, while modal abundances are potentially more various. The statistical information about the predictions is listed in Tables 6 and 7.
Error estimates computed from predicted values at the given interval.
![]() |
Fig. 3 Scatter plots of the true and predicted modal compositions of the test data. The diagonal lines delimit the accuracy of the predictions. Error bars in actual values (horizontal) are considered as a conservative analytical error of 3 pp, and in predicted values (vertical), they correspond to the RMSE errors given in Table 3. |
![]() |
Fig. 4 Scatter plots of the true and the predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. |
![]() |
Fig. 5 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. |
![]() |
Fig. 6 Scatter plots of the true and predicted chemical compositions of clinopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of clinopyroxene in the samples. |
Accuracy metric values of the model on the test data.
5 Discussion
5.1 Biases
The density plots in Fig. 8 visualise the approximate distributions of our test data and dependence between the error and composition. In this way, the density plots show biases in the predictions for the test data. The top and middle rows of the density plot show the approximate distributions of the points in error-composition space. A possible trend in the dependence of the error on composition, non-zero mean value of error, or a significantly asymmetric error distribution indicate biased predictions. The bottom row in Fig. 8 show that the error distributions peaked at 0 pp error and do not show significant asymmetries. No error-composition dependences are observed in the modal abundances. For olivine, only about 10% of the data have Fa > 50, which might cause the slight iron depletion in the prediction of the high-iron (50 < Fa < 80) samples. This observation was made on four points and thus cannot be proved. The prediction of the chemical composition of orthopyroxene does not contain any biases. For clinopyroxene, we observe trends in the enstatite and wollastonite numbers. They are probably caused by the distribution of our training data and by the coupling between enstatite and wollastonite via the normalisation of the chemical composition. Even other metrics indicate that the predictions of the clinopyroxene chemical composition are the least accurate. For these reasons, the predictions of the clinopy-roxene composition of our model are reliable only in a limited region of approximately 40 < En, Wo < 60. This region is valid for ordinary chondrites or achondrites, for instance, and thus also for S-complex asteroids.
![]() |
Fig. 7 Quantites of the absolute errors between the actual and predicted values in pp. The dashed black lines indicate the 10 pp and 20 pp error. |
Part of predictions that is within the given error interval.
![]() |
Fig. 8 Density plots based on error predictions (predicted minus actual) and specific compositions. The dashed lines delimit zero error. Top and middle rows: concentrations of the test data as a function of composition and error. Bottom row: error distributions. Biases are present when the error is dependent on the composition, the mean value of error is significantly non-zero, or when the error distributions are asymmetrical. |
Mean values and standard deviations of the model predictions for the asteroid spectra.
5.2 Asteroid spectra and the effects of space weathering
Our model output indicates that S-type asteroids are relatively depleted in olivine abundance compared to Q-type asteroids, while the mineral chemistry remains comparable. At first glance, this is a surprising finding. However, space weathering provides an explanation for this observation. Based on their spectra, it is generally thought that Q-type asteroids represent fresh material that is similar to the material of ordinary chondrites (Binzel et al. 2010). The S-type asteroid spectra deviate from the Q type in the way in which they show more signs of space weathering, that is, the silicate absorption bands are more attenuated and the overall spectral slope is redder; see Fig. 1 in Binzel et al. (2010). It is also known that olivine undergoes space weathering changes on a shorter timescale than pyroxene (Sasaki et al. 2002; Marchi et al. 2005; Quadery et al. 2015; Chrbolková et al. 2021). Moreover, our model is trained on non-weathered data, hence the model interprets the weaker olivine bands as a lower olivine abundance. Therefore, we explain our model results so that our model does not actually indicate the physical absence of olivine, but rather that accelerated space weathering masks its diagnostic spectral absorptions relative to pyroxene. This is further supported by the fact that despite the relatively lower observed olivine modal abundance, its chemistry is not significantly altered. Figure 9 supports this as well: the apparent depletion of the olivine abundance correlates with the space-weathering trend (red arrow) defined by Binzel et al. (2019) in the PCA-based asteroid spectrum classification.
To further test the space-weathering effects on our neural-network output, we used the Chelyabinsk dataset (olivine-pyroxene mixtures) from Kohout et al. (2020), which includes laboratory space-weathered, impact-melted, and shock-darkened mixture series, and also a pure olivine and pure pyroxene dataset from Chrbolková et al. (2021), which includes a space-weathered series.
The OL, OPX, and CPX-normalised compositions of the Chelyabinsk meteorite are based on Reddy et al. (2014). The XRD is consistent with our model output for a fresh Chelyabinsk spectrum (see Table 8). The model predictions for the Kohout et al. (2020) dataset are listed in the bottom part of Table 8.
We note that the predictions of the CPX chemical composition can be ignored for the fresh sample because the predicted CPX abundance is only 3.9 vol%. With increasing laboratory-induced space weathering, the relative modal abundance of olivine in Chelyabinsk as indicated by our model systematically decreases from 66.6 to 2.8 vol%, similarly to the trend observed in S-type asteroids, while the olivine and orthopyroxene chemistries show weaker systematic decreases that are within its 1-RMSE error. In contrast, for the Chelyabinsk mixtures with spectrally neutral impact-melted or shock-darkened materials (DVI and SD series), the modal abundances as well as the silicate chemistry remain similar (i.e. 62.8–68.2 vol% and 58.0–67.9 vol% for the olivine fraction in IM and SD, respectively) and mostly within 1 RMSE to the fresh Chelyabinsk XRD-derived values (see Appendix C). This demonstrates that especially the predictions of the modal abundances of our model are indeed sensitive to the relative changes in individual mineral absorption bands and are robust against global spectral attenuation (e.g. caused by shock or melting). This is consistent with the finding of Kohout et al. (2020), who showed that the relative depth and area of the 2 µm and 1 µm bands do not depend on the amount of shock-darkened material, but that space weathering attenuates the 2 µm band to a lesser extent than the 1 µm band (see their Fig. 4).
Chrbolková et al. (2021) irradiated olivine and pyroxene samples with H+, He+, Ar+, and with a femtosecond pulse laser; see their Fig. 2 for the spectra. Especially ion irradiation is widely used to simulate space weathering (e.g. Loeffler et al. 2009; Lantz et al. 2017). We made predictions for these spectra and found that regardless of method or intensity, the predicted modal abundances were very close to those of a pure sample. The minimum olivine fraction in the pure olivine samples was 98.5 vol% (laser irradiated, 375.0 J cm−2). Comparable results were obtained for pure pyroxene, where 78% of the samples were predicted with a pyroxene fraction of 99 vol% and more. The predictions for pyroxene also showed a possible mismatch between pyroxene and olivine in cases of extreme space weathering. The samples with very high laser irradiation (450 and 1800 J cm−2) had broadened the 1 µm band, which was interpreted as 11 and 34 vol% of olivine. For all experiments, predictions for the iron contents in the samples were weakly dependent on the degree of weathering. Namely, there was a positive trend for olivine (which is the opposite trend from what we observed for Chelyabinsk) and a negative trend for pyroxene. Moreover, the prediction of the pyroxene composition was offset to significantly lower values. These effects are demonstrated in Tables 9 and 10, which list the predictions for the laser-irradiated spectra. The results from the other experiments are summarised in Appendix D.
The mineral chemical compositions of the mean S- and mean Q-type asteroids are comparable within standard deviations and also with ordinary chondrites. This is shown in Fig. 10, where the boxes delimit individual types of ordinary chondrites. However, the Fa and Fs values of the mean S-type asteroid are slightly lower than those of the mean Q-type asteroid (see Tables 6 and 7). We hypothesise that this might be related to the observation that Q-type asteroids are mostly present in the near-Earth asteroid population; see Fig. 6 in Binzel et al. (2019). Moreover, LL types dominate the ordinary chondrite meteorite population (Grady 2000). Therefore, Q types might be slightly more oxidised (LL-like) than the mean S type.
Another finding is that the mineral chemical distributions of various types of S-complex asteroids are comparable (see Table 7). The individual types are not related to specific compositions, but rather to the appearance of spectra. The spectra of S-complex asteroids differ mostly in the shapes of the continua or the relative depths of the absorption bands. This can be caused by additional featureless minerals or space weathering. The similarity in mineral chemical composition can also be seen in the S column of Table 6, which is indicated by the low values of the standard deviations of the chemical compositions. As we showed for the Chelyabinsk meteorite, the predictions of our model are only slightly sensitive to global attenuation. When we compare the mean predicted values for a type and its weathered version (e.g. S and Sw), we observed similar trends as for the samples of Chrbolková et al. (2021). The prediction for fayalite numbers is slightly higher for weathered types, while orthopyroxene forsterite numbers are always much smaller. This may also indicate that the method used by Chrbolková et al. (2021) corresponds more closely to the physical reality of asteroid space weathering.
V-type asteroids are classified to be predominantly composed of pyroxenes, which correlates with high PC2’ values (Fig. 9). In contrast, A-type asteroids are predicted to be composed exclusively of olivine, consistent with their low PC2’ values in the DeMeo classification.
These tests show that the model predicts a reasonable chemical composition of olivine even for weathered samples. Predictions for the chemical composition of pyroxene are more sensitive to space weathering and can be offset to lower values. Furthermore, the predicted modal abundances of low-weathered asteroids (e.g. Q or Sq) or asteroids with amono-mineral composition (e.g. containing one dominant phase, such as pyroxene in V, or olivine in A or Sa) are not significantly influenced by space weathering. On the other hand, the predicted modal abundances of S-type asteroids (i.e. weathered mixtures of olivine and pyroxene) lead to an artificial distortion of the modal ratio of pyroxene to olivine in favour of pyroxene because olivine is more strongly weathered and hence its diagnostic absorption bands are more strongly compressed than those of pyroxene.
Mean values and standard deviations of the model predictions for the S-complex asteroid spectra.
![]() |
Fig. 9 Principal components of S*-complex asteroids. The predicted olivine fraction decreases in the direction of increasing space weathering, which is indicated by the red arrow. |
Modal abundances and mineral chemical composition of the Chelyabinsk meteorite.
Modal abundances and mineral chemical composition of the space-weathered olivine samples.
Modal abundances and mineral chemical composition of the space-weathered pyroxene samples.
![]() |
Fig. 10 Iron contents in olivine and orthopyroxene in the mean S-and Q-type asteroids. The boxes roughly delimit the types of ordinary chondrites. Red, green, and blue correspond to H, L, and LL types, respectively. |
5.3 Comparison to legacy methods
Legacy methods for a spectrum deconvolution that are optimised for an olivine-pyroxene-rich sample typically use the 1 µm and 2 µm bands, namely the positions of their central wavelengths (designated Band I Centre, BIC, and Band II Centre, BIIC) and the ratio of the 2 µm band area and 1 µm band area (band area ratio, BAR). BIC, BIIC, and BAR are used as an input for empirically derived relations where outputs are the olivine-to-pyroxene ratio and the chemistry of olivine and pyroxene. The compositional interval in which these methods are valid is typically rather restricted, and different relations exist for various mineralogies (see e.g. Gaffey et al. 2002).
We applied the equations from Cloutis et al. (1986), Gaffey et al. (2002), and Reddy et al. (2015) to our test data, which consisted of selected olivine-orthopyroxene-dominated laboratory mixtures and meteorites (CPX lower than 10 vol%). The empirical relation of Cloutis et al. (1986, BAR) was optimised for olivine-to-pyroxene ratios of S- and Q-type asteroids. The pyroxene chemistry equations from Gaffey et al. (2002, BIIC) must be solved iteratively, but the validity interval is much broader than that based on the equations in Reddy et al. (2015, BIC). The comparison between our neural-network model and these legacy models is shown in Fig. 11. The BAR- and BIC-based methods agree well with our neural-network-based model and with the actual analytical compositional values. The BIIC-based method shows an overall lower agreement with analytical results and with our model. The RMSE error estimate of the BAR method is 10.5 pp for the olivine fraction, and the RMSEs of the BIC and BIIC methods for orthopyroxene Fs are 6.8 and 13.7 pp, respectively. We note that the application of these BAR- and BIC-based methods was optimised to rather narrow compositional intervals. The good agreement in this interval is therefore expected. In contrast, our neural-network-based method shows a better performance and is applicable to the whole range of compositions.
We also applied the BAR method on the asteroid spectra and plot the olivine fractions to the PCA in Fig. 12. The BAR method is less sensitive to space weathering than our neural-network model. This may be due to the fact the BAR method uses the combined 1 µm olivine-pyroxene band area, and the relatively lower attenuation of the olivine bands does not influence the band area significantly. The neural-network method evaluates the whole spectrum shape and thus may be more sensitive to small (space-weathering-related) changes in the olivine bands. This was partially confirmed when we applied the method blindly to non-S-type asteroids. In the case of V-type asteroids, the BAR method results in up to 50 vol% of olivine (mean fraction of 27 vol%), which is unrealistic given the pyroxene-dominated composition of V-type asteroids. It therefore seems that the BAR method always divides the 1 µm band between olivine and pyroxene even when one of the constituents may be absent. For this reason, the BAR method is not a robust indicator for a broad range of mineral modal abundances.
6 Conclusions
We introduced a deep-learning-based method for estimating the modal and chemical compositions of olivine-pyroxene-rich mixtures, meteorites, and asteroids directly from their visible to near-infrared reflectance spectra. We used a feed-forward convo-lutional neural network with two hidden layers, which consisted of 24 and 8 different convolutional kernels.
We trained the model with real samples of silicates and their mixtures. The model predicted modal abundances and the mineral chemical compositions of olivine, orthopyroxene, and clinopyroxene. The predictions for most of the test-data samples had a precision better than 10 percentage points. Subsequently, we applied the method to derive the composition of olivine-pyroxene-rich asteroids with known taxonomy types. As expected, the V-type asteroids are dominated by orthopyroxene, and the A-type asteroids are made of almost pure olivine. The S-type and the Q-type asteroids have similar mineral chemical compositions among themselves, and the compositions are also similar to those of ordinary chondrites. The differences in predicted olivine modal abundances between the S-type and the Q-type asteroids are consistent with a lower resistance of olivine to space weathering. Standard deviations show that mineral chemical compositions are more homogeneous within the types, but modal abundances are potentially very different.
The main advantages of our neural-network model over legacy spectrum deconvolution methods are (1) the overall low computational and memory requirements during classification runs, (2) the versatility, demonstrated by the scores of the used reliability metrics over a wide range of olivine and pyroxene modal and chemical compositions, (3) the ability to process complete spectra without the need of prior derivation of spectral parameters (i.e. fitting and quantification of absorption band properties), and (4) the independence of our results of grain sizes, spectrally neutral phases, or shapes of continua. In the following years, space missions will generate extensive spectral datasets from asteroids or planets. Our code may be useful for these cases when a large number of spectra has to be evaluated in real time, for example, during hyperspectral remote sensing.
![]() |
Fig. 11 Scatter plots of the true and modelled olivine fraction and ferrosilite number of orthopyroxene in the olivine-pyroxene laboratory mixtures and meteorites in the test data. The diagonal lines delimit the accuracy of the models. |
![]() |
Fig. 12 Principal components of S*-complex asteroids. The direction of increasing space weathering is indicated by the red arrow. The model is based on BAR. |
Acknowledgements
This research is supported by Academy of Finland projects 335595 and 325805, NASA SSERVI Center for Asteroid and Lunar Surface Science (CLASS), and within institutional support RVO 67985831 of the Institute of Geology of the Czech Academy of Sciences. We would like to thank Francesca DeMeo and Richard Binzel for providing us the dataset on the asteroid spectra. We utilised data stored in the RELAB Spectral Database operated by Brown University and C-Tape database operated by The University of Winnipeg. This research has made use of NASA’s Astrophysics Data System Bibliographic Services. The authors would like to thank the anonymous referee for the valuable comments and suggestions, which greatly improved the quality of the paper.
Appendix A On-line supplement with scripts and datasets
The data and Python scripts used in our work are available online4 in version v1.0. The datasets, modules, and models are in the corresponding sub-folders. For this work, we present the results of six models, which are distinguished in REAMDE.md file in the Models/compositional sub-folder.
Appendix A.1 Datasets
All datasets were interpolated to an equidistant grid from 450 nm to 2450 nm with a step of 5 nm. Data files in the Datasets folder are distinguished by the suffix. That is the suffix ’-denoised’ means that the data were denoised with the Gaussian convolution kernel and the suffix ‘-norm’ means that the data were normalised at 550 nm. The description of the data, their labels, the wavelengths we used, and the known metadata is stored together in the data files (see README.md in the Datasets sub-folder for loading the data and metadata). Except for metadata, they also can be found in the README.md and are summarised in Table A.1. The known information about the data and metadata is stored in Sample_Catalogue.xlsx. This catalogue contains four sheets. Three sheets (RELAB, C-Tape, and TK) contain known metadata, and the fourth sheet (Chemical_analyses) summarises the chemical analyses. The metadata include details about the samples (e.g. grain sizes, sample types and subtypes, sample textures, sample source, and origin). From column R to the right, we list the labels, the conversion of the modal abundances from wt% into vol%, and the references to the labels.
Unit conversion
To describe the conversion of modal abundances from wt% m into vol%, or the mineral chemical composition from wt% to the end-members, we used the molar masses M and end-member densities ρ from Table A.2.
For an example of the conversion, we consider the ordinary chondrite with ID OC-SXS-022-D we find in the RELAB sheet in row 10 of the Sample_Catalogue.xlsx table. In Table 2 of the reference article (in the Notes column AH), we found in the case of olivine chemical composition, for instance, FeO 28.02 wt% and MgO 34.85 wt%. The fayalite and forsterite numbers (columns V and W) were computed from these two numbers and the molar masses of FeO and MgO as
(A.1)
(A.2)
To convert modal abundances in wt% into vol%, we used the values in Table 1 of the reference article. First, we computed the approximate densities of olivine, orthopyroxene, clinopyrox-ene, and plagioclase, for which we know the individual chemical composition. The density estimates correspond to the weight averages of the mineral chemical composition and end-member densities (cells from Z2 to AE2). For example, the density of orthopyroxene for the OC-SXS-022-D sample with Fs24.7, En72.3, and Wo3.0 (columns X, Y, and Z) is
(A.3)
which can be found in column AX. With the knowledge of densities, the non-normalised volume fractions of individual minerals were computed as the modal abundance in wt% divided by the density (columns BB to BE). The fractions were normalised to sum up to 100 to obtain the modal abundances in volume percent (columns R to U).
![]() |
Fig. A.1 Schematic visualisation of the neural-network pipeline. |
Datafile description
Appendix A.2 Modules
All Python scripts we used are well documented via comment lines and can be found in the modules sub-folder. The neural-network pipeline (main.py script) is visualised in the block diagram in Fig. A.1. The behaviour of the whole code is driven by the NN_config.py, where the user selects labels, hyperparameters, and selection criteria for the data, and may change the wavelength grid of the data and their normalisation.
Densities and molar masses used for unit conversions.
Appendix B Test models
In addition to the baseline OL-OPX-CPX model, we tested the behaviour of the neural network in five additional tasks: evaluation of the chemical composition of (1) pure olivine, (2) pure orthopyroxene, (3) pure and binary mixtures of olivine and orthopyroxene, (4) a model that also contains the modal abundance of plagioclase, and (5) a model that additionally contains a plagioclase chemical composition and the orthopyroxene wollastonite number. We note that for pure olivine, pure orthopy-roxene, and pure and binary olivine-orthopyroxene mixtures, we used models with fewer convolutional kernels and increased the regularisation parameters to overcome any overfitting of these models. The hyperparameters that differ from those in Table 2 are listed in Table B.1.
Appendix B.1 Models with pure olivine and orthopyroxene
For the very simple models for pure olivine and pure orthopy-roxene, we expected to obtain more accurate predictions than in case of the final model because pure samples have a stronger effect on the loss function. Therefore, better precision is naturally expected. This was confirmed in both cases. Furthermore, for these models we did not need to omit samples with low iron content, which gave us six more enstatite-rich orthopyroxenes (97.5 ≤ En ≤ 100.0). For the pure olivine model, we obtained RMSE = 4.4 pp, and the worst prediction was only 8.4 pp away from the actual value (i.e. all the predictions were within the 10 pp error interval). For the pure orthopyroxene model, we also obtained a better model with similar properties as for the pure olivine model: RMSE = 4.5 pp, and the worst prediction was 9.6 pp away from the actual value. The results are plotted in Figs. B.1–B.3. We note that the results in Fig. B.l can be directly comparable with the pure samples from Fig. 4. This does not hold for the pure orthopyroxene model because of the added enstatite-rich samples and the consequently different shuffling of data between training, validation, and test datasets.
Updated hyperparameters for less complex models.
![]() |
Fig. B.1 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with pure olivine. |
Appendix B.2 Model of olivine-orthopyroxene mixtures
For a slightly more complex model of pure olivine, pure orthopyroxene, and their binary mixtures, we omitted low-iron olivines and kept low-iron orthopyroxenes (the six additional enstatite rich samples). In this case, the mineral chemical compositions were of the same precision as in the final model: RMSEs were 6.4 and 5.2 pp for olivine and orthopyroxene compositions, respectively. The advantage of this model is that there were no mismatches between orthopyroxene and clinopyroxene. This led to a high precision in mineral modal abundance, where only four predictions (6%) were not within the 10 pp error interval, and over 90% of mineral chemical compositions were within the 10 pp error interval. These results are comparable to the final model when we ignore the orthopyroxene–clinopyroxene mismatches in modal abundances. The overall RMSE for modal composition was 4.7 pp and R2 = 0.99. The results of this model and the quantiles with absolute errors are plotted in Figs. B.4–B.7.
In the zeroth approximation, we can estimate that S-, Q-, V-, and A-type asteroids are olivine-orthopyroxene mixtures. We applied this model on the asteroid samples. For the mineral chemical compositions, the predictions are the same within one standard deviation as for the final model. As expected, the mineral modal abundances of the mean V-type and A-type asteroids are made of almost pyre orthopyroxene (96.5 vol%) and pure olivine (100.0 vol%), respectively. Similarly as in the final model, the mean Q type is dominated by olivine (82.0 vol%) and the mean S types are dominated by orthopyroxene (76.0 vol%). This difference is due to the space-weathered surface of S-type asteroids and to our non-space-weathered dataset. The standard deviation in modal abundances of S-type asteroids, which is equal to 21 pp, indicates a different level of space weathering in this class. This is also shown in Fig. B.8.
![]() |
Fig. B.2 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with pure orthopyroxene. |
![]() |
Fig. B.3 Quantiles of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the models with pure olivine and pure orthopyroxene together. |
![]() |
Fig. B.4 Scatter plots of the true and predicted modal compositions of the test data. The diagonal lines delimit the accuracy of the predictions. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
![]() |
Fig. B.5 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
![]() |
Fig. B.6 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
![]() |
Fig. B.7 Quantiles of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
![]() |
Fig. B.8 Principal components of S*-complex asteroids. The predicted olivine fraction decreases in the direction of increasing space weathering, which is indicated by the red arrow. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
Appendix B.3 Model with plagioclase modal abundances
In the following two models, we also incorporated modal and complete (modal and chemical) information about plagioclase (PLG). This includes anorthosite (An), albite (Ab), and ortho-clase (Or). The latter model also included orthopyroxene wol-lastonite information. This slightly changed the numbers and classification of our samples into different mixture categories in Table 1. In total, we had 44 pure plagioclase spectra and 15 additional mixtures containing a significant portion of plagioclase (e.g. lunar soils). On the other hand, we did not know the specific plagioclase chemical composition for all the non-plagioclase mixtures. In the model in Appendix B.4, we therefore removed 36 non-plagioclase mixtures.
The additional information about the modal fraction of plagioclase in the model had significant effect on the predictions of the mineral modal compositions. They were significantly worse (see Fig. B.9). In addition to the orthopyroxene-clinopyroxene mismatches, there are additional outliers in the olivine modal composition, and the accuracy of predictions was worse in general. In the 10 pp error interval, there are only 82%, 77%, 84%, and 89% predicted modal compositions of olivine, orthopyroxene, clinopyroxene, and plagioclase, respectively. This can be caused either by a limited dataset with an intermediate plagioclase modal composition or by the fact that the plagioclase 1.3 µm band is due to Fe2+ cations that naturally substitute Ca2+ cations in the structure of plagioclase (Burns 1989) rather than due to the main Ca2+, Na+, or K+ cations. Therefore, the shape of the plagioclase spectrum may vary significantly, while the abundance of the main cations is almost the same, and the 1.3 µm band of plagioclase can be misinterpreted, for example, as the contribution of the redmost band of olivine or of the weak band of orthopyroxene. We note that the predictions of the mineral chemical compositions were not affected by the additional information (see Figs. B.10–B.13).
![]() |
Fig. B.9 Scatter plots of the true and predicted modal compositions of the test data. We show the model with plagioclase modal abundance information. |
![]() |
Fig. B.10 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with plagioclase modal abundance information. |
![]() |
Fig. B.11 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with plagioclase modal abundance information. |
![]() |
Fig. B.12 Quantiles of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the model with plagioclase modal abundance information. |
![]() |
Fig. B.13 Scatter plots of the true and predicted chemical compositions of clinopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of clinopyroxene in the samples. We show the model with plagioclase modal abundance information. |
Appendix B.4 Model with complete information
The most complex model also contains the orthopyroxene wol-lastonite number and the chemical composition of plagioclase. For this model, we applied additional penalisation terms to avoid predictions in regions of non-existent mineral solid solution (i.e. the region where An > 15 and Ab < 50 and Or > 15 for plagioclase and Wo > 10 for orthopyroxene). As expected from the previous test models, the mineral modal abundances and mineral chemical compositions of olivine, orthopyroxene, and clinopyroxene were not affected by this additional information (see Fig. B.14-B.17); the maximum predicted wollastonite number of orthopyroxene was lower than one. This model totally failed in predictions of the plagioclase chemical compositions (see Fig. B.18) because the shape of plagioclase spectrum is determined by the presence or absence of Fe2+ cations and not by the main cations. Considering the 10 pp error interval, 88%, 81%, 85%, and 93% on olivine, orthopyroxene, clinopyroxene, and plagioclase modal composition are in it. For the mineral chemical compositions, 93% of olivine Fa or Fo are within the 10 pp error interval; 91%, 92%, and 100% of orthopyroxene Fs, En, and Wo; 70%, 77%, and 81% of clinopyroxene Fs, En, and Wo; but only 55%, 44%, and 88% of plagioclase An, Ab, and Or are within the interval (see Fig. B.19).
![]() |
Fig. B.14 Scatter plots of the true and predicted modal compositions of the test data. We show the model with complete orthopyroxene and plagioclase information. |
![]() |
Fig. B.15 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with complete orthopyroxene and plagioclase information. |
![]() |
Fig. B.16 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with complete orthopyroxene and plagioclase information. |
![]() |
Fig. B.17 Scatter plots of the true and predicted chemical compositions of clinopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of clinopyroxene in the samples. We show the model with complete orthopyroxene and plagioclase information. |
![]() |
Fig. B.18 Scatter plots of the true and predicted chemical compositions of plagioclase in the test data. Left: calcium content. Middle: sodium content. Right: potassium content. The colours of the points correspond to the actual modal abundance of plagioclase in the samples. We show the model with complete orthopyroxene and plagioclase information. |
![]() |
Fig. B.19 Quantites of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the model with complete orthopyroxene and plagioclase information. |
Appendix B.5 Summary of the test models
Based on the models we tested, it seems that when we added another chemical property to our samples, it did not worsen the predictions of the previous properties. The test shows that the chemical properties of olivine and orthopyroxene can be predicted with very high accuracy, and the chemical properties of clinopyroxene can be predicted with acceptable accuracy. Moreover, the wollastonite number in case of orthopyroxene can be neglected. Furthermore, the models show that the chemical properties of plagioclase cannot be reliably obtained from the reflectance spectra.
On the other hand, additional information about mineral modal abundances caused misinterpretations of the model. The misinterpretations were mostly between orthopyroxene and clinopyroxene, which are natural and cannot be fully removed. Other misinterpretations were between the 1.3 µm absorption band of plagioclase, which overlaps with the weak 1.25 µm band of orthopyroxene, and the redmost absorption band of olivine. For these reasons, we decided to omit plagioclase modal and chemical compositions and the wollastonite number of orthopyroxene from the final model.
Appendix C Evaluation of the Chelyabinsk meteorite
The model predictions for impact-melted and shock-darkened mixtures of the Chelyabinsk meteorite are listed in Tables C.1 and C.1. The designation follows Kohout et al. (2020). In the top part of the tables, we repeat the XRD-derived values of Reddy etal. (2014).
Modal abundances and mineral chemical composition of the Chelyabinsk meteorite mixed with the impact-melted phase.
Modal abundances and mineral chemical composition of the Chelyabinsk meteorite mixed with the shock-darkened phase.
Appendix D Evaluation of ion-irradiated samples
The model predictions for the ion-irradiated samples of pure olivine and pure pyroxene are summarised in Table D.I. The designation follows Chrbolková et al. (2021).
Modal abundances and mineral chemical composition of the ion-irradiated pure olivine and pyroxene samples.
References
- Adams, J. B. 1974, J. Geophys. Res., 79, 4829 [NASA ADS] [CrossRef] [Google Scholar]
- Angel, J. R. P., Wizinowich, P., Lloyd-Hart, M., & Sandler, D. 1990, Nature, 348, 221 [NASA ADS] [CrossRef] [Google Scholar]
- Binzel, R. P., Morbidelli, A., Merouane, S., et al. 2010, Nature, 463, 331 [NASA ADS] [CrossRef] [Google Scholar]
- Binzel, R. P., DeMeo, F. E., Turtelboom, E. V., et al. 2019, Icarus, 324, 41 [Google Scholar]
- Burns, R. G. 1989, Mineral. Mag., 53, 135 [NASA ADS] [CrossRef] [Google Scholar]
- Bus, S. J. 1999, PhD thesis, Massachusetts Institute of Technology, USA [Google Scholar]
- Chollet, F., et al. 2015, Keras, https://github.com/fchollet/keras [Google Scholar]
- Chrbolková, K., Brunetto, R., Durech, J., et al. 2021, A&A, 654, A143 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Civilini, F., Weber, R. C., Jiang, Z., Phillips, D., & Pan, W. D. 2021, Geophys. J. Int., 225, 2120 [NASA ADS] [CrossRef] [Google Scholar]
- Cloutis, E. A., Gaffey, M. J., Jackowski, T. L., & Reed, K. L. 1986, J. Geophys. Res., 91, 11–641 [NASA ADS] [Google Scholar]
- de León, J., Licandro, J., Serra-Ricart, M., Pinilla-Alonso, N., & Campins, H., 2010 A&A, 517, A23 [CrossRef] [EDP Sciences] [Google Scholar]
- DeMeo, F. E., Binzel, R. P., Slivan, S. M., & Bus, S. J. 2009, Icarus, 202, 160 [Google Scholar]
- Dunn, T. L., McSween, Harry Y. J. Jr., McCoy, T. J., & Cressey, G. 2010, M&PS, 45, 135 [NASA ADS] [Google Scholar]
- Fornasier, S., Barucci, M. A., Binzel, R. P., et al. 2003, A&A, 398, 327 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Gaffey, M. J. 1984, Icarus, 60, 83 [NASA ADS] [CrossRef] [Google Scholar]
- Gaffey, M. J., Cloutis, E. A., Kelley, M. S., & Reed, K. L. 2002, in Asteroids III (Tucson: University of Arizona Press), 183 [Google Scholar]
- Goodfellow, I., Bengio, Y., & Courville, A. 2016, Deep Learning (MIT Press) [Google Scholar]
- Grady, M. M. 2000, Catalogue of Meteorites (Cambridge University Press) [Google Scholar]
- Han, H.-J., Lu, X.-P., Yang, Y.-Z., Zhang, H., & Mutelo, A. M. 2020, Res. Astron. Astrophys., 20, 129 [CrossRef] [Google Scholar]
- Hon, M., Stello, D., & Yu, J. 2018, MNRAS, 476, 3233 [NASA ADS] [CrossRef] [Google Scholar]
- Howell, E. S., Merenyi, E., & Lebofsky, L. A. 1994, J. Geophys. Res., 99, 10847 [NASA ADS] [CrossRef] [Google Scholar]
- Kandasamy, K., Neiswanger, W., Schneider, J., Poczos, B., & Xing, E. 2018, ArXiv e-prints [arXiv: 1802.07191] [Google Scholar]
- Kanner, L. C., Mustard, J. F., & Gendrin, A. 2007, Icarus, 187, 442 [NASA ADS] [CrossRef] [Google Scholar]
- Kingma, D. P., & Ba, J. 2014, ArXiv e-prints [arXiv: 1412.6980] [Google Scholar]
- Kohout, T., Penttilä, A., Mann, P., et al. 2020, Planet. Sci. J., 1, 37 [NASA ADS] [CrossRef] [Google Scholar]
- Lantz, C., Brunetto, R., Barucci, M. A., et al. 2017, Icarus, 285, 43 [CrossRef] [Google Scholar]
- Lieu, M., Conversi, L., Altieri, B., & Carry, B. 2019, MNRAS, 485, 5831 [NASA ADS] [CrossRef] [Google Scholar]
- Loeffler, M. J., Dukes, C. A., & Baragiola, R. A. 2009, J. Geophys. Res. (Planets), 114, E03003 [CrossRef] [Google Scholar]
- Marchi, S., Brunetto, R., Magrin, S., Lazzarin, M., & Gandolfi, D. 2005, A&A, 443, 769 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- McSween, H. Y., Bennett, M. E., & Jarosewich, E. 1991, Icarus, 90, 107 [NASA ADS] [CrossRef] [Google Scholar]
- Misra, A., & Bus, S. J. 2008, in AAS/Division for Planetary Sciences Meeting Abstracts, AAS/Division for Planetary Sciences Meeting Abstracts, 40, 60.03 [Google Scholar]
- Ni, Y., Li, Y., Lachance, P., et al. 2021, MNRAS, 507, 1021 [NASA ADS] [CrossRef] [Google Scholar]
- Odewahn, S. C., Stockwell, E. B., Pennington, R. L., Humphreys, R. M., & Zumach, W. A. 1992, AJ, 103, 318 [NASA ADS] [CrossRef] [Google Scholar]
- Pearson, K. A., Palafox, L., & Griffith, C. A. 2018, MNRAS, 474, 478 [NASA ADS] [CrossRef] [Google Scholar]
- Penttilä, A., Hietala, H., & Muinonen, K. 2021, A&A, 649, A46 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Penttilä, A., Fedorets, G., & Muinonen, K. 2022, Front. Astron. Space Sci., 9, 816268 [CrossRef] [Google Scholar]
- Popescu, M., Birlan, M., & Nedelcu, D. A. 2012, A&A, 544, A130 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
- Quadery, A. H., Pacheco, S., Au, A., et al. 2015, J. Geophys. Res. (Planets), 120, 643 [NASA ADS] [CrossRef] [Google Scholar]
- Reddy, V., Sanchez, J. A., Bottke, W. F., et al. 2014, Icarus, 237, 116 [NASA ADS] [CrossRef] [Google Scholar]
- Reddy, V., Dunn, T. L., Thomas, C. A., Moskovitz, N. A., & Burbine, T. H. 2015, in Asteroids IV (Tucson: University of Arizona Press), 43 [Google Scholar]
- Rigol-Sanchez, J. P., Chica-Olmo, M., & Abarca-Hernandez, F. 2003, Int. J. Rem. Sens., 24, 1151 [NASA ADS] [CrossRef] [Google Scholar]
- Ross, Z. E., Meier, M.-A., Hauksson, E., & Heaton, T. H. 2018, Bull. Seismol. Soc. Am., 108, 2894 [CrossRef] [Google Scholar]
- Sasaki, S., Hiroi, T., Nakamura, K., et al. 2002, Adv. Space Res., 29, 783 [NASA ADS] [CrossRef] [Google Scholar]
- Snider, S., Allende Prieto, C., von Hippel, T., et al. 2001, ApJ, 562, 528 [NASA ADS] [CrossRef] [Google Scholar]
- Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. 2014, J. Mach. Learn. Res., 15, 1929 [Google Scholar]
- Sunshine, J. M., Pieters, C. M., & Pratt, S. F. 1990, J. Geophys. Res., 95, 6955 [NASA ADS] [CrossRef] [Google Scholar]
- Tholen, D. J. 1984, PhD thesis, University of Arizona, Tucson, USA [Google Scholar]
- Wallace, S. M., Dyar, M. D., Burbine, T. H., & Sheldon, D. 2021, in Lunar and Planetary Science Conference, 2507 [Google Scholar]
All Tables
Mean values and standard deviations of the model predictions for the asteroid spectra.
Mean values and standard deviations of the model predictions for the S-complex asteroid spectra.
Modal abundances and mineral chemical composition of the space-weathered olivine samples.
Modal abundances and mineral chemical composition of the space-weathered pyroxene samples.
Modal abundances and mineral chemical composition of the Chelyabinsk meteorite mixed with the impact-melted phase.
Modal abundances and mineral chemical composition of the Chelyabinsk meteorite mixed with the shock-darkened phase.
Modal abundances and mineral chemical composition of the ion-irradiated pure olivine and pyroxene samples.
All Figures
![]() |
Fig. 1 Mineralogical distributions of the data. |
In the text |
![]() |
Fig. 2 Denoised and normalised spectra of silicates, their mixtures, and asteroids utilised in this study. |
In the text |
![]() |
Fig. 3 Scatter plots of the true and predicted modal compositions of the test data. The diagonal lines delimit the accuracy of the predictions. Error bars in actual values (horizontal) are considered as a conservative analytical error of 3 pp, and in predicted values (vertical), they correspond to the RMSE errors given in Table 3. |
In the text |
![]() |
Fig. 4 Scatter plots of the true and the predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. |
In the text |
![]() |
Fig. 5 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. |
In the text |
![]() |
Fig. 6 Scatter plots of the true and predicted chemical compositions of clinopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of clinopyroxene in the samples. |
In the text |
![]() |
Fig. 7 Quantites of the absolute errors between the actual and predicted values in pp. The dashed black lines indicate the 10 pp and 20 pp error. |
In the text |
![]() |
Fig. 8 Density plots based on error predictions (predicted minus actual) and specific compositions. The dashed lines delimit zero error. Top and middle rows: concentrations of the test data as a function of composition and error. Bottom row: error distributions. Biases are present when the error is dependent on the composition, the mean value of error is significantly non-zero, or when the error distributions are asymmetrical. |
In the text |
![]() |
Fig. 9 Principal components of S*-complex asteroids. The predicted olivine fraction decreases in the direction of increasing space weathering, which is indicated by the red arrow. |
In the text |
![]() |
Fig. 10 Iron contents in olivine and orthopyroxene in the mean S-and Q-type asteroids. The boxes roughly delimit the types of ordinary chondrites. Red, green, and blue correspond to H, L, and LL types, respectively. |
In the text |
![]() |
Fig. 11 Scatter plots of the true and modelled olivine fraction and ferrosilite number of orthopyroxene in the olivine-pyroxene laboratory mixtures and meteorites in the test data. The diagonal lines delimit the accuracy of the models. |
In the text |
![]() |
Fig. 12 Principal components of S*-complex asteroids. The direction of increasing space weathering is indicated by the red arrow. The model is based on BAR. |
In the text |
![]() |
Fig. A.1 Schematic visualisation of the neural-network pipeline. |
In the text |
![]() |
Fig. B.1 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with pure olivine. |
In the text |
![]() |
Fig. B.2 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with pure orthopyroxene. |
In the text |
![]() |
Fig. B.3 Quantiles of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the models with pure olivine and pure orthopyroxene together. |
In the text |
![]() |
Fig. B.4 Scatter plots of the true and predicted modal compositions of the test data. The diagonal lines delimit the accuracy of the predictions. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
In the text |
![]() |
Fig. B.5 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
In the text |
![]() |
Fig. B.6 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
In the text |
![]() |
Fig. B.7 Quantiles of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
In the text |
![]() |
Fig. B.8 Principal components of S*-complex asteroids. The predicted olivine fraction decreases in the direction of increasing space weathering, which is indicated by the red arrow. We show the model with pure olivine, pure orthopyroxene, and their binary mixtures. |
In the text |
![]() |
Fig. B.9 Scatter plots of the true and predicted modal compositions of the test data. We show the model with plagioclase modal abundance information. |
In the text |
![]() |
Fig. B.10 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with plagioclase modal abundance information. |
In the text |
![]() |
Fig. B.11 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with plagioclase modal abundance information. |
In the text |
![]() |
Fig. B.12 Quantiles of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the model with plagioclase modal abundance information. |
In the text |
![]() |
Fig. B.13 Scatter plots of the true and predicted chemical compositions of clinopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of clinopyroxene in the samples. We show the model with plagioclase modal abundance information. |
In the text |
![]() |
Fig. B.14 Scatter plots of the true and predicted modal compositions of the test data. We show the model with complete orthopyroxene and plagioclase information. |
In the text |
![]() |
Fig. B.15 Scatter plots of the true and predicted chemical compositions of olivine in the test data. Left: iron content. Right: magnesium content. The colours of the points correspond to the actual modal abundance of olivine in the samples. We show the model with complete orthopyroxene and plagioclase information. |
In the text |
![]() |
Fig. B.16 Scatter plots of the true and predicted chemical compositions of orthopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of orthopyroxene in the samples. We show the model with complete orthopyroxene and plagioclase information. |
In the text |
![]() |
Fig. B.17 Scatter plots of the true and predicted chemical compositions of clinopyroxene in the test data. Left: iron content. Middle: magnesium content. Right: calcium content. The colours of the points correspond to the actual modal abundance of clinopyroxene in the samples. We show the model with complete orthopyroxene and plagioclase information. |
In the text |
![]() |
Fig. B.18 Scatter plots of the true and predicted chemical compositions of plagioclase in the test data. Left: calcium content. Middle: sodium content. Right: potassium content. The colours of the points correspond to the actual modal abundance of plagioclase in the samples. We show the model with complete orthopyroxene and plagioclase information. |
In the text |
![]() |
Fig. B.19 Quantites of the absolute errors between the actual and predicted values. The dashed black lines indicate the 10 pp and 20 pp errors. We show the model with complete orthopyroxene and plagioclase information. |
In the text |
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.