A&A 401, 1203-1213 (2003)
DOI: 10.1051/0004-6361:20030195
P. G. Willemsen1 - C. A. L Bailer-Jones2 - T. A. Kaempf1 - K. S. de Boer1
1 - Sternwarte der Universität Bonn, Auf dem Hügel 71, 53121
Bonn, Germany
2 - Max-Planck-Institut für Astronomie, Königstuhl 17, 69117 Heidelberg, Germany
Received 4 November 2002 / Accepted 5 February 2003
Abstract
We have assessed how well stellar parameters (
,
and [M/H]) can be
retrieved from low-resolution dispersed images to be obtained by
the DIVA satellite. Although DIVA is primarily an all-sky astrometric mission, it
will also obtain spectrophotometric information for about 13 million stars
(operational limiting magnitude
mag). Constructional studies foresee a grating
system yielding a dispersion of
200 nm/mm on the focal plane (first
spectral order). For astrometric reasons there will be no cross dispersion
which results in the overlapping of the first to third diffraction orders. The
one-dimensional, position related intensity function is called a DISPI ( DISPersed Intensity).
We simulated DISPIs from synthetic spectra taken from Lejeune et al. (1997) and Lejeune et al. (1998) but for a limited range of metallicites, i.e. our results are for [M/H] in the range -0.3 to 1 dex. We show that there is no need to deconvolve these low resolution signals in order to obtain basic stellar
parameters. Using neural network methods and by including simulated data of
DIVA's UV telescope, we can determine
to an average accuracy of
about 2% for DISPIs from stars with 2000 K
20 000 K
and visual magnitudes of V=13 mag (end of mission data).
can be
determined for all temperatures with an accuracy better than 0.25 dex for
magnitudes brighter than V=12 mag. For low temperature stars with 2000 K
5000 K and for metallicities in the range -0.3 to +1 dex a determination of [M/H] is possible (to better than 0.2 dex) for these magnitudes. For higher temperatures, the metallicity signatures are exceedingly weak at DISPI resolutions so that the determination of [M/H] is there not possible.
Additionally we examined the effects of extinction E(B-V) on DISPIs and found
that it can be determined to better than 0.07 mag for magnitudes brighter than
V=14 mag if the UV information is included.
Key words: astrometry - stars: fundamental parameters - methods: data analysis - techniques: spectroscopic
DIVA will perform an all-sky survey with a limiting visual magnitude of
mag. Note that every observed star will be measured about 120 times in
the course of the mission. The stated magnitude limits refer to the combined
images of all single measurements. The measurements include the precise
determination of positions, trigonometric parallaxes, proper motions, colours
and magnitudes. For about 13 million stars, spectrophotometric data will also be
obtained down to a visual magnitude of
.
An additional
UV telescope will perform photometry in two spectral ranges adjacent to the
Balmer jump.
![]() |
Figure 1: Wavelength versus position on the spectroscopic CCD (SC) for the first, second and third order. The zeroth order which is made up of undispersed (white) light would lie at about pixel position 122, but is not shown. |
Open with DEXTER |
The DIVA survey represents a large scale and deep astrometric and photometric survey of the local part in our Galaxy. The importance of these data to modern astrophysics will be significant, with applications ranging from stellar structure and evolution to cosmological aspects. Examples are a precise determination of the luminosity function in the solar neighbourhood, a better understanding of the structure and formation of our galaxy, the estimation of the amount of dark matter as well as a better calibration of the cosmological distance ladder (Röser 1999).
After the mission the photometric and spectrophotometric images will be used to
obtain the brightness, the colour and the DISPIs for the stars. The DISPIs will allow to derive the astrophysically relevant parameters
,
,
[M/H] and E(B-V).
Especially the derivation of
is of importance for objects too distant to
result in an accurate parallax. With these objects in mind, we have carried out
the present study. We will demonstrate that the essential parameters of the
stars can be retrieved with a reasonable level of accuracy from the DISPIs alone. We will show that astrophysical parameters can be well derived down to the survey limit, perhaps even adequately for stars 1 to 2 mag fainter. There are good scientific arguments to reach fainter in selected fields, see e.g. Salim et al. (2002).
The maximum transmission of the first order is at about 750 nm (see Fig. 2), while that of the second and third orders are at about 380 and 250 nm respectively (see also Scholz 1998; Scholz 2000).
Given the nature of the DISPIs, a classical spectrophotometric analysis - like line and continuum fitting - to derive astrophysical parameters is cumbersome. We will show that by training Artificial Neural Networks (ANNs) on simulated DISPIs, we can readily access this information without any further pre-processing of the signal. Using DISPIs from calibration stars, i.e. stars with known physical and apparent properties, we would initially build up a standard set of DISPI data. The automated classification technique as developed will then use this library. The calibration could be iteratively improved using DISPIs and their parametrization results obtained during the mission. Note that in these simulations, we did not use absolute fluxes as they will be available from the mission (see below). In this work, only the shape of a DISPI and its line features were used for tests to determine basic stellar parameters.
![]() |
Figure 2: Transmission curves for the first, second and third grating orders, plus the quantum efficiency. The order's curves have been scaled down by a factor of 0.6 to better represent data from the real grating. |
Open with DEXTER |
V[mag] | S/N |
![]() |
|
8 | 122 |
9 | 77 |
10 | 47 |
11 | 28 |
12 | 16 |
![]() |
|
8 | 91 |
9 | 56 |
10 | 34 |
11 | 20 |
12 | 11 |
![]() |
|
8 | 82 |
9 | 50 |
10 | 30 |
11 | 17 |
12 | 9 |
![]() |
|
8 | 95 |
9 | 59 |
10 | 36 |
11 | 21 |
12 | 12 |
![]() |
|
8 | 118 |
9 | 74 |
10 | 46 |
11 | 28 |
12 | 16 |
The signal-to-noise ratio of a single DISPI as measured from the 10 central pixels around effective pixel position 60 is shown for different visual magnitudes and temperatures in Table 1.
![]() |
Figure 3:
Input spectrum (top) and simulated DIVA DISPI (bottom) for a star with
![]() ![]() |
Open with DEXTER |
![]() |
Figure 4:
Input spectrum (top) and simulated DIVA DISPI (bottom) for a star with
![]() ![]() ![]() ![]() ![]() |
Open with DEXTER |
A network consists of an input layer, one or two hidden layers and an output
layer. Each layer is made up of several nodes. All the nodes in one layer are
connected to all the nodes in the preceding and/or following layers. These
connections have adaptable "weights", so that each node performs a weighted sum
of all its inputs and passes this sum through a nonlinear transfer function.
That weighted sum is then passed on to the next layer. Before the network can be
used for parametrisation, it needs to be trained, meaning the weights have to be set to
their appropriate values to perform the desired mapping. In this process,
DISPIs together with known stellar parameters as target values are presented
to the network. From these data, the optimum weights are determined by
iteratively adjusting the weights between the layers to minimize an output
error, i.e. the discrepancy between the targets and the network outputs. This
is performed by a multidimensional numerical minimization, in this case with
the conjugate gradients method. When this minimization converges, the weights
are fixed and the network can be used in its "application" phase: now, only the
DISPI input flux vector is presented and the network's outputs produce the
stellar parameters of these DISPIs. Since we used only the central 51 effective pixels of the DISPIs (range 30 to 80, see Figs. 3 and 4), the input layer of
the network was always made up of the same number of nodes, i.e. 51. We found that the
performance was best when using two hidden layers, each containing 7 nodes.
More nodes did not improve the result significantly but increased the training
time considerably. With four output parameters this network then contains
weights (plus 18 bias weights).
Since we wanted to classify DISPIs solely based on their shapes, the absolute flux
information was removed by area-normalizing each DISPI, i.e. each flux bin
of a given DISPI was divided by the total number of counts in that DISPI.
Given the non-uniform distribution of the training data over
,
we classified DISPIs in terms of
log
instead of
.
Note that, in our tests, we have not included distance information as it eventually might be done using DIVA parallaxes, since the present goal was to test the retrieval of stellar parameters from DISPIs only.
The parametrization errors given below are the average (over some set of DISPIs) errors for each parameter, i.e.
As input spectra we used synthetic spectra from Lejeune et al. (1997, 1998). In total there were about 5600 spectra covering a parameter grid
with 68 values for
between 2000 K and 50 000 K (in steps of 200 K for the
low temperature star, and 2500 K for the high temperature stars), 19 possible values for
ranging from
in steps of approx. 0.1 to 0.3 dex
and 13 values for [M/H] with
[M/H]
in steps of 0.5 and 0.1
dex. Note that in our tests there were no input data for metallicities in the range from -2.5 to -0.3 dex
.
The obvious advantages of using synthetic spectra are the complete wavelength range from 200 to 1200 nm and the large number of spectra over a large parameter space. We are currently constructing a library of (previously published) real stellar spectra. However, since it combines spectra from many different available catalogues, there is a considerable heterogeneity among these data. Moreover, few stars have been observed with the desired wavelength range from the UV to the IR.
Interstellar extinction was modelled by using a synthetic extinction curve for R=3.1 given as
versus
.
We used the extinction curve from Fitzpatrick (1999),
simulating 7 different extinction values in steps of 0.15 and 0.2 in the range
E(B-V)
mag. Note that the zeroth order was
omitted in this and all other simulations, so that we only worked with dispersed
images made up of the first to third spectral order. Since the data were
area-normalized before passing them through the neural network, the magnitude
information of the zeroth order is lost anyway. For the
simulation of the UV-telescope (see below), the same extinction curve was applied.
This procedure was done for five different visual magnitudes in the range from
mag.
Noise was added to these two-dimensional intensity distributions by passing
them through another software tool developed by Ralf Scholz.
Here, a mean sky backround of
sky = 0.04 e-/(pix s) and a dark current of
dark = 2 e-/(pix s) were added with additional source and sky Poisson
noise. The CCD's read out noise was 2 e-/eff.pix.
![]() |
Figure 5: A dispersed image for an M type star, V = 10 mag (with noise) as generated by the simulation software from Scholz (1998). Note the contribution of the zeroth order seen as a single intensity "blob" in the upper part of the image. The first, second and third spectral order are all overlapping (lower intensity "stretch") due to the grating. The intensity stripes to the side of the dispersed image are due to diffraction at the telescope's aperture. |
Open with DEXTER |
The size of the SC window to be cut from the on-board data stream around the DISPI is crucial as it determines the data rate which is in turn related to the satellite's overall performance: a smaller window permits a larger number of SC windows (objects) to be transmitted. This would, for example, permit a fainter magnitude limit. The optimum window size, i.e. the window around a dispersed image with the highest amount of important and lowest amount of redundant information, was investigated in earlier studies. Concerning the window size in the cross dispersion direction it was found that the innermost 7 pixel are sufficient (Hilker et al. 2001) in terms of highest S/N. However, due to the satellite's intrinsic attitude uncertainty it is required that the smallest acceptable window size in the scanning direction be 12 pixel (see Bastian & Schilbach 2001). For our studies we therefore summed up the TDI-rows over the innermost 13 rows (6 pixels in each direction about the central row). Future work will use a profile fit to obtain the stellar intensity.
The optimum size in the dispersion direction was evaluated by S/N studies and
the (spectral) information content. This amount of information was measured by
the ability of Neural Networks to determine the stellar parameters
,
and [M/H] for different ranges of DISPIs. It was found
(Willemsen et al. 2001) that these parameters can be adequately retrieved from
approximately 45 effective pixels around the maximum intensity in the DISPIs (which is at about effective pixel 60). However, since these earlier studies
included only DISPIs with
4000 K and since the overall
intensity distribution moves to smaller effective pixel values for lower
temperatures, we chose the range from 30 to 80 effective pixels in this work.
This should also be appropriate for very red objects like L and M dwarfs with
1200-4000 K.
For further processing, the simulated sky was subtracted from the dispersed image by evaluating the background level from a single column in scanning direction next to the dispersed image.
The UV imaging telescope will make use of the same type of CCD's as
the main instrument. The UV magnitudes in the two different passbands next to the
Balmer jump were calculated from the same synthetic spectra as described above,
simply by integrating the flux in the ranges from 310 to 360 nm and 380 to 410
nm. Of course, the true filters will not have exactly square transmission curves,
but this approximation is sufficient for a first analysis of the influence of the UV channel.
The two UV flux values
were fed into the network in three different ways. First, we calculated the
of the flux ratio, i.e.
)
(note that the
- function is not undefined for negative values, in contrast to the
log-function. Negative values might occur due to noise for very low temperature
stars with almost no flux in the UV). This ratio is designed to be sensitive to the Balmer jump thus yielding additional information about gravity
and temperature. Second, we summed up the intensity in a DISPI in the
range 70 to 80 effective pixel (
)
and calculated the
ratios (
)
and
(
). Since the first order's contribution in the
selected effective pixel range corresponds to a wavelength range from about 550
to 600 nm (see Fig. 1), these ratios should be a good measure of
extinction due to the long "lever" ranging from the UV to the visual/red part
of the spectrum.
![]() |
Figure 6:
One-hundred added frames each of the "single'' visual magnitude given on the x-axis yield the same S/N as a star which is ![]() |
Open with DEXTER |
We found that the separation into such small temperature regions yielded
improved parametrization results. This is understandable as the classification
results for the stellar parameters, especially
and [M/H], depend
upon the presence of spectral features in a DISPI which are also closely
related to the temperature of a star. This effect was also found in
Weaver & Torres-Dodgen (1997), using spectra in the near-infrared and classifying
them in terms of MK stellar types and luminosity classes. For our ANN work we chose simple temperature ranges, also aiming at database subsamples of similar size. Though some of the intervals roughly correspond
to the temperatures found in certain MK classes which are characterized by
certain line-ratios, i.e. common physical characteristics, the chosen distinction was
motivated to allow for a reasonable training time for the networks. Another
reason was to see what can be learned from DISPIs in different temperature
regimes in principle. The mid-temperature samples
(M-samples) were defined for the range in which the
Balmer jump and the H-lines (e.g. H
)
change their meaning as
indicators for temperature and surface gravity (see e.g. Napiwotzki et al. 1993).
Under real conditions one would have to employ a broad classifier to first separate
DISPIs into smaller (possibly overlapping) temperature ranges. This could be
also based on neural networks, but also on other methods, such as minimum
distance methods. Each temperature sample was finally divided into two
disjoint parts, the training- and the application data. This means that
our classification results (see Sect. 6) are from DISPIs in the
gaps of our training grid.
The generalization performance of a network or its ability to classify
previously unseen data is influenced by three factors: The size of
the training set (and how representative it is), the architecture of
the network and the physical complexity of the specific problem, which also
includes the presence of noise. Though there are distribution-free, worst-case
formulae for estimating the minimum size of the training set (based on
the so called VC dimension, see also Haykin 1999), these are often
of little value in practical problems. As a rule of thumb, it is
sometimes stated (Duda et al. 2000) that there should be (W
10)
different training samples in the training set, W denoting the total
number of free parameters (i.e. weights) in the network. In our
network without extinction there were 452 weights. Thus, in some
cases, there were fewer training samples than free
parameters. However, we found good generalization performance (see
Sect. 6 and results therein). This may be due both to (1)
the "similarity" of the DISPIs in a specific
range, giving
rise to a rather smooth (well-behaved) input-output function to be
approximated, and (2) redundancy in the input space. Both give rise to
a smaller number of effective free parameters in the network.
We also tested whether there are significant differences between
determining each parameter separately in different networks and determining all parameters
simultaneously. In the first case each network would have only one ouput node while in the latter case the network had multiple outputs. If the parameters (
,
etc.) were independent of each other, one could train a
network for each parameter separately. However, we know that the
stellar parameters influence a stellar energy distribution
simultaneously at least for certain parameter ranges, (e.g. hot stars
show metal lines less clearly than cool stars).
Also, for specific spectral features, changes in the chemical composition [M/H] can sometimes mimic gravity effects (see for example Gray 1992). Varying
extinction can cause changes in the slope of a stellar energy distribution
which are similar to those resulting from a different temperature.
sample | temperature range | without/with ext. |
L1 | 2000 K ![]() ![]() |
330/2300 |
L2 | 4000 K ![]() ![]() |
570/3980 |
L3 | 6000 K ![]() ![]() |
500/3500 |
M1 | 8000 K ![]() ![]() |
400/2800 |
M2 | 10 000 K ![]() ![]() |
180/1200 |
H1 | 12 000 K ![]() ![]() |
390/2700 |
H2 | 20 000 K ![]() ![]() |
450/3100 |
Recently, Snider et al. (2001) determined stellar parameters for low-metallicity stars from stellar spectra (wavelength range from 3800 to 4500 Å). They reported better classification results when training networks on each parameter separately. We tested several network topologies with the number of output nodes ranging from 1 to 3 (in case of extinction from 1 to 4) in different combinations of the parameters. It was found that single output networks did not improve the results. We therefore classified all parameters simultaneously.
In this section we report the results from our parameter determination. In
order to appreciate the results one has to realize that the effects of
,
and [M/H] on a spectrum differ significantly in magnitude. The
strongest signal is that of temperature (the Planck function for black bodies).
The much weaker signal is that of
,
present in the width of spectral lines
but only weakly in the continuum. Metallicity is a very
weak signal visible in individual spectral lines or perhaps in broader opacity structures (such as G-band or molecular bands). However, in a DISPI essentially all line structure is washed out. These general aspects can also be found in Gray (1992). For more see Sect. 7.
![]() |
Figure 7:
The average classification uncertainty for
![]() ![]() ![]() |
Open with DEXTER |
The errors given are the average errors as in Eq. (1).
was classified in terms of log
but for better understanding, the resulting errors were transformed to give the fractional error in
.
These fractional errors are stated throughout this paper.
The networks had the same overall topology for all tests, though the number of
inputs naturally was three larger for the case with UV information.
We did not tune the networks to the very best
performance possible. Tests showed that results in individual
-ranges
could be improved through adjustments of several free parameters in the
networks (e.g. the number of hidden nodes, number of iterations etc.). In some
cases, we had to adjust some of these parameters, for example in the cases that
the networks did not converge properly. However, such individual tests are very
time-consuming. For the purpose of this paper, we used one topology.
Figure 7 shows the classification results for the stellar
parameters
,
and [M/H] in case of no extinction, while
Fig. 8 presents the results for simulations with extinction
included. For each plot, the left column shows the results when UV data were
included, while the right column refers to the cases without additional UVdata. The upper magnitude scale shows the visual magnitudes for a single
detection, whereas the lower magnitude scale is relevant for a sum of 100
DISPIs, representative of end-of-mission quality data (see Fig. 6).
As a comparison, we tested the performances of random, i.e. untrained, networks. These are presented in Table 3. If trained networks give parameters with uncertainites larger than the values listed here, those parameter values are not meaningful. The corresponding error for E(B-V) was about 0.25 mag for all temperature ranges and magnitudes.
Param. | AL1* | AL2 | AL3 | AM1 | AM2 | AH1 | AH2 |
![]() |
0.15 | 0.1 | 0.07 | 0.06 | 0.05 | 0.15 | 0.17 |
![]() |
1.9 | 1.4 | 1.3 | 1.0 | 0.8 | 0.8 | 0.6 |
1.2 | 1.8 | 1.8 | - | - | - | - |
A variation in [M/H] changes a DISPI only very subtly for higher temperature stars due to the simple fact that metallicity features are very weak in the energy distribution of hotter stars.
As can be seen from Figs. 7 and 8 our results show
interesting trends related with the temperature of the stars. In general,
temperature is the dominating factor for the shape of and even the details in
a spectral energy distribution. Overall,
can be retrieved to very
acceptable accuracy, even without additional UV information: the classification
is better than 10% even for very faint stars (V=14 mag, no extinction
included: through this section the V magnitude refers to end-of-mission quality data).
Including UV fluxes improves the
parameter results most noticeably for
hot stars (
> 9000 K) and here especially for the fainter ones. For
example, in case of no extinction, the error for DISPIs with V=14 mag in the
range 12 000-20 000 K (H1 sample) drops from 5% to about
3% when the UV data are included. When extinction is included, the temperature error drops from 9% to about 4% for this temperature range at this
magnitude. Though the information in the short
wavelength range is already available in the DISPI, the higher sensitivity of the UV telescope obviously contributes essential information.
![]() |
Figure 8: The same as in Fig. 7 but with extinction included. The left column shows the results for networks which were trained with UV data while the right column shows the classification results for tests without UV data. |
Open with DEXTER |
Concerning
we see from the figures that the classification performance
can be improved when additional UV information is included. For example, in case of no extinction and for temperatures in the range 8000 K
< 10 000 K (M1) and visual magnitude V=14, the error in
reduces from about 0.4 to 0.15 dex with additional UV information. At this magnitude but for temperatures in the range 6000 K
< 8000 K (L3), the error reduces considerably from about 0.85 dex to 0.15 dex. These results emphasize the benefit of UVtelescope data in the classification process. In general, the
results are poorer for temperatures in the range
4000 K
< 6000 K (L2 sample) when compared to other
ranges. This is understandable since in this temperature range hardly
any atomic/molecular signatures sensitive to the density (and thus
)
of the
gases in a stellar atmosphere are present. In contrast, for the very
low temperature ranges the numerous mostly molecular spectral
features provide the information about the density of the atmospheric
gases. For the higher temperatures the Balmer jump provides still
gravity information.
Metallicity is the most difficult parameter to derive from DISPIs. This was to
be expected: in data with such low spectral resolution all details of
spectral line information, and thus of metal abundances, are lost. In Figs. 7 and 8 only the classification results for
metallicities in the range -0.3 dex to 1.0 dex are shown. This is due to a lack of input data in the metallicity range from -2.5 dex to -0.3 dex (see above). Very low metallicities ([M/H]
-2.5 dex) make only a very small imprint on a DISPI such that the classification almost fails completely at
these resolutions, except for very bright, cool stars (
K).
The results for metallicities in the analysed metallicity range are reasonably good even for
low magnitudes (better than about 0.3 dex for
mag, no extinction).
When extinction is included, the metallicity performance declines considerably, except for the very cool objects (
< 4000 K, L1 sample) the
metallicity of which can be determined to better than 0.3 dex for all simulated
magnitudes.
For DISPIs in the temperature range 6000 K
< 7000 K (L3) we see that the error can be reduced from about 0.45 dex to 0.3 dex (no extinction) and from about 0.8 dex to 0.6 dex (with extinction) when the UV information is included.
Extinction is not easy to be retrieved solely from the shape of a
DISPI, since its overall effect mimics, to some extent, that of
temperature. However, as can be seen from the figures, the
determination of extinction improves when the UV information is
included. This was to be expected since the UV data gives the
strength of the Balmer jump, giving
information unblurred by
extinction.
One may ask whether the accuracy of the parameters derived would be
better when working with the single orders of the DIVA dispersed
image. We have tested this for a limited number of parameter
combinations. We generally found that for brighter objects ( mag) the accuracy is improved when using the single orders and then only by a small amount (in the range 30 to 80 effective pixels). This might
be surprising at first but it is understandable since for fainter
objects, using the signal in a single order, deteriorates the S/N, so we find a poorer result from the parameter extraction routine.
Apart from these results, to obtain the separate order's signals would
require a deconconvolution of the DISPI which in itself leads to
increased uncertainty in the intensities. Moreover, a deconvolution
requires knowledge of the nature of the objects so there is no
guarantee that this process will yield unique results. Since
DIVA will not have separate order images we have not pursued this
aspect further.
Several neural network approaches to stellar parametrization have been reported in astronomy. A comparison with those would have to address at least two aspects, such as nature of the type of network and characteristics of the data used. The networks may indeed be very different in structure such as learning and regularization technique and especially topology (compare e.g. Weaver & Torres-Dodgen 1995; Snider et al. 2001 and this work). But because in all these cases the data were of very different nature (e.g. different resolutions and number of input flux bins), a general comparison of the results is not really possible. A few remarks can be made nevertheless.
Projects to obtain MK classifications normally use data in the wavelength range
from 3800 to 5200 Å with high spectral resolution of about 2 to 3 Å (see
e.g. Bailer-Jones et al. 1998).
Weaver & Torres-Dodgen (1995) and Weaver & Torres-Dodgen (1997) classified stars in terms of MK classes in the visual to near-infrared wavelength range 5800-8900 Å with a resolution of 15 Å. However, even the resolution of these spectra is still much better (by a factor of approx. three) than the "best" one of DISPIs which is about 40 Å in the low efficiency third order.
We would expect such resolutions to give better precision for spectral type or
as well as for line sensitive parameters (
and [M/H]) than with DISPIs on account of the higher resolution. Snider et al. (2001) recently classified spectra having 1 to 2 Å resolution. They determined
,
and [M/H] of low metallicity stars to an accuracy of about 150 K in
in the range 4250 K <
< 6500 K, 0.30 dex in
over the
range
dex and 0.20 dex in [M/H] for
[M/H]
dex.
From our results, we find for this temperature range (L2 and L3 sample) a classification precision in
of better than 5% for
mag (no extinction) without UV information and about 2% when UV data are included. Only for brighter stars (
mag) do we find that
can be determined from DISPIs to better than 0.3 dex for temperatures in the range 4000 K
< 6000 K but only when UV data are included. Concerning metallicity, our results are comparable (better than 0.2 dex for visual magnitudes
mag, no extinction, UV data included). Clearly, this is because we have only used metallicities in the range from -0.3 dex to +1 dex (see above).
A comparison with the neural network approach using synthetic data for a test of possible GAIA photometric systems (Bailer-Jones 2000) may be of relevance. Bailer-Jones also used input data with various moderate resolutions, some of them similar to those of the spectral orders in DISPIs. The effects of the quantum efficiency (QE) of the detectors was not included, so that in his tests the information provided in the vicinity of (and shortward of) the Balmer jump could be utilized in full. After shifting our results to the fainter magnitudes reachable with GAIAs larger telescope, while considering the other differences between Bailer-Jones' and our investigation as well as the differences between the DIVA and GAIA optics and data format, one must conclude that these ANN analyses work to similar satisfaction.
Little work has been done so far concerning the automated determination of
interstellar extinction. Weaver & Torres-Dodgen (1995) tested the effect of extinction and found
that E(B-V) could be determined from spectra of A type stars with an accuracy of
0.05 mag in the range of E(B-V) of 0-1.5 mag. Gulati et al. (1997) used IUE
low-dispersion spectra (wavelength range 1153-3201 Å, spectral resolution
6 Å) from O and B stars. Applying reddening to their spectra in the E(B-V) range of 0.05-0.95 mag in steps of 0.05 mag, they were able to retrieve
extinction with high accuracy to about 0.08 mag, clearly because of the
presence of the 2200 Å bump in the input data. From Fig. 8 we see that extinction can be determined from DISPIs to better than 0.08 mag for visual magnitudes
mag for all temperatures in case that UV data are included.
Concerning the DIVA satellite, Elsner et al. (1999) found interesting results with Minimum Distance Methods. However, the optical concept of DIVA has changed considerably since then. Thus, also here a comparison of the quality may lead to a skewed judgement and we therefore refrain from going into detail.
A final remark deals with the effect of selecting our training sample randomly from the database. A random selection may accidentally lead to larger regions of parameter space without training data. Since our object ("application") sample is the complement of the training set, objects falling in those gaps clearly are classified worse than objects near trained points. Malyuto (2002) demonstrated such effects when using Minimum Distance Methods. The average errors in our results are influenced by such effects, but this was not investigated.
In considering the accuracies obtained with our ANN approach we have to note that real stellar spectra show a much more complex behaviour than synthetic ones. For example, even the more realistic approach of non-LTE models (Hauschildt et al. 1999) cannot properly describe the true behaviour of elements in a stellar atmosphere (see e.g. Gray 1992, Chap. 13). Moreover, good colour calibrations in accord with observed data are still difficult to obtain, as described e.g. in Westera et al. (2002). It is difficult to estimate how to properly weigh such intrinsic inconsistencies with respect to the final performance of the DIVA satellite. The effect of such cosmic scatter probably is that the final performance of DIVA might be less accurate than our results from these ideal synthetic spectra, or, that the accuracy curves of Figs. 7 and 8 are to be shifted somewhat to brighter magnitudes.
We argued that additional UV data can improve the parameter results considerably in the classification process. The spectral library of our present simulations does not include, however, changes in, e.g., alpha-process elements which can show up in changes also in the range of DIVA's UV channels (e.g. the CN violet system in the range from 385 to 422 nm).
The conclusions of the discussion are
1) The ANN method is well suited to obtain astrophysical parameters
from DIVA DISPIs.
2) The accuracy obtained is related with the strength of the signal of each
parameter as present in a DISPI:
is best, followed by
,
and then E(B-V) and [M/H].
3) The accuracy is clearly related with temperature: toward higher
temperature the signal of both
and [M/H] decreases considerably.
4) Our results were obtained with synthetic spectra. Real stars will not
all behave like text book objects and the classification coming from real data
will necessarily be less good, albeit always to an unknown amount per star.
5) The classification quality is absolutely adequate to be able to select
objects of desired characteristics from the final DIVA database
to do statistical analyses and/or
for efficient post mission type-related investigations.
Note added in proofs: The German Aerospace Center (DLR) informed the DIVA project mid Feb. 2003, that the DIVA mission had to be cancelled due to financial problems. Nonetheless, many of the ideas and techniques presented within this article are of relevance to other future (spectro)photometric projects, such as the GAIA astrometry mission (Perryman et al. 2001) of the European Space Agency (ESA).
Acknowledgements
This project is carried out in preparation for the DIVA mission and we thank the DLR for financial support (Projectnr. 50QD0103). We thank Martin Altmann, Uli Bastian, Michael Hilker, Thibault Lejeune, Valeri Malyuto and Klaus Reif for helpful discussions. We also thank Oliver Cordes, Ole Marggraf and Sven Helmer for assistance with the computer system level support.