
Re: [Orekit Users] Test failure



Hi Walter,

Walter Grossman <w.grossman@ieee.org> wrote:

I am a newbie to Orekit.  I ran the tests and got a "near-miss" failure.
I resolved it by relaxing the precision.  How do I know if I am OK?



OrbitDeterminationTest.testW3B:384 expected:<0.687998> but was:<0.6880143632396981>

I found this line:

  Assert.assertEquals(0.687998, covariances.getEntry(6, 6), 1.0e-5);


Is the problem that the acceptance criterion is too tight?  Why?

The test tolerance is intentionally extremely small; see below for the
rationale behind this stringent choice. The test should, however,
succeed with the current settings. Could you tell us which version of
Orekit you use (development version from the git repository, or a
released version?) and which Java environment (OS, JVM version,
processor)?

Some tests in Orekit are built in several stages. First, the test is
created without any threshold and only outputs its results, which the
developer compares with whatever is available to gain confidence in
them. This may be runs of other reference programs if available, it may
be another independent implementation using different algorithms, or it
may be a sensitivity analysis with the program under test itself. This
validation phase may be quite long.

Once the developers are convinced the implementation is good, they run
the test one last time and register its output as the reference values,
with a stringent threshold, in order to turn the test into a
non-regression test. The threshold is therefore not an indication that
the results are very good; it is only a way for us to ensure that any
change in the code that affects this part will break the test and force
developers to look at this code again and decide what to do. They can
decide that the changes that broke the test are valid and only altered
the results in an acceptable way (sometimes even improving them), in
which case they change either the reference value or the threshold. Or
they can decide that the changes in fact triggered something unexpected
and that they should fix their new code so the test passes again without
touching it. In summary, thresholds for non-regression tests are small
so that they act as a fuse: people notice when it blows and can decide
what to do.
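As a concrete sketch of these two stages (with made-up values and a
hypothetical runEstimation() helper standing in for the real orbit
determination run, not the actual Orekit test code), it could look like
this in JUnit:

  import org.junit.Assert;
  import org.junit.Test;

  public class CovarianceNonRegressionTest {

      @Test
      public void testCovarianceEntry() {
          // Stage 1 (validation phase): no threshold yet, just print the
          // value so it can be compared with reference programs,
          // independent implementations, or sensitivity analyses.
          double entry = runEstimation();
          System.out.println("covariance(6, 6) = " + entry);

          // Stage 2 (non-regression phase): once the value is trusted,
          // it is frozen as the reference with a stringent tolerance, so
          // any later code change affecting this computation blows the
          // "fuse" and forces a conscious decision.
          Assert.assertEquals(0.687998, entry, 1.0e-5);
      }

      // Hypothetical stand-in for the real estimation run.
      private double runEstimation() {
          return 0.6879981234; // made-up value for illustration
      }
  }

During the validation phase only the printout matters; the assertion is
added afterwards, with the printed value frozen as the reference.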

best regards,
Luc