doFORC Help

doFORC:

Is a portable (standalone) application working on various operating systems, is made using only free libraries, and it is made freely available to the scientific community.

  • the user-easy to use graphical user interface (GUI) allows users to import the input data, set the fitting parameters, to graphical represent (2D, 3D, and projection) both input and output data, and to export the graphs to image files
  • allows the choice of one of the four implemented nonparametric regression procedures: LOESS and three modified Shepard methods further modified for noisy data. The procedures allow great flexibility because no assumptions about the parametric form of the regression surface are needed. Thus the users can try different methods using their data and select one (or more) that is best suited to their needs.
  • allows the use of different kernel functions
  • input data may have various formats, including the PMC MicroMag format for which the drift correction can be performed
  • allows the use of user weights associated with each data point, weights that indicates the precision of the information contained in the associated observation
  • removes points that are closer than some tolerance (duplicate or nearby points) from the input data
  • the input data can be cropped to ignore certain parts of the input points
  • allows the use of a scale factor for input data to change the shape of the neighborhood, considering the points lying on an ellipse centered at the given point to be equidistant from the given point. This feature is useful when the variables have different scales.
  • allows the standardization of the input data to change the shape of the neighborhood. This feature is useful when the variables have significantly different scales. The standardization is accomplished using Winsorized mean and standard deviation of each variable. Winsorized values are robust scale estimators in that extreme values of a variable are discarded (the smallest and largest 5% of the data) before estimating the data scaling.
  • robust smoothing allows to minimize the influence of outlying data (outliers) allowing the user to choose different robustness weight functions
  • output data can be provided at the user’s choice in:
    1. input points
    2. a regular grid in the $\left( {H_{{\mathrm{applied}}} \ge H_{{\mathrm{reversal}}},\; H_{{\mathrm{reversal}}}} \right)$ half-plane
    3. a regular grid in the $\left( {H_{{\rm{coercive}}} \ge 0,\; H_{{\rm{interaction}}}} \right)$ half-plane
    4. a general rectangular regular grid
    5. user points
  • output consist in:
    1. predicted smoothed values in the points from the input file, along with other information (residuals, smoothed residuals, histogram of the residuals, influences, confidence intervals for fit)
    2. requested derivatives in the output points
    3. requested statistics from (RSS, RSSm, RSE, DF1, DF2, DF3, $\delta _1$, $\delta _2$, $\rho$, AICC, AICC1, GCV)
  • performs statistical inference (deduces properties of data sets from a set of observations and hypotheses) provided that the error distribution satisfies some basic assumptions
  • in order to perform the diagnostics and goodness of fit doFORC compute the residuals to characterize the difference between the actual observed value and the predicted value, generalized cross-validation GCV to measure the predictive performance of the model, two information criteria AICC and AICC1 to quantify the information that is lost by using an approximate model on the available data, and three degrees of freedom DF1, DF2, DF3 to compare different amounts of smoothing being performed by different smoothing methods.
  • based on the above criteria doFORC can perform automatic smoothing parameter selection. Although the default method for selecting the smoothing parameter value is often satisfactory, it is often a good practice to examine how the fit varies with the smoothing parameter. In some cases, fits with different smoothing parameters might reveal important features of the data that cannot be discerned by looking at a fit with just a single "best" smoothing parameter.
  • there are several ways in which user ca control the sequence of fitting parameters (number of neighbors $nn$) examined:
    1. specifying a list of $nn$ values
      • if no criterion is specified then a separate fit is provided for each $nn$ value
      • if a criterion is specified then all values specified in $nn$ list are examined and the value that minimizes the specified criterion is selected
    2. specifying a range $\left( {lower,upper} \right)$ of $nn$ values examined, for which the golden section search method is used to find a local minimum of the specified criterion in the given range
  • provides test problems that consist of sets of data obtained using various known functions, over which a known normal (Gaussian) noise and a certain percentage of outliers are added. These test problems allow users to see the limits of each method, to observe any numerical artifacts. The test problems can also be used to test, asses the accuracy, and validate other FORC type (or two dimensional smoothing) software tools that exist in the scientific literature.

Main doFORC Interface

The main doFORC interface allows to:

  1. Configure the parameters for computation.
  2. Process the input data.
  3. [ Optional ] Graphically represent the input and output data, as well as statistics.
  4. [ Optional ] Export graphs as bmp, eps, jpg, pdf, png, svg or tiff files.


Configuration Interfaces

The configuration interfaces provide tools to:

  • set doFORC parameters for data processing by reading (and possibly modifying) an existing configuration file
  • set new parameters for data processing; optionally, the parameters can be saved in a configuration file
  • verify the correctness of the user parameters

For an existing configuration file:

  • blank lines and comments beginning with an exclamation point (!) are ignored
  • lines that do not comply with the 'keyword = value' format are ignored
  • line terminator (the character or sequence of characters that marks the end of a line of text) can be CR (usually Macintosh files), LF (usually Unix files), or CRLF (usually Windows files). All lines in a given file must have the same terminator.
  • the configuration files can also be created and/or modified using a plain text editor

The configuration file provided as an example contains extensive documentation in the form of comments.


Lite Configuration Interface

  • contains only the mandatory parameters
  • the optional parameters are set to the default values
  • lite configuration interface can be used to:
    • read a configuration file, reading only the mandatory parameters; any erroneous value in the configuration file is replaced with default value
    • set new parameters
    • create a new configuration file
    • the selected parameters can be further used for data processing even if they are not saved in a configuration file
  • input data are graphically represented as:
    • a plot of the magnetic moment vs. applied field $\left( z \;\mathrm{vs.}\; x \right)$
    • a contour plot of the magnetic moment vs. applied and reversal fields $\left( z \;\mathrm{vs.}\; x \;\mathrm{and}\; y \right)$
  • values entered by the user in the editable field are not read until the Enter key is pressed
  • "Save as," "Check config data," and "Plot input data" read all the displayed fields
  • any error or warning is signaled in both the interface and the command prompt
  • saving as a new configuration file is not possible if errors are present
  • if the interface is closed before all errors are solved, then data processing is not possible
  • "New config file" resets all the parameters to their default values


Main Configuration Interface

  • similar to Lite Configuration Interface, but additionally contains the optional parameters
  • initially all parameters are set to their default values
  • the displayed fields may change depending on the value selected from a list
  • the values entered by the user in the editable fields are not read until the Enter key is pressed


Command Prompt

The command prompt contains both the messages displayed in GUI and others in addition.

The error or warning messages from the numerical subroutines are displayed only in the command prompt.


Files and data saved by doFORC

doFORC save the processed data into new files whose names are obtained from the input_file by appending new strings before the extension. If files with the same name already exist, then they will be overwritten.

  1. input_file__smoothed_input contains the smoothed values in the points from the input file, along with other information, according to the table below:
    ColumnDescriptionRestriction
    h_app applied field (column $x$) N/A
    h_rev reversal field (column $y$) N/A
    m magnetic moment (column $z$) N/A
    diagL $\text{diag} \left( L \right) \equiv$ influence ihat ≥ 1 or istat ≥ 1
    confi confidence intervals ihat = 2 or istat = 2
    m_fit smoothed magnetic moment N/A
    res residuals N/A
    res_fit smoothed residuals smoothresidual = true
  2. input_file__smoothed_output contains the requested derivatives in the output_points saved as columns $\left( x,y,z \right)$

  3. input_file__smoothed_output_matrix_order_of_derivative_nn contains the requested derivatives in the output_points saved as matrices, one file for each value of order_of_derivative and/or nn

    Restriction: only for output_points = ha_hr_regular_grid, hc_hu_regular_grid, or rectangular_grid

  4. input_file__statistics contains the requested statistics according to ihat or istat

    If nn_list = true and a CRITERION is specified, or if nn_range = true, then the statistics for all examined nn values are saved, and the selected value that minimizes the specified CRITERION is on the last line.


Test Functions Generator

The pseudorandom number generator is initialized by the $seed$ number, so that each value of the seed generates other realization of the true values for the selected function:

  • if $seed=0$ then the program generates a different pseudorandom sequence each time it runs, using the system clock
  • if $seed\neq 0$ then each value of the $seed$ generates other sequence of random numbers, the same at every run


Screenshots