source: trunk/docs/hints.tex @ 993

Last change on this file since 993 was 993, checked in by MatthewWhiting, 12 years ago

Minor updates to documentation

File size: 7.1 KB
Line 
1% -----------------------------------------------------------------------
2% hints.tex: Section giving some tips & hints on how Duchamp is best
3%            used.
4% -----------------------------------------------------------------------
5% Copyright (C) 2006, Matthew Whiting, ATNF
6%
7% This program is free software; you can redistribute it and/or modify it
8% under the terms of the GNU General Public License as published by the
9% Free Software Foundation; either version 2 of the License, or (at your
10% option) any later version.
11%
12% Duchamp is distributed in the hope that it will be useful, but WITHOUT
13% ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
14% FITNESS FOR A PARTICULAR PURPOSE.  See the GNU General Public License
15% for more details.
16%
17% You should have received a copy of the GNU General Public License
18% along with Duchamp; if not, write to the Free Software Foundation,
19% Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307, USA
20%
21% Correspondence concerning Duchamp may be directed to:
22%    Internet email: Matthew.Whiting [at] atnf.csiro.au
23%    Postal address: Dr. Matthew Whiting
24%                    Australia Telescope National Facility, CSIRO
25%                    PO Box 76
26%                    Epping NSW 1710
27%                    AUSTRALIA
28% -----------------------------------------------------------------------
29\secA{Notes and hints on the use of \duchamp}
30\label{sec-notes}
31
32In using \duchamp, the user has to make a number of decisions about
33the way the program runs. This section is designed to give the user
34some idea about what to choose.
35
36\secB{Preprocessing}
37
38The main choice is whether to alter the cube to try and enhance the
39detectability of objects, by either smoothing or reconstructing via
40the \atrous method. The main benefits of both methods are the marked
41reduction in the noise level, leading to regularly-shaped detections,
42and good reliability for faint sources.
43
44The main drawback with the \atrous method is the long execution time:
45to reconstruct a $170\times160\times1024$ (\hipass) cube often
46requires three iterations and takes about 20-25 minutes to run
47completely. Note that this is for the more complete three-dimensional
48reconstruction: using \texttt{reconDim = 1} makes the reconstruction
49quicker (the full program then takes less than 5 minutes), but it is
50still the largest part of the time.
51
52The smoothing procedure is computationally simpler, and thus quicker,
53than the reconstruction. The spectral Hanning method adds only a very
54small overhead on the execution, and the spatial Gaussian method,
55while taking longer, will be done (for the above example) in less than
562 minutes. Note that these times will depend on the size of the
57filter/kernel used: a larger filter means more calculations.
58
59The searching part of the procedure is much quicker: searching an
60un-reconstructed cube leads to execution times of less than a
61minute. Alternatively, using the ability to read in previously-saved
62reconstructed arrays makes running the reconstruction more than once a
63more feasible prospect.
64
65On the positive side, the shape of the detections in a cube that has
66been reconstructed or smoothed will be much more regular and smooth --
67the ragged edges that objects in the raw cube possess are smoothed by
68the removal of most of the noise. This enables better determination of
69the shapes and characteristics of objects.
70
71While the time overhead is larger for the reconstruction case, it will
72potentially provide a better recovery of real sources than the
73smoothing case. This is because it probes the full range of scales
74present in the cube (or spectral domain), rather than the specific
75scale determined by the Hanning filter or Gaussian kernel used in the
76smoothing.
77
78When considering the reconstruction method, note that the 2D
79reconstruction (\texttt{reconDim = 2}) can be susceptible to edge
80effects. If the valid area in the cube (\ie the part that is not
81BLANK) has non-rectangular edges, the convolution can produce
82artefacts in the reconstruction that mimic the edges and can lead
83(depending on the selection threshold) to some spurious
84sources. Caution is advised with such data -- the user is advised to
85check carefully the reconstructed cube for the presence of such
86artefacts. Note, however, that the 1- and 3-dimensional
87reconstructions are \emph{not} susceptible in the same way, since the
88spectral direction does not generally exhibit these BLANK edges, and
89so we recommend the use of either of these.
90
91{\bf *** 3D  ONE MAY BE DIFFERENT}
92
93If one chooses the reconstruction method, a further decision is
94required on the signal-to-noise cutoff used in determining acceptable
95wavelet coefficients. A larger value will remove more noise from the
96cube, at the expense of losing fainter sources, while a smaller value
97will include more noise, which may produce spurious detections, but
98will be more sensitive to faint sources. Values of less than about
99$3\sigma$ tend to not reduce the noise a great deal and can lead to
100many spurious sources (this depends, of course on the cube itself).
101
102The smoothing options have less parameters to consider: basically just
103the size of the smoothing function or kernel. Spectrally smoothing
104with a Hanning filter of width 3 (the smallest possible) is very
105efficient at removing spurious one-channel objects that may result
106just from statistical fluctuations of the noise. One may want to use
107larger widths or kernels of larger size to look for features of a
108particular scale in the cube.
109
110When it comes to searching, the FDR method produces more reliable
111results than simple sigma-clipping, particularly in the absence of
112reconstruction.  However, it does not work in exactly the way one
113would expect for a given value of \texttt{alpha}. For instance,
114setting fairly liberal values of \texttt{alpha} (say, 0.1) will often
115lead to a much smaller fraction of false detections (\ie much less
116than 10\%). This is the effect of the merging algorithms, that combine
117the sources after the detection stage, and reject detections not
118meeting the minimum pixel or channel requirements.  It is thus better
119to aim for larger \texttt{alpha} values than those derived from a
120straight conversion of the desired false detection rate.
121
122If the FDR method is not used, caution is required when choosing the
123S/N cutoff. Typical cubes have very large numbers of pixels, so even
124an apparently large cutoff will still result in a not-insignificant
125number of detections simply due to random fluctuations of the noise
126background. For instance, a $4\sigma$ threshold on a cube of Gaussian
127noise of size $100\times100\times1024$ will result in $\sim340$
128single-pixel detections. This is where the minimum channel and pixel
129requirements are important in rejecting spurious detections.
130
131%Finally, as \duchamp is still undergoing development, there are some
132%elements that are not fully developed. In particular, it is not as
133%clever as I would like at avoiding interference. The ability to place
134%requirements on the minimum number of channels and pixels partially
135%circumvents this problem, but work is being done to make \duchamp
136%smarter at rejecting signals that are clearly (to a human eye at
137%least) interference. See the following section for further
138%improvements that are planned.
Note: See TracBrowser for help on using the repository browser.