Timing Analysis Across Multiple Observations
Using merged data sets.
Timing Analysis with Combined Observations
As in the spectral case, in general, merged observations should not be used for timing analysis. Different pointings, roll angles, bad pixels, and other instrumental effects can lead to incorrect count rates. However, in conditions similar to those described in Caveat: Extracting Spectra from Merged Datasets, "reasonable" results can be obtained when extracting a light curve.
In the following examples, we use three pointings of the "Cartwheel" ring galaxy (ObsIDs 2019, 9531, 9807), spanning a ~7 year period, to study the point source 'ULX N10' (\(\alpha = \mathrm{00^{h}37^{m}39.38^{s}}, \delta = \mathrm{-33^{\circ} 43' 23.08''}\); A&A 426 787, MNRAS 406 1116). It corresponds to the Chandra Source Catalog master source 2CXO J003739.3-334323, whose catalog regions we use for the source and background extraction regions.
unix% cat ULX-N10.wcs.reg
ellipse(0:37:39.3936,-33:43:23.517,0.02693',0.02205',100.05401)

unix% cat ULX-N10.bkg.wcs.reg
ellipse(0:37:39.3929,-33:43:23.522,0.13467',0.11023',100.05404)
-ellipse(0:37:39.3929,-33:43:23.522,0.02963',0.02425',100.05404)
Approach I:
Time order matters when combining separate light curve files.
For example:
unix% set tbin=2592.8
unix% set lc_bin="bin time=::${tbin}"
unix% set src="sky=region(ULX-N10.wcs.reg)"
unix% set bkg="sky=region(ULX-N10.bkg.wcs.reg)"

unix% dmextract infile="cartwheel_2019_reproj_evt.fits[${src}][${lc_bin}]" \
?    outfile=2019_${tbin}s.lc bkg="cartwheel_2019_reproj_evt.fits[${bkg}]" opt=ltc1 clobber+

unix% dmextract infile="cartwheel_9531_reproj_evt.fits[${src}][${lc_bin}]" \
?    outfile=9531_${tbin}s.lc bkg="cartwheel_9531_reproj_evt.fits[${bkg}]" opt=ltc1 clobber+

unix% dmextract infile="cartwheel_9807_reproj_evt.fits[${src}][${lc_bin}]" \
?    outfile=9807_${tbin}s.lc bkg="cartwheel_9807_reproj_evt.fits[${bkg}]" opt=ltc1 clobber+

unix% dmmerge infile=2019_${tbin}s.lc,9531_${tbin}s.lc,9807_${tbin}s.lc \
?    outfile=lightcurve_merged_${tbin}s.lc clobber+

unix% dmmerge infile=9807_${tbin}s.lc,9531_${tbin}s.lc,2019_${tbin}s.lc \
?    outfile=lightcurve_merged-timereverse_${tbin}s.lc clobber+

unix% dmmerge infile=9531_${tbin}s.lc,2019_${tbin}s.lc,9807_${tbin}s.lc \
?    outfile=lightcurve_merged-timedisordered_${tbin}s.lc clobber+
For plotting purposes, the order does not matter if the light curve is drawn as an unconnected scatter plot. For downstream processing, however, time order matters for many tools: if the software does not sort internally, a disordered light curve can cause a runtime failure.
[Figure: Compare the Effect of Merging Order on Light Curve Files]
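Before handing a merged light curve to downstream software, it is straightforward to confirm, and if necessary restore, time ordering. Below is a minimal Python sketch using astropy and numpy; it assumes the dmextract ltc1 output contains a TIME column (check your file with dmlist).

import numpy as np
from astropy.table import Table

# Read the binned light curve (the first table extension of the FITS file).
lc = Table.read("lightcurve_merged-timedisordered_2592.8s.lc", format="fits")
time = np.asarray(lc["TIME"])

if np.all(np.diff(time) >= 0):
    print("light curve is already time-ordered")
else:
    lc.sort("TIME")   # re-order the rows in place on the TIME column
    lc.write("lightcurve_merged_sorted_2592.8s.lc", format="fits", overwrite=True)

The CIAO tool dmsort offers the same capability natively (see the sketch under "Other Considerations" below).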
Approach II (not recommended):
Extracting a light curve from a merged event file, even one combined without concern for time order, will still generate a properly time-ordered light curve.
For example:
unix% set filter="[cols time,expno,ccd_id,node_id,chip,tdet,det,sky, \
?    pha,pha_ro,energy,pi,fltgrade,grade,status][subspace -expno,-phas]"
unix% set tbin=2592.8
unix% set lc_bin="bin time=::${tbin}"
unix% set src="sky=region(ULX-N10.wcs.reg)"
unix% set bkg="sky=region(ULX-N10.bkg.wcs.reg)"

unix% dmmerge infile="cartwheel_2019_reproj_evt.fits${filter}, \
?    cartwheel_9531_reproj_evt.fits${filter}, \
?    cartwheel_9807_reproj_evt.fits${filter}" \
?    outfile=cartwheel_merged_evt.fits clobber+

unix% dmextract infile="cartwheel_merged_evt.fits[${src}][${lc_bin}]" \
?    outfile=merged_${tbin}s.lc bkg="cartwheel_merged_evt.fits[${bkg}]" opt=ltc1 clobber+

unix% dmmerge infile="cartwheel_9807_reproj_evt.fits${filter}, \
?    cartwheel_9531_reproj_evt.fits${filter}, \
?    cartwheel_2019_reproj_evt.fits${filter}" \
?    outfile=cartwheel_timereverse_merged_evt.fits clobber+

unix% dmextract infile="cartwheel_timereverse_merged_evt.fits[${src}][${lc_bin}]" \
?    outfile=merged_timereverse_${tbin}s.lc \
?    bkg="cartwheel_timereverse_merged_evt.fits[${bkg}]" opt=ltc1 clobber+

unix% dmmerge infile="cartwheel_9531_reproj_evt.fits${filter}, \
?    cartwheel_2019_reproj_evt.fits${filter}, \
?    cartwheel_9807_reproj_evt.fits${filter}" \
?    outfile=cartwheel_time-disordered_merged_evt.fits clobber+

unix% dmextract infile="cartwheel_time-disordered_merged_evt.fits[${src}][${lc_bin}]" \
?    outfile=merged_time-disordered_${tbin}s.lc \
?    bkg="cartwheel_time-disordered_merged_evt.fits[${bkg}]" opt=ltc1 clobber+
The merging order of the event files does not affect the resulting extracted light curve, since dmextract sorts the events by time during extraction. However, the large time gaps between observations are retained in the light curves, since the data set is binned uniformly in time.
[Figure: Compare the Effect of Merging Order on Event Files and Extracted Light Curve]
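The claimed order-independence can also be verified numerically by comparing the extracted light curves column by column. A short astropy sketch, again assuming the ltc1 TIME and COUNT_RATE column names:

import numpy as np
from astropy.table import Table

ref = Table.read("merged_2592.8s.lc", format="fits")
for fname in ("merged_timereverse_2592.8s.lc", "merged_time-disordered_2592.8s.lc"):
    other = Table.read(fname, format="fits")
    same = (np.allclose(ref["TIME"], other["TIME"]) and
            np.allclose(ref["COUNT_RATE"], other["COUNT_RATE"]))
    print(fname, "matches" if same else "differs")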
If there are large time gaps, then extracting the light curve from a merged event file will produce a significantly larger output file, since time bins containing zero counts must also be written.
unix% ls -alh merged*2592.8s*.lc
-rw-r--r-- 1 user group 14M Feb 17  2023 merged_2592.8s.lc
-rw-r--r-- 1 user group 14M Feb 17  2023 merged_time-disordered_2592.8s.lc
-rw-r--r-- 1 user group 14M Feb 26 14:11 merged_timereverse_2592.8s.lc

unix% ls -alh lightcurve*2592.8s*.lc
-rw-r--r-- 1 user group 155K Feb 26 16:41 lightcurve_merged_2592.8s.lc
-rw-r--r-- 1 user group 155K Feb 26 16:41 lightcurve_merged-timedisordered_2592.8s.lc
-rw-r--r-- 1 user group 155K Feb 26 16:41 lightcurve_merged-timereverse_2592.8s.lc
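If file size is a concern, the empty inter-observation bins can be stripped after extraction. A sketch with astropy, assuming the ltc1 EXPOSURE column is zero within the gaps; note that this discards the uniform bin spacing that some downstream tools expect:

import numpy as np
from astropy.table import Table

lc = Table.read("merged_2592.8s.lc", format="fits")
keep = np.asarray(lc["EXPOSURE"]) > 0     # bins with zero exposure lie in the gaps
lc[keep].write("merged_2592.8s_trimmed.lc", format="fits", overwrite=True)
print(f"kept {keep.sum()} of {len(keep)} bins")

The same row filter can be applied natively with the Data Model, e.g. dmcopy "merged_2592.8s.lc[EXPOSURE>0]" merged_trimmed.lc.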
The general suggestion is to follow Approach I and handle time ordering when merging the light curve files. The minor hassle of doing so early on avoids possible downstream headaches while also minimizing the resulting file sizes.
Other Considerations
- Time ordering matters when running glvary with a merged event file as input; otherwise the tool will either hang or run until exiting with a segmentation fault.

That said, while the tool may 'mechanically' run with a time-ordered event file, it is unlikely to provide good results. In principle, the Gregory-Loredo algorithm can handle large gaps in a time series, but its implementation in glvary was intended to look for intra-observation variability, not inter-observation variability. Since the algorithm works by dividing the entire time range in half, then thirds, quarters, etc., and performing the Bayesian analysis on those time bins, a large time gap between observations makes it likely that all the events of an individual observation will end up in a single G-L time bin, i.e. one bin per ObsID, unless the observations are very closely spaced in time; the sketch after this item illustrates the effect.
Quirks specific to glvary that may be problematic for inter-observation applications include:
- It forces all the time bins to have non-zero flux, which is why the G-L light curve never goes to zero, even for a single ObsID.
- It does not account for changes in the spectral response/effective area over long time baselines, or for mixing events from front- and back-illuminated CCDs, which can result in erroneously finding variability for a constant source.
- It does not consider the background, so changes in the background level can cause the tool to erroneously report variability even if background flares have been removed from a quiescent field.
The upshot: we do not recommend using glvary on merged event files.
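To see why the gaps defeat the binning scheme, consider a purely synthetic sketch (all numbers are made up for illustration): three ~70 ks observations spread over ~7 years, divided into progressively finer equal-width bins, as in the glvary scheme described above.

import numpy as np

rng = np.random.default_rng(42)
starts = np.array([0.0, 1.1e8, 2.2e8])      # observation start times [s]
events = np.concatenate([t0 + rng.uniform(0, 7e4, 500) for t0 in starts])

tmin, tmax = events.min(), events.max()
for m in (2, 4, 16, 256, 1024):             # number of equal-width time bins
    counts, _ = np.histogram(events, bins=np.linspace(tmin, tmax, m + 1))
    print(f"m={m:5d}: {np.count_nonzero(counts)} of {m} bins occupied")

Even at m=1024 the bin width (~2.1e5 s) still exceeds each observation's length, so only a handful of bins are ever occupied: essentially one or two per ObsID. The analysis is then driven by the gaps rather than by any intra-observation variability.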
- Time order matters for a merged event file used as input to dither_region; the merged file can be time-sorted first, as in the sketch below.
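A merged event file can be time-sorted before being passed to such tools. A sketch using the CIAO dmsort tool through the ciao_contrib.runtool wrappers (check parameter usage against ahelp dmsort):

from ciao_contrib.runtool import dmsort

# Sort the merged event list on the TIME column so that glvary,
# dither_region, and similar tools receive time-ordered input.
dmsort.punlearn()
dmsort(infile="cartwheel_time-disordered_merged_evt.fits",
       outfile="cartwheel_timesorted_merged_evt.fits",
       keys="TIME", clobber=True)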
General Caveats
- Depending on the nature of the analysis, the drop in effective area over the life of the mission may need to be taken into account, since it introduces an intrinsic dimming into the light curve. From the discussion in the "Precise, Flux-Calibrated Lightcurve" entry of the "Timing Analysis with Lightcurves" Why page:
The most dependable method of calculating fluxes is to model the response-folded source spectrum. This will give an intrinsic flux [ergs/cm2/s], instead of a simple count rate [count/s]. Moreover, one may plot source spectral parameters (the slope of a power law, line FWHM, etc.) versus time for a more detailed study of the nature of the source variation. The Phase-binning a Spectrum thread shows how to extract the data.
In general, creating a flux-calibrated lightcurve is uncomplicated, but time-consuming. First, extract spectra for the times of interest; one spectrum is needed for every datapoint in the final lightcurve. Each spectrum is then fit with a source model and the integrated flux is calculated for a given energy range. Each flux value will be calibrated and based on the appropriate GTIs. Finally, the flux value for each of the fits can be manually written to a file for plotting; the file should have two columns—time and flux (the best-fit parameter value).
Generally speaking, given the response variation between CCDs [in a detector array], for time series analysis you ideally want events from all observations to fall on the same CCD, so that the per-chip GTI blocks are applied consistently to the merged data set. The instrument configuration should also be the same (mixing grating and non-grating observations is bad, since their effective areas differ radically), and there are additional quirks if different detector frame times (EXPTIME) are used. Care must be taken.
Tip: To properly account for changes in the detector, ensure the data sets are (a) reprocessed with a common CalDB version (with chandra_repro), and (b) analyzed with srcflux run on each observation individually to obtain per-observation fluxes. The COUNTS and COUNT_RATE values for each observation are otherwise meaningless for this comparison, since they are direct detector measurements made with different CCDs and/or instrument configurations.
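A sketch of step (b), looping srcflux over the individual reprocessed event files via the ciao_contrib.runtool wrapper; the position is ULX N10 from above, while the band choice and output naming are illustrative, so consult ahelp srcflux:

from ciao_contrib.runtool import srcflux

pos = "00:37:39.38,-33:43:23.08"            # ULX N10 (see above)
for obsid in ("2019", "9531", "9807"):
    srcflux.punlearn()
    srcflux(infile=f"cartwheel_{obsid}_reproj_evt.fits",
            pos=pos,
            outroot=f"ulx_n10_{obsid}",
            bands="broad",
            clobber=True)

The resulting per-observation fluxes, unlike COUNTS or COUNT_RATE, are directly comparable across the ~7 year baseline.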
- Timing analysis is challenging, if not impossible, when there are large time gaps between observations. However, if the source has a known periodicity, the light curves can be folded (i.e. the time axis transformed into phase) about a common reference epoch and then combined; alternatively, Lomb-Scargle periodograms (also known as least-squares spectral analysis) are designed for unevenly spaced data sets.
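For example, here is a minimal sketch of the periodogram-plus-folding approach using astropy; the event times, rates, period, and frequency grid are entirely synthetic and chosen only for illustration.

import numpy as np
from astropy.timeseries import LombScargle

rng = np.random.default_rng(7)
# Unevenly sampled rates from three ~70 ks windows separated by years.
t = np.concatenate([t0 + np.sort(rng.uniform(0, 7e4, 40))
                    for t0 in (0.0, 1.1e8, 2.2e8)])
p_true = 5000.0                              # synthetic period [s]
rate = (1.0 + 0.3 * np.sin(2 * np.pi * t / p_true)
        + rng.normal(0, 0.05, t.size))

# The Lomb-Scargle periodogram handles uneven sampling and gaps directly.
freq, power = LombScargle(t, rate).autopower(minimum_frequency=1e-5,
                                             maximum_frequency=1e-3)
p_best = 1.0 / freq[np.argmax(power)]
print(f"best-fit period: {p_best:.1f} s")

# Fold about a common reference epoch (here t0 = 0) to build a combined
# phase-binned light curve from all three observations.
phase = (t / p_best) % 1.0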