jfjoch_writer¶
jfjoch_writer is NeXus compliant HDF5 file writer.
Acknowledgements¶
Zdenek Matej (MAX IV)
Felix Engelmann (MAX IV) for testing and multiple improvement suggestions.
Running directory¶
Writer needs to be running in base directory for writing files - file_prefix will be always relative in regard to writer running directory. Writer detects and protects for basic security issues, like file_prefix starting with a slash, or starting with ../, or containing /../.
Usage¶
Writer needs to be started as a background service, with the following command:
jfjoch_writer {options} <address to connect via ZeroMQ to DCU>
Options:
-R<int> | --root_dir=<int> Root directory for file writing
-H<int> | --http_port=<int> HTTP port for statistics
-r<int> | --zmq_repub_port=<int> ZeroMQ port for PUSH socket to republish images
-f<int> | --zmq_file_port=<int> ZeroMQ port for PUB socket for notifications on finalized files
-w<int> | --rcv_watermark=<int> Receiving ZeroMQ socket watermark (default = 100)
-W<int> | --repub_watermark=<int> Republish ZeroMQ socket watermark (default = 1000)
for example:
jfjoch_writer -H5234 tcp://dcu-address:5400
HTTP interface¶
Writer has dedicated status interface via HTTP. It allows for two operations:
check state of the writer to check if the writer is properly synchronized with DCU (e.g., that
file_prefixagrees with what was set on the DCU) and monitor progress.cancel writing this will close all the HDF5 files being written and restart writer - the option should be used only if DCU process was terminated or disconnected, it SHOULD NOT be used as standard cancellation procedure (when DCU received cancel command it should properly finish writing as well)
Republish¶
Republish creates a PULL socket on the writer, where all the messages are republished for further use by data analysis pipeline. Republish is non-blocking, so if there is no receiver on other end or the sending queue is full - images won’t be republished. In case of START/END messages republishing will attempt sending for 100 ms, but if send times out it won’t be retried.
Republish functionality is optional, if republish port number is omitted this functionality is not enabled.
Finalized files information¶
Creates PUB socket to inform about finalized data files. For each closed file, the socket will send a JSON message, with the following structure:
{
"filename": <string>: HDF5 data file name (relative to writer root directory),
"nimages": <int> number of images in the file (counting from 1!),
"file_number": <int> number of file within the acquisition,
"sample_name": <string> name of sample,
"run_name": <string> name of run,
"run_number": <int> number of run,
"experiment_group": <string> number of p-group / proposal (optional),
"user_data": <any json> user_data,
"beam_x_pxl": <float> beam center (X) in pixels,
"beam_y_pxl": <float> beam center (Y) in pixels,
"detector_distance_m": <float> detector distance (X) in m,
"detector_height_pxl": <int> detector size (X) in pixels,
"detector_width_pxl": <int> detector size (Y) in pixels,
"incident_energy_eV": <float> photon energy of the X-ray beam,
"pixel_size_m": <float> pixel size in meter (assuming pixel X == Y),
"saturation": <int> this count and higher mean saturation,
"space_group_number": <int> space group number (optional),
"underload": <int> pixels with this count should be excluded,
"unit_cell": <optinal> unit cell dimensions in Angstrom/degree {
"a": <float>, "b": <float>, "c": <float>,
"alpha": <float>, "beta": <float>, "gamma": <float>
},
}
user_data is defined as header_appendix in the /start operation in the jfjoch_broker. Other metadata are also carried over from /start operation.
If the header_appendix is a string with valid JSON meaning, it will be embedded as JSON, otherwise it will be escaped as string. For example header_appendix of {"param1": "test1", "param2": ["test1", "test2"]}, than example message will look as follows:
{
"filename": "dataset_name_data_000001.h5",
"nimages": 1000,
"file_number": 0,
"sample_name": "lysozyme",
"run_name": "lyso_cryo",
"run_number": 25,
"experiment_group": "p00001",
"beam_x_pxl": 1200,
"beam_y_pxl": 1500,
"detector_distance_m": 0.155,
"detector_height_pxl": 2164,
"detector_width_pxl": 2068,
"image_time_s": 0.001,
"nimages": 2,
"incident_energy_eV": 12400.0,
"pixel_size_m": 7.5e-05,
"saturation": 32766,
"space_group_number": 96,
"underload": -32768,
"unit_cell": {
"a": 78.0,
"alpha": 90.0,
"b": 78.0,
"beta": 90.0,
"c": 39.0,
"gamma": 90.0
},
"user_data": {
"param1": "test1",
"param2": ["test1", "test2"]
}
}
Notifications for finalized files are optional, if notification port number is omitted this functionality is not enabled.
HDF5 file structure¶
Jungfraujoch aims to generate files compliant with NXmx format, as well as make them as close as possible to files written by DECTRIS Filewriter. This ensures the file compatibility of Neggia and Durin XDS plugins, as well as Albula viewer.
Master file¶
There are custom extension to NXmx format. These will be documented in the future.
Specifically, if data collection was configured with header_appendix having key equal to hdf5 and value as JSON object with number and string values. These will be added to /entry/user.
Data file¶
Data file has the following structure:
Location | Description | Optional |
|---|---|---|
/entry/data/data | Images | |
/entry/detector/timestamp | Timestamp of the image | |
/entry/detector/exptime | Exposure time of the image | |
/entry/detector/number | Image number; if image rejection was used this will be the original image number | |
/entry/azint/bin_to_q | Azimuthal integration - bin-to-Q mapping | X |
/entry/azint/image | Azimuthal integration - per image | X |
/entry/bkg_estimate | Background estimate table | X |
/entry/MX/peakXPosRaw | Peak position X (see CXI format) | X |
/entry/MX/peakYPosRaw | Peak position Y (see CXI format) | X |
/entry/MX/peakTotalIntensity | Peak total intensity (see CXI format) | X |
/entry/MX/nPeaks | Number of peaks per image (see CXI format) | X |
/entry/MX/strongPixels | Number of strong pixel per image | X |
/entry/MX/nPeaksRingFiltered | Number of peaks not belonging to rings | X |
/entry/MX/imageIndexed | Image is successfully indexed | X |
/entry/MX/latticeIndexed | Crystal lattice for the image, assuming it is indexed | X |
/entry/jungfrau/info | Debug field of the JF detector | X |
/entry/jungfrau/storageCell | Storage cell number | X |
/entry/jungfrau/rcvDelay | Receiver delay for the image (Jungfraujoch debugging) | X |
/entry/jungfrau/rcvFreeSendBuffers | Receiver number of free send buffers at the time of sending the image (Jungfraujoch debugging) | X |
/entry/roi/{roi_name}/max | Max pixel value for roi named {roi_name} | X |
/entry/roi/{roi_name}/sum | Sum pixel value for roi named {roi_name} | X |
/entry/roi/{roi_name}/npixel | Number of valid pixel for roi named {roi_name} | X |
/entry/xfel/pulseID | Pulse ID (for XFEL only) | X |
/entry/xfel/eventCode | Event code (for XFEL only) | X |
If spot finding is enabled, spots are written in the CXI format and are recognized by CrystFEL. The following has to be added to the CrystFEL geometry file:
peak_list = /opt/MX
peak_list_type = cxi