Different files format #17
Comments
|
(disconnecting now...) |
Yet another proposal for the file format. It would probably be more convenient if the data were provided in a table-like format: hits.csv: tracks.csv: Solution file: Files in this format can easily be read using numpy or pandas. |
mikhail: for hits.csv, we agree, but for tracks.csv and tracks_soln.csv it would be very inefficient to repeat the track parameters on each line. |
Now I see and agree. |
Hi. Input: Output: or another version of output where truth is also kept: This is communicated to @tboser already. Thanks. |
Hi, |
Hi.
This will be the input and output format:
Input:
https://www.dropbox.com/s/9iq0qbt54vtq489/input.csv?dl=0
The output file I wish to have should have the following format:
similar to the input, but the 'particle' column is replaced by the
'track' column. The absolute values of the track ids don't matter, only
their connections do.
Output:
https://www.dropbox.com/s/7r3udt7v8omu8nt/result.csv?dl=0
or another version of output where truth is also kept:
https://www.dropbox.com/s/1n90o0mwbetp6eg/result_truth.csv?dl=0
This is the minimum info. We are also considering adding "layer" and
"iphi" information to the input, which is redundant but may be useful.
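Since only the connections between hits matter and not the track id values, two result files can be compared by the groups of hits they define. A minimal sketch, assuming each result has already been loaded into a hit id -> track id dictionary (the dictionaries and the helper name `hit_groups` are hypothetical, not part of the proposed format):

```python
from collections import defaultdict

def hit_groups(hit_to_track):
    """Return the set of hit groups implied by a hit id -> track id mapping."""
    groups = defaultdict(set)
    for hit_id, track_id in hit_to_track.items():
        groups[track_id].add(hit_id)
    # Drop the labels: only which hits share a track is kept.
    return {frozenset(hits) for hits in groups.values()}

# Two hypothetical results that differ only in the track id values.
result_a = {1: 10, 2: 10, 3: 20, 4: 20}
result_b = {1: 7, 2: 7, 3: 3, 4: 3}
# A hypothetical result that connects the hits differently.
result_c = {1: 10, 2: 20, 3: 20, 4: 20}

same_ab = hit_groups(result_a) == hit_groups(result_b)
same_ac = hit_groups(result_a) == hit_groups(result_c)
print(same_ab, same_ac)
```

With this comparison, renumbering the tracks leaves the result unchanged, while moving a hit to another track does not.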
…On 23/02/17 11:20, hushchyn-mikhail wrote:
Hi,
Is this the final input/output data format?
|
This is the proposal for the file format
hits
hits.csv
event id, hit id, x, y
Event ID : integer
hit ID : integer
x,y : float in mm, with 4 digits after the decimal point (0.0001 mm, so sub-micron precision)
hits : should be ordered, for example by layer and then by phi (from -pi to pi); this is probably better than randomizing
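One way the ordering and the 4-digit formatting could look is sketched below. The layer index is an assumption made for illustration (the proposed hits.csv only lists event id, hit id, x, y), and the sample hits are invented:

```python
import math

# Hypothetical hits: (layer, hit_id, x_mm, y_mm).
hits = [
    (1, 3, -50.0, -0.001),
    (0, 1, 0.0, 30.0),
    (1, 2, 50.0, 0.0),
    (0, 0, 30.0, 0.0),
]

def phi(x, y):
    """Azimuthal angle of the hit; math.atan2 returns a value in [-pi, pi]."""
    return math.atan2(y, x)

# Order by layer first, then by phi within each layer.
ordered = sorted(hits, key=lambda h: (h[0], phi(h[2], h[3])))

# Write one line per hit with 4 digits after the decimal point.
event_id = 0
lines = [f"{event_id},{hit_id},{x:.4f},{y:.4f}" for _, hit_id, x, y in ordered]
print("\n".join(lines))
```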
??? currently we have this:
100000710,10000060,[977.9869479074608, 208.68578813535396]
==> how are the brackets handled by the csv reading module ???
==> also, what is the point of not starting at 10000000 ?
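On the bracket question: Python's standard csv module treats brackets as ordinary characters, so the comma inside the bracketed pair still splits the field and the brackets end up attached to the two coordinate strings. A small sketch using the sample line above, with a manual cleanup step to get back to the flat "event id, hit id, x, y" layout (the cleanup is just one possible approach):

```python
import csv
import io

raw = "100000710,10000060,[977.9869479074608, 208.68578813535396]\n"

# The default dialect gives four fields; brackets and the leading space
# stay inside the coordinate strings.
row = next(csv.reader(io.StringIO(raw)))
print(row)

event_id = int(row[0])
hit_id = int(row[1])
# Strip the brackets and surrounding spaces before converting to float.
x = float(row[2].strip(" []"))
y = float(row[3].strip(" []"))

# Same record in the flat layout with 4 digits after the decimal point.
flat = f"{event_id},{hit_id},{x:.4f},{y:.4f}"
print(flat)
```

Writing the coordinates as plain columns from the start would avoid the cleanup step entirely.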
ground truth
track parameter and list of hits for each track, and
tracks.csv
event id, track id, signP, phi, d0, hit id1, hit id2, hit id3 ....
signP : momentum signed by charge (+1 or -1, no neutral) ??? other option : signed inverse momentum, which is proportional to curvature
phi : angle ]-pi,pi]
d0 : impact parameter in mm, with 4 digits after the decimal point (0.0001 mm, so sub-micron precision)
??? : is it a problem if the number of hits is not always the same ? maybe we should fill with zeroes up to the maximum number of hits (= number of layers)
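For what it is worth, rows with a varying number of hits can be read with Python's csv module, which returns each row as a list of whatever length it has. The sketch below reads a hypothetical tracks.csv (the values are invented) and also shows the zero-padding variant:

```python
import csv
import io

# Hypothetical content: event id, track id, signP, phi, d0, hit id1, hit id2, ...
text = (
    "0,1,1.5,0.1000,0.0020,11,12,13\n"
    "0,2,-2.0,-1.2000,0.0100,21,22\n"
)

tracks = []
for row in csv.reader(io.StringIO(text)):
    tracks.append({
        "event_id": int(row[0]),
        "track_id": int(row[1]),
        "signP": float(row[2]),
        "phi": float(row[3]),
        "d0": float(row[4]),
        # Everything after the fifth column is a hit id, however many there are.
        "hit_ids": [int(h) for h in row[5:]],
    })

# Optional: pad with zeroes up to the longest hit list.
max_hits = max(len(t["hit_ids"]) for t in tracks)
padded = [t["hit_ids"] + [0] * (max_hits - len(t["hit_ids"])) for t in tracks]
print(padded)
```

Padding with zeroes only works if 0 is never used as a real hit id; tools that expect a fixed number of columns may still need the padded form.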
The points for not giving track id in the hit file :
Solution file
tracks_soln.csv
event id, reco track id, hit id1, hit id2, hit id3,...
reco track id : arbitrary integer (should be unique for one event id). Not absolutely needed, but probably easier for debugging
??? require maximum number of hits ?
??? leave room for reco track parameter (even if not required for the challenge), so that there is a nice symmetry with ground truth file ?
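The requirement that the reco track id be unique within one event id could be checked with a few lines of Python. This is only a sketch on invented data; the function name `duplicate_track_ids` is hypothetical:

```python
import csv
import io
from collections import Counter

# Hypothetical tracks_soln.csv content: event id, reco track id, hit id1, hit id2, ...
text = "0,1,11,12,13\n0,2,21,22\n1,1,31,32\n0,2,41,42\n"

def duplicate_track_ids(lines):
    """Return the (event id, reco track id) pairs that appear on more than one row."""
    counts = Counter()
    for row in csv.reader(lines):
        if not row:
            continue  # skip blank lines
        counts[(int(row[0]), int(row[1]))] += 1
    return sorted(key for key, n in counts.items() if n > 1)

duplicates = duplicate_track_ids(io.StringIO(text))
print(duplicates)
```

The same track id in two different events is accepted; only a repeat within one event is reported.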