Skip to content

Commit

Permalink
Merge pull request #88 from aarhusstadsarkiv/check-reidenfity
Browse files Browse the repository at this point in the history
Check reidenfity
  • Loading branch information
clausjuhl authored Oct 2, 2024
2 parents ed75121 + ab18ecb commit b053340
Showing 1 changed file with 25 additions and 13 deletions.
38 changes: 25 additions & 13 deletions fileformats.yml
Original file line number Diff line number Diff line change
Expand Up @@ -847,12 +847,11 @@ fmt/111:
name: OLE2 Compound Document Format
reidentify:
reason: why do we re-run these?
action: convert
convert:
tool: document
outputs:
- ods
- pdf
on_fail: "action"
action: manual
manual:
reason: We don't know what to do with this format.
process: Find out what we can do with this format.
fmt/115:
name: Windows Bitmap 2.0
ignore_if:
Expand Down Expand Up @@ -2122,11 +2121,15 @@ fmt/1516:
outputs:
- pdf
fmt/1600:
name: ''
name: 'ESRI ArcInfo DAT File (Internal)'
reidentify:
reason: Identifies .dat-files as mapinfo-files by extension only. They are sometimes
winmail.dat
action:
on_fail: "action"
action: "manual"
manual:
reason: There was no bit-header which matches the file.
process: Find out what we can do with this format. You can try and see, if there is a mismatch between our bit-header and the bit-header for the fil with the repository "byteheader".
fmt/1711:
name: Software602 Printer Configuration File
action: ignore
Expand All @@ -2146,12 +2149,20 @@ fmt/1730:
reidentify:
reason: Identifies .dat-files as mapinfo-files by extension only. They are sometimes
winmail.dat
action:
on_fail: "action"
action: "manual"
manual:
reason: There was no bit-header which matches the file.
process: Find out what we can do with this format. You can try and see, if there is a mismatch between our bit-header and the bit-header for the fil with the repository "byteheader".
fmt/1756:
name: Opendocument Text 1.3
reidentify:
reason: Identified by ext only. Prone to error
action:
on_fail: "action"
action: "manual"
manual:
reason: There was no bit-header which matches the file.
process: Find out what we can do with this format. You can try and see, if there is a mismatch between our bit-header and the bit-header for the fil with the repository "byteheader".
fmt/1763:
name: MacBinary
action: manual
Expand Down Expand Up @@ -2208,6 +2219,7 @@ x-fmt/13:
name: Tab-separated values
reidentify:
reason: .TAB-files related to GIS is sometimes identified as plaintext
on_fail: "action"
action: convert
convert:
tool: copy
Expand Down Expand Up @@ -2539,10 +2551,10 @@ x-fmt/266:
.wmz: aca-fmt/27
x-fmt/280:
name: XML Schema Definition
action: ignore
reidentify:
reason: If the XSD is not custom made, then we ignore the XSD file.
on_fail: "action"
action: ignore
ignore:
template: not-preservable
reason: A prescriptive XML template, for validating XML files with. Not preservation-worthy if not with a GML file.
Expand Down Expand Up @@ -2582,11 +2594,11 @@ x-fmt/345:
- pdf
x-fmt/346:
name: Microstation CAD Drawing
action: ignore
reidentify:
reason: Pronom identifies the format on extension alone
chunk_size: 2048
on_fail: "action"
action: ignore
ignore:
template: not-convertable
reason: At the moment the file can't be converted with the current tools for CAD-files.
Expand Down Expand Up @@ -2836,10 +2848,10 @@ xfmt/157:
- png
fmt/1549:
name: Bentley Microstation Hidden Line File
action: manual
reidentify:
reason: Pronom sometimes wrongly identifies .dgn as .hln due to small byteheader definition
on_fail: "action"
action: manual
manual:
reason: Never able to find this type of file.
process: Try to open in Bentley View or AB Viewer.
Expand Down

0 comments on commit b053340

Please sign in to comment.