-
Notifications
You must be signed in to change notification settings - Fork 5
UnixStats
Unix provides myriad commands useful for tallying transcripts or keeping tabs on how much work remains. The find command is just one of them but the tool I found most useful.
The following examples are for IBM transcripts but are easily modified to handle Google transcripts or any other text file name or extension.
$ find "/Volumes/Samsung USB/AudioJournals_TEST/" -name "hypotheses.*" -size +10c | sort -r -n -k7
$ find "/Volumes/Samsung USB/AudioJournals_TEST/" -name "hypotheses.*" -size +10c | sort -n -k7
Filters out large files and finds only small files having less than 10 bytes, sorted by size, smallest last
$ find "/Volumes/Samsung USB/AudioJournals_TEST/" -name "hypotheses.*" -size -10c | sort -n -k7
$ find "/Volumes/Samsung USB/AudioJournals_TEST/" -name "hypotheses.*" | wc -l
$ find "/Volumes/Samsung USB/AudioJournals/ibm_stt" -name hypotheses.txt -size +10c | wc -l 10
$ find "/Volumes/Samsung USB/AudioJournals/ibm_stt" -name hypotheses.txt -size -10c | wc -l 0