Hi Guys,
Been a bit out of the loop recently however I've been busy behind the scenes.
I've been putting together a bit of software that will model the recorded audio sequences in order to facilitate any kind of patterns analysis we want. It imports label exports from Audacity to model the sequences in a structured (and most importantly queryable format) with start and end time stamps, along with other metadata, for all elements.
It is built in .Net and the object model and import process is almost complete. I haven't started on a UI yet, but Linq can be used to analyse the sequences and output the results to console. I will get it on Git as soon as I am able. Anyone any good at .Net UI design, as this is likely to be the bit that takes me longest to get in place!
Another benefit of this approach is that we can establish a set of transcriptions using audacity projects and an agreed set of label tags. We can use these to discuss/agree message content and I hope can develop into an approach to establishing a standard data-set as there seems to be quite a lot of inconsistency in the current transcriptions.
Marking up a recording in Audacity using labels is pretty straight forward, so as new recordings come it it will be easy to add them to this common dataset.
Anyone got any views on useful properties to capture for the audio elements in addition to start and end timestamps? At the moment I have sequences, which are formed of Segments. Segments start with a chitter and and with a Howl. They contain a collection of Purrs which can be any length.
Sequences are flagged with Recorder, System, Capture Time, and a decription
Howls just capture whether they are type 1 or 2.
Purrs capture an indentifer for the bit value, whether they are clipped/quiet and can also capture the frequency (if anyone can be bothered to work that out)
Chitters are just time stamps
Does this sound useful?
Cheers
Cmdr Phyl
I think this might be exactly what we need to start over with a deeper analysis of the audio that extends beyond what we've interpreted as just binary. We do need to perhaps gather more, cleaner audio samples that have some sort of quality control though; how many UAs are there still in the hands of CMDRs? Have any more been found since the beginning of last week? Also, is there any variation in the chitters or are they all identical?