History of Transcriber versions
History of Transcriber versions
Version 1.5.2
Changes or bugs fixed:
- Add Arabic input support
- Remove global speaker database widget choice in 'Edit turn attribute' widget display
- Bug correction in the 'Edit turn attribute' widget display
- Separation of the named entity code in for easier modifications
- Explorer frame for easy navigation and selection of files (F2 key)
- Add shortcut to display or hide annotation tags (F7)
- Easier managment of frames and configurable interface
- Volume and Gain scales integrated in control and waveform frames
- Added shortcut for hiding events
- Themes management for easier user preference configuration (fonts/colors)
- Hyphen words can now be spellchecked correctly
- A trs and a automatic transcription can be compared
- Font can be modified in real time (except axis)
- Speaker button font can be modified
- Section button font can be modified
- The list of available fonts is displayed in a list box.It enables to acces all of them on Linux
Known bugs
- When compiling Transcriber from sources, 'configure' cannot be created from 'configure.in' using 'autoconf'
- Inversion of display flags for event tags and menu
Version 1.5.1
Changes or bugs fixed:
- Upper extensions are now recognized as valid extensions for transcription and audio files.
- A DTD different from trans13.dtd or trans14.dtd is not considered anymore as a former DTD.
- All unicode encoding are now supported on Windows
- Transcriber 1.5.0 refused to be launched if it found an external speaker database. That file was created when the speaker turns were modified. That bug has been corrected.
- The default browser (used for viewing help files) could not be set. This bug has been corrected.
Version 1.5.0
New or modified functions:
- A new DTD (trans-14.dtd) replace the old one (trans-13.dtd). For compatibility, files that need an older DTD can be opened with Transcriber-1.5.0. For such files, save or exit without any changes will display a message that inform you that the original format of your file is older than the format of your Transcriber application. You can choose to export in new format (recommended) or keep the original one.
- improvement of the "save audio selection" function: the menu entry is now "Save audio segment(s)" with two submenus "Selected..." and "Automatic...". You can also reached this function with contextual menu by right clicking on the signal and look in "Audio file" entry. The first submenu do exactely the same as the old function (i.e. save an audio selection) and the second one is an automatic mode to save all the audio segemnts corresponding to section, turn and sync. A dialog box allow you to choose wich kind of segment do you want and where you want to save it in.
- the event tags can now be translated when displayed in Transcriber, but they are not translated in TRS files.
- the binary released 1.5.0 is now available on Linux, Windows
- the spell checker is now Aspell and works on Linux and Windows
- help files are now viewed with the default browser
- shape computation for large files is now available on Windows
- help files are available in German
- now, when you destroy a begin or end event it also destroy the symetric associated event.
- trs files are associated to Transcriber on Windows
- shortcuts F1 to F7 are available. F1 is for help, F2 to F7 are used to hide or not the different parts of the interface (text, NE, command button, first and second signal view and message display).
- creation of a new event "Named entities" for named entities (NE) annotation. The F3 button is a shortcut that make available an interface to easily annotate NE. Automatic annotion is possible and every colors or fonts are configurable. The list of entities can also be modified. More details in the reference manual.
- creation of a global speakers database which contains all the global speakers you have imported in it. Import is done by Edit->Speakers->Update global speakers database, then you have to select the file from wich you want to import global speakers. More details in the reference manual.
- transcription files may use a different format than the original
one provided with the package, provided that they point to a DTD
located in the same directory, that their root element is still
<Trans> and that they are in a superset of the existing
format. Unknown XML tag will be only displayed in gray, but may not
be edited. Any XML file whith <Trans> root tag is now
supposed a valid .trs file, regardless of its actual extension (it
used to be limited to .trs and .xml).
- it is possible to define a default topic or speaker import file,
loaded when opening a transcription or creating a new one, using
'Edit/Speaker|Topic/Automatic import from selected file'
- export a transcription to text format in a simple "Speaker i: transcription" format; control over the formatting may be done by setting some configuation values on the command line or in the configuration file - see convert/text.tcl for more details (NB: a beta version of this filter was already distributed in the 1.4.6)
- display of NIST RT'03S .mdtm format (speaker subset only).
- added "-version" command line option.
changes or bug fixes:
- removing of lrec documentation (useless)
- removing of html_library (useless)
- Migration of user preferences to Transcriber 1.5.0 (Event widget size is reset)
- popups Find/Replace, Find Speaker, Find Topic, Spell Checker, Control Panel, Informations are now in foreground
- a warning window pops up if more than one transcription line is deleted
- fixed bug : an error message appeared when doing File/Synchronized Audio File/Add Audio File. That made the function unvailable.
- fixed bug : on some platform when using the Num_lock key, the edition was impossible.
- fixed bug : when changing localisation (for example english to french) the contextual menu of the wave were not translate so that when you wanted to open an audio file, an error message appreared.
- a warning message is displayed when temporal inconsistencies are found in a .trs file.
- Export
to .stm format now includes a commented header giving information about
the conversion process, and the file is written in the user-defined
encoding (rather than the default system encoding) - it may be possible
to override it on the command line e.g with 'trans -set encoding utf-8
-export stm *.trs'.
- multiwav extension now has a menu entry in the main 'File/Synchronized audio files' menu.
- 'Open segmentation file..' has been moved frome 'Options/Display' to 'File' menu.
- default string for new speaker or topics may be modified
(currently only by editing the default settings or the configuration
file, see etc/default.txt).
- improved parsing of LDC hub4 files in sgml/utf formats for segmenation display.
- bug corrected in the "-set" line option.
- bug corrected in saving a modified localization file.
- command line option "-export" may be used instead of "-convertto".
- Transcriber
is launched in batch mode (i.e. with tclsh rather than wish) by 'trans'
script when -export or -help command line options are found.
- under Tcl/Tk 8.4, it should be possible to open multiple files in a single dialog for the multiwav and segmentation features.
- convert filters using the extended tcl API 'readSegmtSet' have been made backward compatible with previou versions (<1.4.6).
Version 1.4.6
New or modified functions:
- multiwav extension, allowing several synchronized audio files to be associated with a single transcription. New audio files can be added using the contextual menu on the waveform (right-click), or a list of audio files can be given on the command line when launching the tool. It is derived from an original proposal by David Gelbart and Dan Ellis from ICSI, intended for meeting recordings management.
- Czech localization of the menus added, kindly provided by Zdenek Konopasek.
- Management of several configuration files (menus "Options/Save configuration as..." and "Options/Load configuration file..."); a configuration file may be specified at startup with '-cfg filename' command line option.
- Save audio selection in a separate .wav file (menu "File/Save audio selection as...").
- Explicit message when signal shape is not available and resolution display is thus limited.
- Text editor may be hidden (checkbox menu "Options/Display/Text editor", '-notext' command line option).
- Segmentation layers for background, sections or turns which are not in use (i.e. having a single segment for the whole document) may be automatically hidden (checkbox menu "Options/Display/Smart segmentation display").
- Random colorization of the turn segmentation layer, so that a color is automatically attributed to all occurences of the same speaker in the document (checkbox menu "Options/Display/Colorize speaker segments").
- Open a label file and display under the signal (menu
"Options/Display/Open segmentation file..." or command line option
"-lbl filename" -- option may be repeated). The label file information
may span several segmentation layers.
- Improved support of NIST .ctm file (including RT'03 format
extensions and multi-level display as labels); .stm can be either imported or displayed as multi-level labels.
- Added '-convertto format filename ...' command line option for
converting a set of file from the Transcriber format to the given
format in batch mode.
- Various other command line options added: '-set option value' for
overriding configuration value, '-sig2' for display of second signal
view, '-socket' for activation of external scripting service, '-help'
for a list of options.
- added compatibility update for Tcl8.4, due to changes in (internal) Tcl and C API. Transcriber binary libs will probably be incompatible between Tcl versions 8.3 and 8.4
- added support for Mac OS X (Darwin/Aqua).
- extended conversion filter API with 'readSegmtSet', for management of multi-level label files.
- discard unreadable default signal or transcription files from a configuration file.
- localization of the interface is now in a separate file for each language, for a better extensibility.
- segmentation layer name in the signal contextual menu becomes more explicit ("Background", "Turns", etc.)
- resolved a confusion involving GB-2312 encoding
Version 1.4.5
Released on July 11, 2002.
needs Tcl/Tk 8.0-8.3, Snack 1.7 or preferably 2.0.1, tcLex 1.2
New or modified functions:
- added new playback mode stopping before (rather than after) next boundary; added related menu entry
- added menu entry and ctrl-return binding for playing next segment
- added bindings alt-backspace/delete for deleting word before/after cursor respectively.
- added ctrl-alt-t binding for modifying turn attributesImplementation.
- added support for floating boundaries. They are viewed in the text editor as empty circles (standard boundaries being displayed as full bullets) and it is possible to toggle the state of each boundary or of the whole set of boundaries by right-clicking (or control-left-clicking) on the bullet (except the first one). When a boundary is moved, the neighbouring floating ones move while keeping a proportionality up to the next "hard" boundary. A floating boundary becomes automatically fixed after being moved. The floating state is marked in the .trs format with a space before the time value (this behaviour is meant for backward compatibility of .trs files with prior versions of Transcriber and should evolve in the future). Floating boundaries are intended for later support of import and edition of non-temporized manual transcriptions or dialogs.
- added import filters for various formats defined by NIST speech group (/www.nist.gov/speech/), especially the output of the 'sclite' scoring tool (alignement between word hypothesis from automatic speech recognition and manual reference) - see header of convert/sclite.tcl file -, the acoustic segmentation .sdt format and the word segmentation .ctm format (developed for the Coretex European project).
- reworked remote playback support (the script tcl/SoundServer.tcl needs to be launched on the client side) - feature still experimental
- support for client-server scripting of Transcriber through sockets. One need to launch the tool with: "trans -patch tcl/Socket.tcl" - see tcl/Socket.tcl for details on API. Experimental, intended for interface with other tools.
- improved support for NIST .stm format (convert/stm.tcl).
- store encoding used along with default settings and localization texts for better portability.
- improved default setting of shape directory for Windows/Mac platforms.
- turn around for Tcl8.3.2 bug involving 'a grave' appearing as 'A tilde'.
- turn around for Tk bug resulting in a bad cursor position when moving to the end of a word.
- define word boundary to be empty char on Unix platform, similar to Windows settings.
- take user preference about start or end of line into account when changing line with cursor.
- do not limit temporal display to 30 sec any more when the signal is not available.
- case insensitive split of the apostrophe for French and Italian spell checking.
- selection of a speaker in the list of speakers in the turn attribute windows now also possible with space keystroke.
- corrected automatic XML encoding detection for OS whose default encoding is not ASCII compatible (e.g. asiatic).
- corrected possible display inconsistency of axis (delayed repaint).
- various corrections to the configuration/compilation step.
Version 1.4.4
Released on April 19, 2002.
needs Tcl/Tk 8.0-8.3, Snack 1.7 or preferably 2.0.1, tcLex 1.2
New or modified functions: none.
Implementation changes or bug fixes:
- The cursor disappeared after playing a segment with Shift-Tab - fixed.
- The waveform display was sometimes not correctly refreshed - fixed.
Version 1.4.3
Released on June 21, 2001.
needs Tcl/Tk 8.0-8.3, Snack 1.7 or preferably 2.0.1, tcLex 1.2
New or modified functions:
- when Snack version 2.0 is installed with Transcriber, it is possible to play only the left or right channel from a stereo file using the new menu [Signal]/[Stereo channel]. It has no effects for mono sound files or when previous Snack version 1.7 is used.
- new conversion filter to .html format, which makes printing the transcription possible from any browser.
- new conversion filter to .cha (Childes/CHAT) format provided by Zhibiao Wu. Support for this format is yet experimental. When reading a ".cha" file, the tool switches to a CHAT mode with new attributes available in the interface. It switches back to the initial mode when creating a new transcription or reading a file in any non-Chat format.
- convert filter to .stm format has been modified; import from .stm is limited to the transcription.
- at first startup, the window automatically fits the whole screen; further resizing is better managed.
- slight corrections for signal and cursor display.
- (hidden feature, activated by putting 'hideLevels 1' in the configuration file): when reading a segmentation file (i.e. .lab file), hide the section/turn/background segmentation levels under the signal since they are always empty (their display can be switched on again using contextual menu available with the right button on the signal)
- Transcriber has been ported for the Macintosh/G3 platform (thanks to L. Spektor for the modifications). A new sub-directory 'src/mac' includes resources for the compilation of Snack, tcLex and Transcriber on a Mac with CodeWarrior Pro 5.
- automatic detection of sound files associated to a transcription file now works also for uppercase extensions (e.g. .WAV); it used to recognize only lowercase extensions (e.g. .wav).
- src/shape.c is now integrated into Snack1.7 (thanks to K. Sjölander) and is no longer distributed with Transcriber. Various parts of the code (src/wavfm.c, tcl/Signal.tcl, tcl/Play.tcl,...) has been modified according to the syntax of the new Snack commands. As a result, Transcriber 1.4.3 doesn't work with Snack 1.6.
- the format of the sound shapes has changed and has now the suffix ".shape". Previously computed shapes with ".shp" extension become useless and can be removed.
- src/segmt.c: corrected a bug in BorderTableFree (attempt to free an empty pointer when a wrong segment color name was given)
- src/wavfm.c: a double pixmap buffering is used for speeding up cursor-only changes (BTW, pixmaps are now freed at the end); a new widget sub-command 'cursor' has been created for that purpose and is used in tcl/Waveform.tcl:proc Cursor.
Version 1.4.2
Released on January 6, 2000
needs Tcl/Tk 8.0-8.2, Snack 1.6, tcLex 1.2
New or modified functions:
- boundary move can be forced with Shift-Button2 (or Shift-Control-Button1) - in this case, other boundaries are pushed accordingly.
- a break can be forced inside a segment with Shift-Return
- different playback modes can be chosen in [Signal]/[Playback mode] menu (continuous, with pauses, stop or beep at segment boundaries, or looping on segment or selection). The mode applies to any further playback. Default pause duration, beep file, and also an initial step back before playing are set in [Options]/[Audio file...].
- default position of the text cursor during automatic synchronization with the signal can be chosen in [Options]/[General...] between start and end of line (it used to be the end of line).
- in the 'Find and Replace' window, the 'Replace all' button sometimes caused an infinite loop - replacement is now stopped at the end of the document. Replacement of text within event descriptions failed and is now avoided.
- in the segmt widget, segments are now splitted vertically at first form-feed character instead of newline.
- execution failed if the installation pathname contained spaces - fixed.
- on first launch, open dialog box in the 'demo' subdirectory rather than in the current directory.
- when cancelling a font chooser and saving the configuration (i.e., [Options]/[Fonts]/[...], then Cancel, then [Options]/[Save configuration]), current font description was lost - fixed.
- always display cursor time with 3 digits precision.
- a date change with monitoring activated caused an infinite loop (e.g. after a night of inactivity).
- under Windows, user configuration file is now saved in the $USERPROFILE directory
- upon creation of a new turn, the channel setting was lost - fixed.
- the list of encodings is now located inside the etc/default.txt file for easier (manual) configuration.
- the character set used in any encoding remained restricted to iso-latin1 in version 1.4.1; it is now extended to the whole Unicode set for Tcl/Tk8.1 or higher. However, it is not possible to choose a font for a specific encoding, and display of non-ascii chars sometimes fails.
- src/shape.c slightly modified to follow API change in Snack1.6.4
Version 1.4.1 - (offers Unicode support under Tcl/Tk 8.1.1 or higher)
Released on October 28, 1999
needs Tcl/Tk 8.0-8.2, Snack 1.6, tcLex 1.2
compiled under Linux/x86 (Debian 1.3/2.0), Linux/alpha, Solaris (SunOS
5.5), SGI (IRIX 6.2).
New or modified functions:
- Unicode support, only available under Tcl/Tk 8.1.1 or higher. The encoding used for writing the transcriptions can be chosen in [Options]/[General...]. It defaults to ISO-Latin-1, but can be modified to Unicode (UTF-8) or other widespread 8-bits encodings. The same user-defined encoding is used for reading transcriptions produced by earlier versions of Transcriber without encoding header. Under Tcl/Tk 8.0, the behaviour remains unchanged and makes reading transcriptions in Unicode not possible.
- Changing all occurences of a speaker to another already existing one is possible by changing the old name to the new one.
- with Transcriber 1.4 under Tcl/Tk8.1.1 or higher, display of non-ASCII chars in the segmentation tier under the signal was followed by garbage; this has been corrected. Any Unicode char should be now correctly displayed, provided it is available in an installed font.
- for Windows, call to XDrawLines in segmt.c with empty segment is avoided.
- the transcription input/output has been modularized; modules can be added into the convert sub-directory for supporting new input or output formats (see convert/README file).
Version 1.4 - (mostly a stable version of 1.3 beta)
Released on September 27, 1999
needs Tcl/Tk 8.0-8.2, Snack 1.6, tcLex 1.2
compiled under Linux/x86 (Debian 1.3/2.0), Linux/alpha, Solaris (SunOS
5.5), SGI (IRIX 6.2).
New or modified functions:
- handling of raw sound files is improved - a default header size can be given in the [Options]/[Audio file...] window; changes to the default raw sound files settings immediately apply to the current sound file if necessary.
- background color of selected signal can be modified by the user - this can be necessary when not enough colors are available in the colormap.
- a sample of speech conversation transcription is provided in the demo sub-directory.
- the Help is viewed through 'netscape' when it is available.
- a -patch directory_name option has been added to the command line options for 'trans'; this will allow loading some extensions to the tool at startup in the future.
- no more Makefile at the root directory - all compilation and installation is done from within the 'src' sub-directory.
- the DTD has been extended with the "elapsed_time" field in the Trans tag (intended for total transcription time monitoring, but not currently used)
- shape calculation mode disabled by default for Windows platform; background calculation is always disabled.
Version 1.3b7
Released on July 6, 1999
needs Tcl/Tk 8.0 or 8.1, Snack 1.5p2 or 1.6a2 (do not use 1.6b1 under
Linux), tcLex 1.1.3 (do not use older release with Tcl/Tk8.1)
compiled under Linux/x86 (Debian 1.3/2.0), Linux/alpha, Solaris (SunOS
5.5), SGI (IRIX 6.2), Windows NT.
New or modified functions:
- check spelling (GUI for using 'ispell' external program)
- upon reading a transcription, look for a more recent automatically saved version and ask user to restore from this version instead (in case of crash during previous session)
- playback with pauses at segment boundaries (menu [Signal]/[Play with pauses])
- significant update of French user and reference manuals
- remote sound file server is functional again (tcl/SoundServer.tcl has to be launched on the host)
- menu for sorting an edited list (bindings, localization, etc.) upon any field (name, value...)
- slight modifications in .stm format
- no clock calibration needed anymore (instead, native Snack "audio elapsedTime" function available since 1.5p2 is used); cursor and callbacks management in tcl/Play.tcl updated.
- mp3 sound files work with the most recent Snack versions (1.6a2).
- new Snack sub-command "order", to be used in conjunction with "shape" and "get" sub-commands for remote sound file access in heterogeneous network.
- use a specific Snack API allowing support of Transcriber under Windows NT platform.
- updated "Help" menu for display of correct .html files.
Version 1.3b6
Released on May 17, 1999
New or modified functions:
- completely new French reference and user manual (HTML format)
- current cursor position in signal and vertical zoom are now saved with session configuration
- at user option, saving current configuration is proposed before leaving
- direct access to a cursor position is now possible with menu [Signal]/[Go to...] or by clicking on the message under the signal
- direct control of sound level in [Signal]/[Control Panel]; vertical zoom moved to the same window
- at startup, sound files are automatically detected, even if their suffix is unknown or lacking (in 1.3b5, they were rejected with the message "format unknown"); more suffixes are now recognized (.aif, .aiff, .smp)
- move segment boundaries can also be done using control-left button 1 (instead of middle button) for 2-buttons only mouse users.
- modification date and version are not editable any more in Episode window.
- language list is now displayed in lexicographical order of the current langugage.
- corrected bug in 1.3b5 with activity logging when day changed during session (caused infinite loop)
- corrected bug in update of turn and section button colors in the text editor
- changed duration for recalibration of clock to 10 seconds.
- "undo" very limited
- no spelling correction, automatic completion etc. yet
- no printing
- mp3 sound files partially working, but randomly causing Transcriber to crash
Version 1.3b5
Released on March 23, 1999
New or modified functions:
- transcription time monitoring has been rewritten and is displayed inside the "Informations" window if a log file is given
- upon startup, user is asked either for a transcription file or for an audio file.
- bug in 1.3b4 with turn attributes edition
Version 1.3b4
Released on March 16, 1999
New or modified functions:
- Cut/Copy/Paste of mixed text and events works as expected
- Bindings replacement string can include Events (through copy/paste or directly with selection); binding key can be typed as-is in the editor field
- Automatic space insertion at user option
- Save the geometry of some windows
- A user glossary is available
- Imported speaker can be selected; topics can also be imported the same way. Unused topics and speakers can be removed. All these functions are available with menu "Edit/Speakers" or "Edit/Topics"
- Bug correction in shape.c for 8bit wav sound files
- Format used in list edition since 1.3b3 slightly modified to allow a generic n-fields edition instead of 2-fields only
Version 1.3b3
Released on March 7, 1999
New or modified functions:
- DTD: extended Event tag with attributes type (noise/lexical/pronounce/language) and extent (begin/end/previous/next/instantaneous); previously defined Events default to instantaneous noise.
- Edition of Event has been upgraded to ths new format
- Background shape computation is back at user option
- Events are displayed in the segmentation bar; they weren't any more in 1.3b2
- Overlapping speech is now managed inside turn window
- Turns and sections can be destroyed from within their edition window
- Information window now also gives information about transcription
- Signal and transcription name are both displayed in the button bar when necessary
- Better localization of the interface
- Added menu "File/Export/..." for explicit exportation to non-native format; "File/Save" and "File/Save as..." always use native XML format
- Default extension of transcription files has been changed to .trs; .xml can still be used
- User configuration of:
- language list (in iso639 code)
- noise events list
- pronounce events list
- lexical events list
- display format of events, their color
- bindings
- localization strings in any language
- Switched to Snack1.5 (final), tcLex1.1 (final) and tcl8.0.5 for development (should still work with previous 8.0.x)
- C sources: reworked shape.c for use of latest Snack function and shape display of any type of sound file, including compressed ones.
- File format .typ has been corrected to come back to 1.2 behaviour
- Updated Transcriber URL to its new home page inside distribution and docs
- no more patch to tk_getSaveFile lib for choosing the save format
Version 1.3b2
(not publically) released on February 12, 1999
New or modified functions:
- DTD:
- added channel attibutes in Turn tag (=telephone/studio)
- added scope attribute in Speaker tag (=local/global)
- Added "File/Import speakers..." menu: import all speakers from a .xml file
- Automatic insertion of spaces before punctuations
- Suppressed "File/Close" menu
- Write to .stm format for further exploitation
- Modifs to Xml parsing libs
Version 1.3b1
Released on January 21, 1999 - test under Linux (Debian 1.3/2.0), Solaris (SunOS 5.5), SGI (IRIX 6.2).
New or modified functions:
- a new XML format becomes the native format for the tool instead of LDC's .typ
- validity of the transcription is checked upon the DTD (i.e. the syntax of the format)
- general informations about the transcriber name, the emission or the signal file are now kept along with the transcription
- when opening a transcription, the corresponding sound file is open automatically
- background music or noise is now handled and displayed in a separate transcription tier
- Speech or non-speech events and of comments are now handled as XML tags rather than normal text
- management of overlapping speech has been reworked
- management of speakers and topics is easier and more powerful, and allows global research and edition of speakers or topics
- automatic shape computation for large signals can be disabled at user option
- loading raw files is once again a problem - but it should work even better than before with future updates of Snack module...
- transcriber's productivity can be logged in a file at user option
- SoX libraries aren't used anymore; instead, all sound files are accessed with Snack module; specific open_sound and play_sound command disapeared
- tcLex module from Frédéric Bonnet (a lexical analyser generator for Tcl) is now used for the implementation of an object-oriented tcl-only validating XML parser
- installation procedure and global tool structure has been reworked thanks to D. van Leeuven useful hints
Version 1.2
Released on September 16, 1998 - test under Linux (Debian 1.3 and Redhat 4.3), Solaris, SGI (IRIX 6.2).
New functions:
- Signal:
- Display topic and speaker segmentation under the waveform along with text segmentation.
- Ability to display a second view of the signal at a different scale.
- Possibility to open raw sound files.
- Contextual menus on the waveform for direct actions on the signal and its display.
- Transcription:
- Undo/Redo, limited to segmentation boundary moves and text changes (undo all text modifs in the current segment since cursor moved into it).
- Find and replace in the text.
- Automatic auto-save at periodic intervals.
- New XML format for transcription output (format will probably be modified soon - see discussion about formats).
- hyphen words can now be spellchecked correctly
- Management of overlapping speech over one segment.
- More configurable options (fonts, colors, windows to display...). Most user choices (including current sound file, transcription file and current resolution display) are saved in the "~/.transcriber" user configuration file upon request.
- Localization of the menus in english or in french as option; other languages can be easily added.
- Help in HTML format.
- hyphen words can now be spellchecked correctly
- Topics were lost when file was saved as ".typ". Original LDC format was slightly modified to include topic. Markup for overlapping speech is now recognized.
- Default configuration file was named "~/.trancriber" (without 's'); it was corrected.
- Shortcut for 'Signal/Replay Segment' has been moved from <Alt-Tab> to <Shift-Tab>; it could be a problem on some platforms.
- Menus 'Segmentation/Move to.../First segment' and '.../Last segment' are now executed.
- ... plus a lot of minor changes
- Internal data management has been completely rewritten. Transcriptions are stored in memory as a tree structured like an XML file, and are accessed with a set of procedures in "tcl/Tags.tcl". Input file parsing is very simplified (an XML-conformant library using tcLex extension for Tcl written by Frédéric Bonnet is under development).
- Sound files are accessed via a new Tcl command "open_sound" which calls sox and/or sphere libraries and creates a new "sound-file" Tcl command - not (yet) compatible with Snack "sound" commands. Playback with a new Tcl command "play_sound" which calls Snack libraries. These functions are documented in the "doc" sub-directory.
- hyphen words can now be spellchecked correctly Sound shape (pre-computed at the resolution of 10 ms per pixel) are not computed in background any more. Rather, the interface is stopped the first time a large sound file is open during the shape computation; it is then saved for the next time it is needed.
- Remote audio playback server is no longer available; it seemed not to be essential, and Snack playback design is more robust. Remote audio file server is still in the distribution, but the default behaviour is to read audio files locally without launching the audio file server on the local host. If needed, the file server should be configured before launching to avoid security holes (e.g. by testing the client port or the sound file directory).
- axis, wavfm and segmt new Tk widgets have been updated (but they are not yet documented):
- wavfm widget now calls a "sound-file" command created with the "open_sound" command for shape calculation rather than reading/writing a channel (pipe or socket). Widget options "-width" and "-height" added.
- segmt widget support one indirection level for time indices : start and end segment times can be an index in an associative array. Time array and list of segment variables are automatically traced, and the widget is redisplayed if they are modified. Some minor other fixes.
- axis and segmt widgets are redrawn if named fonts are modified (allows interactive fonts configuration).
- Distribution structure:
- the main distribution directory is now named Trans-x.y (x.y being 1.2 in this version). For simplification, the "editor" sub-directory has been suppressed and its content ("doc", "img", "src", "tcl") is in the first level. The "pkg" sub-directory has been renamed "lib". The "bin" and "obj" sub-directory are no longer used.
- all C extensions to Tcl/Tk are grouped in a single auto-loadable package "libtrans.so"; there are no binary executable to be launched in background any more.
- a patched version of the last distribution of SoX (v-12.14) is included in the distribution in the "contrib" sub-directory
- html_library-0.3 from Stephen Uhler is included in the "lib" sub-directory.
- Snack official distribution is used without any patch. The lite version of the package is installed in the "lib" sub-directory.
- a configure procedure is available for the compilation of C sources in the "src" sub-directory
Version 1.1
Released on June 26, 1998 - test under Linux (Debian), Solaris, SGI.
Modifications:
- SNACK 1.1 module from Kåre Sjölander is used instead of Sox libraries for audio playback. This solution is more stable and portable : Snack works under HP-UX, Sun Solaris, SGI IRIX, Linux, and Windows 95/NT. The module is used in a patched version.
- No modifications of the user interface, installation procedure still rather crude.
Version 1.0
Released on May 25, 1998 - development under PC/Linux (Debian 1.3); test under Sun Solaris.