The current version of CasualConc is
not a tag-aware program except for certain style of information tags. So all you set here is which ones to ignore. The
tag-deletion function is mainly to handle my own corpus, so this might
not suit to your corpora. I want to make CasualConc handle tags more
sophisticately in the future (including xml tags). These tag-deletion settings will be applied only in File Mode.
The idea is you delete tags when you create a database file, so you
won't need to delete tags when you analaze the text from a database
file. There are four ways to delete tags or tagged texts. These might be reorganized or completely reworked in the future. To enable tag-deletion feature, check the box next to Ignore Tags. End of Header If
you specify a text string in this box, the text up to the string will
be deleted from the analysis. So, for example, if you have a header
that ends with </info>, type it in the box and CasualConc ignores
the text that ends with </info>. Ignore Tags This
is simply to delete tags themselves, assuming tags are in brackets.
The currently CasualConc can only delete tags in <>. If your
tags include characters other than letters (that matches regex '\w')
and a slash ('/'), you need to specify them in the box. Parts to Ignore This
is to specify a part to delete (up to three) that appears in between
specified strings. This assumes the specified part only appears once
in a certain file. So only the first appearance of the specified part
will be deleted. If you want to delete sections with the same tag set,
use Sections to Ignore. Sections to Ignore This
is to specify a section that appears repeatedly in the same file. All
the instances of the area in between the specified strings (including
these strings) will be deleted. |
