CasualConc

© 2008-2009 Yasu Imao
How to Use‎ > ‎

Tags

The current version of CasualConc is not a tag-aware program except for certain style of information tags.  So all you set here is which ones to ignore.  The tag-deletion function is mainly to handle my own corpus, so this might not suit to your corpora.  I want to make CasualConc handle tags more sophisticately in the future (including xml tags).

These tag-deletion settings will be applied only in File Mode.  The idea is you delete tags when you create a database file, so you won't need to delete tags when you analaze the text from a database file.

There are four ways to delete tags or tagged texts.  These might be reorganized or completely reworked in the future.


To enable tag-deletion feature, check the box next to Ignore Tags.

End of Header

If you specify a text string in this box, the text up to the string will be deleted from the analysis.  So, for example, if you have a header that ends with </info>, type it in the box and CasualConc ignores the text that ends with </info>.

Ignore Tags

This is simply to delete tags themselves, assuming tags are in brackets.  The currently CasualConc can only delete tags in <>.  If your tags include characters other than letters (that matches regex '\w') and a slash ('/'), you need to specify them in the box.

Parts to Ignore

This is to specify a part to delete (up to three) that appears in between specified strings.  This assumes the specified part only appears once in a certain file.  So only the first appearance of the specified part will be deleted.  If you want to delete sections with the same tag set, use Sections to Ignore.

Sections to Ignore 

This is to specify a section that appears repeatedly in the same file.  All the instances of the area in between the specified strings (including these strings) will be deleted.