cssutil 20040221-BlameYokohama-auto.1

NAME

cssutil - create, summarize and edit a .css file

SYNOPSIS

cssutil file.css [-b] [-h] [-r] [-s NNNN] [-S NNNN]

   file.css - .css file name
   -b - brief: print only summary, don't start shell
   -h - print help
   -r - print report
   -s NNNN - number of buckets in new .css file
   -S NNNN - number of buckets in new .css file, rounded up to next 2^n + 1 boundary

DESCRIPTION

cssutil is one of the CRM114 utilities for handling CRM114 .css files. cssutil creates .css files, reports on the state of .css files, and offers a shell for editing .css files.

Creating .css files
The cssutil program can be used to create information-less .css files:

     cssutil -b -r spam.css
     cssutil -b -r nonspam.css
This creates the full-size files ./spam.css and ./nonspam.css, holding no information.
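
When a new file is sized with -S, the synopsis says the requested bucket count is rounded up to the next 2^n + 1 boundary. The following Python sketch shows one reading of that rule: it returns the smallest value of the form 2^n + 1 that is not below the request. The function name is hypothetical and the exact rounding cssutil performs is an assumption here, not taken from its source.

```python
def round_up_to_pow2_plus_1(requested: int) -> int:
    """Return the smallest value of the form 2**n + 1 that is >= requested.

    Assumption: this is only one plausible reading of the "-S NNNN"
    description; cssutil's real rounding may differ in detail.
    """
    if requested < 2:
        raise ValueError("requested bucket count must be at least 2")
    n = 0
    # Grow the exponent until 2**n + 1 reaches the requested size.
    while (1 << n) + 1 < requested:
        n += 1
    return (1 << n) + 1


if __name__ == "__main__":
    for wanted in (1000000, 1048577, 1048578):
        print(wanted, "->", round_up_to_pow2_plus_1(wanted))
```

Under this reading, a request for 1000000 buckets would yield 1048577 (2^20 + 1).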

Editing .css files
The cssutil utility can be used to edit .css spectral files. By default (when no optional arguments are passed), cssutil first prints statistics on the given file, for example somefile.css, and then enters the cssutil shell, which allows you to rescale, clip, and otherwise manage your .css files. The following commands are available in the cssutil shell:

   Z n - zero bins at or below a value
   S n - subtract a constant from all bins
   D n - divide all bins by a constant
   R - rescan
   P - pack
   Q - quit

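To make the effect of the Z, S and D commands concrete, here is a small Python sketch that applies the same three operations to a plain list of bin counts. It does not read or write .css files. The function names are hypothetical, and two details are assumptions rather than documented behaviour: subtraction is clamped at zero, and division is integer division.

```python
from typing import List


def zero_at_or_below(bins: List[int], n: int) -> List[int]:
    """Z n: zero every bin whose value is at or below n."""
    return [0 if value <= n else value for value in bins]


def subtract_constant(bins: List[int], n: int) -> List[int]:
    """S n: subtract n from every bin (clamping at zero is an assumption)."""
    return [max(value - n, 0) for value in bins]


def divide_by_constant(bins: List[int], n: int) -> List[int]:
    """D n: divide every bin by n (integer division is an assumption)."""
    if n <= 0:
        raise ValueError("divisor must be positive")
    return [value // n for value in bins]


if __name__ == "__main__":
    bins = [0, 1, 2, 5, 40]
    print(zero_at_or_below(bins, 1))    # clips the rarely seen features
    print(subtract_constant(bins, 2))   # shifts all counts down
    print(divide_by_constant(bins, 2))  # rescales all counts
```

In this toy model, Z removes features seen only a few times, while S and D shrink all counts so that heavily trained files leave headroom for further learning.
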
Inspecting .css files
You can check that the .css files are reasonable. Invoke cssutil as:

    cssutil -b -r spam.css
    cssutil -b -r nonspam.css
You should get back a report something like this:
     Sparse spectra file spam.css statistics:

     Total available buckets          :      1048576
     Total buckets in use             :       506987
     Total hashed datums in file      :      1605968
     Average datums per bucket        :         3.17
     Maximum length of overflow chain :           39
     Average length of overflow chain :         1.84
     Average packing density          :         0.48
Note that the packing density is 0.48; this means that this .css file is about half full of features. Once the packing density gets above about 0.9, you will notice that CRM114 takes longer to process text. The penalty is small at packing densities below about 0.95 and only about a factor of 2 at 0.97. It is best to keep the packing density below 0.7 to 0.8.
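
The figures in the sample report are consistent with two simple ratios: packing density as buckets in use divided by available buckets, and average datums per bucket as total datums divided by buckets in use. The Python sketch below recomputes both from the numbers shown above and flags a file that exceeds the recommended density. The function name and the 0.8 threshold constant are illustrative choices, and the formulas are inferred from the report, not taken from the cssutil source.

```python
from typing import Tuple

# Upper end of the "keep it below 0.7 to 0.8" advice; illustrative choice.
RECOMMENDED_MAX_DENSITY = 0.8


def summarize(total_buckets: int, buckets_in_use: int,
              total_datums: int) -> Tuple[float, float]:
    """Return (packing density, average datums per in-use bucket)."""
    density = buckets_in_use / total_buckets
    avg_datums = total_datums / buckets_in_use if buckets_in_use else 0.0
    return density, avg_datums


if __name__ == "__main__":
    # Numbers copied from the sample report for spam.css.
    density, avg_datums = summarize(1048576, 506987, 1605968)
    print(f"Average datums per bucket : {avg_datums:.2f}")
    print(f"Average packing density   : {density:.2f}")
    if density > RECOMMENDED_MAX_DENSITY:
        print("Packing density is high; consider editing the file.")
```

With the sample numbers this gives a density of about 0.48 and about 3.17 datums per bucket, matching the report.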

SEE ALSO

cssmerge(1), cssdiff(1), crm114(1)

The CRM114 homepage is at http://crm114.sf.net/.

VERSION

This manpage: $Id: cssutil.azm,v 1.2 2004/02/25 14:56:06 vanbaal Exp $ This manpage describes cssutil as shipped with crm114 version 20031229-1.0-SanityCheck.

COPYRIGHT

Copyright (C) 2001, 2002, 2003 William S. Yerazunis

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program (see COPYING); if not, check with http://www.gnu.org/copyleft/gpl.html or write to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111, USA.

AUTHOR

William S. Yerazunis