More improvements on EMLStructureParser

Feb 24 2011 Published by admin under News

EMLStructureParser is now able to dump text from text/* parts. We are finishing the code. Naïve Bayes will be able to take advantage from this feature.
The extracion of text from an email is only done 1 time. We use a caching scheme to avoid computing it a lot of times.
We also cheked some errors during the parsing to detect malformed rfc2822 files.
And all of this is made using a finite-state machine scheme with a stack when parsing body parts.

Comments are off for this post

Latest news
Corporate INFO

This site and the software has been
developed by the Spam Team belonging to SING Group at University of Vigo.
Contact us
Please use the following details to contact us
- David Ruano: drordas at wb4spam.info
- Noemi Pérez: npdiaz at wb4spam.info
- J. Ramón Méndez: jrmendez at wb4spam.info
Special Thanks

Special thanks for their help and ideas to:

	This site and the software has been
developed by the Spam Team belonging to SING Group at University of Vigo.

More improvements on EMLStructureParser

Latest news

Corporate INFO

Contact us

Special Thanks