GM C wrote:
> I need to use the Oracle Text functionality (finding words or keywords inside office documents
> like .doc or, though not necessary for me, PDF) in a MySQL DB.
> Does that functionality exist in this database?
No.
But for MS Word documents, I have used antiword (see
http://www.winfield.demon.nl/) in a project as follows:
1) create a table (with FULLTEXT index)
CREATE TABLE docinfo (
    docpath CHAR(200) NOT NULL PRIMARY KEY,
    doctext MEDIUMTEXT,
    FULLTEXT (doctext)
);
2) With your favourite programming language (I prefer Perl), create a program to:
a) walk your directory tree
b) use antiword to extract the plain text from your documents
c) store this text, together with the filepath, in the docinfo table
3) Query this table with MySQL's fulltext index capabilities
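For step 3, a query might look roughly like the sketch below. The search words ('invoice', 'draft') are only placeholders. Note that by default MySQL's fulltext search ignores very short words and stopwords, so some terms may not match.

```sql
-- Natural-language search: list matching documents, best match first.
SELECT docpath,
       MATCH (doctext) AGAINST ('invoice') AS score
FROM docinfo
WHERE MATCH (doctext) AGAINST ('invoice')
ORDER BY score DESC;

-- Boolean mode: documents that contain 'invoice' but not 'draft'.
SELECT docpath
FROM docinfo
WHERE MATCH (doctext) AGAINST ('+invoice -draft' IN BOOLEAN MODE);
```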
You will have to repeat 2) on a regular basis to keep your table up-to-date. I use additional columns in my docinfo table (last modification date/time, filesize) to see which files have changed and need an update.
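As a sketch of that bookkeeping: the column names docmtime and docsize, and the sample path and values, are made up for illustration and are not the names from my project.

```sql
-- Hypothetical extra columns for change detection.
ALTER TABLE docinfo
    ADD COLUMN docmtime DATETIME,
    ADD COLUMN docsize  INT UNSIGNED;

-- The indexing program looks up what is stored for a file ...
SELECT docmtime, docsize
FROM docinfo
WHERE docpath = '/docs/example.doc';

-- ... and, if the file is new or its date/size differ, re-extracts the text
-- and replaces the row (REPLACE relies on the PRIMARY KEY on docpath).
REPLACE INTO docinfo (docpath, doctext, docmtime, docsize)
VALUES ('/docs/example.doc', 'extracted plain text goes here',
        '2004-05-17 09:30:00', 48128);
```

Rows for files that no longer exist on disk would also need a DELETE at some point, otherwise they keep showing up in search results.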
--
felix