MySQL Forums
Forum List  »  Full-Text Search

Re: Oracle Text on MySQL
Posted by: Felix Geerinckx
Date: November 25, 2005 09:36AM

GM C wrote:

> I need to use the Oracle Text functionality (finding words or keywords inside ofice documents
> (like .doc) or (not necessary for me) in PDF in a Mysql DB.
> Is that functionality existing in this database?

No.

But for MS Word documents, I have used antiword (see http://www.winfield.demon.nl/) in a project as follows:

1) create a table (with FULLTEXT index)

CREATE TABLE docinfo (
docpath CHAR(200) NOT NULL PRIMARY KEY,
doctext MEDIUMTEXT,
FULLTEXT (doctext)
);

2) With your favourite progamming language (I prefer Perl), create a program to:
a) walk your directory tree
b) use antiword to extract the plain text from your documents
c) store this text, together with the filepath, in the docinfo table

3) Query this table with MySQL's fulltext index capabilities

You will have to repeat 2) on a regular basis to keep your table up-to-date. I use additional columns in my docinfo table (last modification date/time, filesize) to see which files haves changed and need an update.

--
felix
Please use BBCode to format your messages in this forum.

Options: ReplyQuote


Subject
Views
Written By
Posted
6222
November 25, 2005 08:56AM
Re: Oracle Text on MySQL
2848
November 25, 2005 09:36AM
2468
February 26, 2007 04:28AM


Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.