moodle/search
2010-01-16 15:39:56 +00:00
documents fix last http://tracker.moodle.org/browse/MDL-20917 shorten errors. 2009-11-21 13:52:31 +00:00
tests MDL-18672 and MDL-18679 fixed. DIRECTORY_SEPARATOR / PATH_SEPARATOR mismatch 2009-03-25 21:28:43 +00:00
Zend MDL-19342 - small fixes from Tatsuva Shirai + UTF-8 global straighten up 2009-05-30 22:11:53 +00:00
.cvsignore Add/delete/update cron functions added. 2006-08-16 13:34:07 +00:00
add.php MDL-18672 and MDL-18679 fixed. DIRECTORY_SEPARATOR / PATH_SEPARATOR mismatch 2009-03-25 21:28:43 +00:00
cron.php MDL-14617 removed PHP5 checks from global search 2008-05-01 21:22:01 +00:00
cron_php5.php "MDL-18540, change the uses of option" 2009-03-30 10:07:24 +00:00
delete.php MDL-18672 and MDL-18679 fixed. DIRECTORY_SEPARATOR / PATH_SEPARATOR mismatch 2009-03-25 21:28:43 +00:00
index.php code review for packaging and code cleaning 2008-03-31 22:21:42 +00:00
indexer.php MDL-18670 adding proper permissions for mkdir(); merged from MOODLE_19_STABLE 2009-05-06 16:10:45 +00:00
indexersplash.php MDL-21233 moodle_url improvemewnts, code simplification, more diagnostics; fixed several regressions 2010-01-16 15:39:56 +00:00
indexlib.php Addresses the tracker MDL - 18638 and MDL 18474 2009-03-23 21:07:02 +00:00
lib.php Addresses the tracker MDL - 18638 and MDL 18474 2009-03-23 21:07:02 +00:00
LISEZMOI.txt Drift between CVS and git 2008-06-25 17:31:23 +00:00
query.php MDL-21233 moodle_url improvemewnts, code simplification, more diagnostics; fixed several regressions 2010-01-16 15:39:56 +00:00
querylib.php search MDL-19822 Upgraded deprecated calls and added set_url calls 2009-10-15 07:37:29 +00:00
README.txt Drift between CVS and git 2008-06-25 17:31:23 +00:00
README_ARCHIVE.txt Drift between CVS and git 2008-06-25 17:31:23 +00:00
searchtypes.php Addresses the tracker MDL - 18638 and MDL 18474 2009-03-23 21:07:02 +00:00
stats.php MDL-21233 moodle_url improvemewnts, code simplification, more diagnostics; fixed several regressions 2010-01-16 15:39:56 +00:00
update.php MDL-16979 - protects update queries by setting a real value in $CFG->search_indexer_update_date 2009-05-11 19:59:58 +00:00

This directory contains the central implementation of
Moodle's Global Search Engine.

The Global Search Engine indexes a large quantity of information
held in modules, blocks and resources, which Moodle stores either
in the database or on the file system.

The administrator initially indexes the existing content. Once this
first initialization has been performed, the search engine maintains
the indexes regularly, adding new entries, deleting obsolete ones and
updating those that have changed.
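
The maintenance cycle described above (add new entries, delete obsolete
ones, update changed ones) can be pictured with the following minimal
sketch. It is an illustration only, not the engine's actual code: the
function name plan_index_maintenance(), the document ids and the use of
last-modified timestamps are all assumptions made for the example.

```php
<?php
// Hypothetical helper, not part of Moodle: plans one maintenance pass.
// $indexed and $current both map a document id to a last-modified timestamp.
function plan_index_maintenance(array $indexed, array $current): array {
    $plan = ['add' => [], 'update' => [], 'delete' => []];

    foreach ($current as $id => $modified) {
        if (!array_key_exists($id, $indexed)) {
            // Content exists but was never indexed.
            $plan['add'][] = $id;
        } elseif ($modified > $indexed[$id]) {
            // Content changed after it was last indexed.
            $plan['update'][] = $id;
        }
    }

    foreach ($indexed as $id => $modified) {
        if (!array_key_exists($id, $current)) {
            // Indexed entry whose source content no longer exists.
            $plan['delete'][] = $id;
        }
    }

    return $plan;
}

// Made-up data: what the index holds versus what the site currently holds.
$indexed = ['forum-1' => 100, 'wiki-7' => 120, 'chat-3' => 90];
$current = ['forum-1' => 100, 'wiki-7' => 150, 'glossary-2' => 130];

$plan = plan_index_maintenance($indexed, $current);
print_r($plan);
```

With this sample data the plan adds glossary-2, updates wiki-7 and
deletes chat-3, while forum-1 is left untouched.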

A search produces links that open the information in the same context
in which it is usually accessed, from the current user's point of view.
Result filtering removes any link to information that the current user
would not be allowed to access directly.
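
As a rough illustration of result filtering, the sketch below drops every
hit that fails an access check for the current user. The function
filter_search_results(), the shape of a hit and the course-based rule are
assumptions chosen for the example; the real engine decides access
through its own per-document checks.

```php
<?php
// Hypothetical sketch: $canaccess stands in for whatever access check
// applies to one search hit for the current user.
function filter_search_results(array $results, callable $canaccess): array {
    // array_values() reindexes the hits that survive the filter.
    return array_values(array_filter($results, $canaccess));
}

// Made-up hits and a made-up rule: the user may only see course 2.
$results = [
    ['title' => 'Forum post', 'courseid' => 2],
    ['title' => 'Wiki page', 'courseid' => 5],
];
$allowedcourses = [2];

$visible = filter_search_results($results, function (array $hit) use ($allowedcourses) {
    return in_array($hit['courseid'], $allowedcourses, true);
});

foreach ($visible as $hit) {
    echo $hit['title'], "\n";
}
```

Filtering after the query, rather than inside it, keeps the index itself
independent of any particular user.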

Deployment
##########

The search engine is now part of the Moodle core distribution.

Some extra libraries may be added to convert physical documents to text
so that they can be indexed. Moodle CVS (entry contrib/patches/global_search_libraries)
provides packs for the antiword and xpdf GPL libraries. The search engine is ready
for Shockwave indexing, but does not provide the Adobe Search converters, which
should be obtained at http://www.adobe.com/licensing/developer/

1. Go to the block administration panel and set up the Global Search
block once. This will initialize useful parameters for the global search engine.

2. Insert a new Global Search block somewhere in a course or top-level screen. 

3. Launch an empty search (you must be an administrator).

4. Go to the statistics screen.

5. Start indexing (indexersplash.php). Beware: if your Moodle has
a large amount of content, the indexing process may be VERY LONG.

To search, go back to the search block and try a query.

Handled information for indexing
################################

In its current state, the engine indexes the following information:

- assignment descriptions
- forum posts
- database records (using textual fields only)
- database comments
- glossary entries
- glossary comments on entries
- Moodle native resources
- physical MS Word files as resources (.doc)
- physical PowerPoint files as resources (.ppt)
- physical PDF files as resources
- physical text files as resources (.txt)
- physical HTML files as resources (.htm and .html)
- physical XML files as resources (.xml)
- wiki pages
- chat sessions
- lesson pages

Some third-party plugins are also searchable using the new Search API implementation:

- Techproject

Extensions
##########

The revised search engine API allows:

- indexing the contents of blocks
- indexing modules or blocks that contain a complex information model
- securing access to the results
- adding indexing support for additional modules and plugins by adding a dedicated PHP script
- adding support for additional physical file types by adding a dedicated PHP script
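
To give an idea of what per-file-type handling might look like, here is a
minimal sketch of a converter registry keyed by file extension. The
$converters array and the extract_text() function are hypothetical names
for this example and do not describe the scripts shipped with the engine;
formats such as .doc or .pdf would need an external converter rather
than the in-process functions shown here.

```php
<?php
// Hypothetical registry: maps a file extension to a function that
// turns the raw file content into plain text ready for indexing.
$converters = [
    'txt' => function (string $raw): string {
        return $raw;
    },
    'htm' => function (string $raw): string {
        // strip_tags() removes the markup and keeps the text.
        return strip_tags($raw);
    },
];
// Supporting one more extension is a single extra entry.
$converters['html'] = $converters['htm'];

// Returns the extracted text, or null when no converter is registered.
function extract_text(array $converters, string $filename, string $raw): ?string {
    $ext = strtolower(pathinfo($filename, PATHINFO_EXTENSION));
    return isset($converters[$ext]) ? $converters[$ext]($raw) : null;
}

$text = extract_text($converters, 'notes.HTML', '<p>Hello <b>world</b></p>');
$unsupported = extract_text($converters, 'slides.ppt', 'binary data');

var_dump($text);        // string "Hello world"
var_dump($unsupported); // NULL, since no .ppt converter is registered here
```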

Future extensions
#################

- More information should be indexed, such as forum and glossary attachments,
  as well as the contents of other standard modules.
- Extending the search capability to an MNet network information space by aggregating remote search responses.