Michael Camilleri

Send private message

You must be logged in to send a private message.

Friends

No friends yet.

Group membership

Activity

  • Michael Camilleri replied on the discussion topic Drop-down menu
    Excellent! Thanks a lot! One last issue left: The last item in the list is currently selected by default - how can I change that to be the first item?   view reply
  • Michael Camilleri added a new discussion topic Drop-down menu in the group Elgg Technical Support
    I need a drop-down menu that lists a number of subjects, something along the lines of the snippet below. <div class="publish_l_r"><?php echo elgg_view('input/access',array('internalname' => "category", 'options' =>...
  • Michael Camilleri replied on the discussion topic Elgg search
    I think copying the file contents in tags will solve this, at least temporarily. Any idea how to do that automatically when a file is uploaded? view reply
  • Michael Camilleri added a new discussion topic Elgg search in the group Elgg Technical Support
    My site (www.scholify.com) uses Scribd to dispay uploaded documents, however Elgg search does not index the contents of the documents uploaded by users. Instead search works only with tags, comments, etc. How can I get Elgg to index document...
    • True - Elgg only searches via metadata / tags. To index  document contents --> extracting those pertinent  words / type of word -- that search is desired upon and indexing them as tags attached to the owning entity.

      One small big problem will be the selection of words to index - whether using Elgg's tags or any other indexing algorithim.

      EG

      If document content is => 

      "The quick brown fox jumps over the lazy dog"...

      Then the words index yields ==>

      • Quick
      • Brown
      • Fox 
      • Jumps
      • Dog

      -- omitting prepositions and any other stop words.

      Could try the KWIC indexing algorithm ( indexes all cyclic permutations of keywords ) published on the internet in numerous places, though not much has been seen coded in PHP.

       

       

       

    • I think copying the file contents in tags will solve this, at least temporarily. Any idea how to do that automatically when a file is uploaded?

    • i think you might have missed some points i made ;)

      "copying contents" when file is uploaded is not whole story. 'copying file contents.." might osr some hundred loc.

      needs to be whenever content [ entities (that whose "words" are to searchable) ] are --> * created * updated * deleted --> leads to create/update/delete tags ;-X

      the "brown fox" analogy ? does one really want *all the content's  words indexed ? lots of server cpu load ;-) how to be selective in fetching keywords ? (believe that this is) *not an easy programming task.. could take Brett's BigBrother PlugIn (now quite dated) and upgrade. extend to capture content words for indexing as tags.

      if a usual Page (PlugIn) add/update generates 20-30 metadata db hits, indexing content with 1-200 words will do about 2000++ db hits ;-( no joy ;;;-X