Create an account

Very important

  • To access the important data of the forums, you must be active in each forum and especially in the leaks and database leaks section, send data and after sending the data and activity, data and important content will be opened and visible for you.
  • You will only see chat messages from people who are at or below your level.
  • More than 500,000 database leaks and millions of account leaks are waiting for you, so access and view with more activity.
  • Many important data are inactive and inaccessible for you, so open them with activity. (This will be done automatically)


Thread Rating:
  • 232 Vote(s) - 3.47 Average
  • 1
  • 2
  • 3
  • 4
  • 5
What's an effective library for suggesting keywords for content?

#1
Currently designing a CMS for use on my website. I am wondering if there were any free libraries available for creating tags based on the content.

**Example**

> I like trees. Trees are plants that have leaves. Leaves on tree can be
> multi-colored.

Would produce the tags *trees* and *leaves*.

The library should be PHP or JS.

**EDIT 1:**

I have found a simple library for half my task -

[To see links please register here]


I have edited what the library specifications should be (thanks to guidance from @NullUserException)-

- Count all words (ignoring case and inflections), throw out stop words and pick the ones with the highest frequency


- Edit text to make words that are more specific to the genre (may have a lower frequency), be of higher value. For example in the example - *'multi-colored'* should become higher value because it is more specific to the subject. However it should include a prefix indicating it relates to the subject (it would become leaves-multi-colored).

**EDIT 2:**

Algorithm should remove words that have less than 3 characters **unless** they are in capitals or formatted otherwise



Reply

#2
Are the tags on your CMS already defined? If yes you could index your text in memory and search using all known tags against your text. Pick the highest scoring tags and present to the user.

Indexing and searching could be done with

[To see links please register here]


**Edit**: Note that I do suggest that your tags/keywords are defined and manageable from an administration panel (like for example in wordpress). Otherwise you'd end up with thousands of keywords generated from your articles which would never help the end user.
Reply



Forum Jump:


Users browsing this thread:
1 Guest(s)

©0Day  2016 - 2023 | All Rights Reserved.  Made with    for the community. Connected through