Help, Need A.L.I.C.E Web page Spider
Please Please help with this.
I need a spider created that will crawl a webpage and grammatically parse the page and create AIML (Artificial Inelligence Markup Language) data. This data will be saved into an AIML file and used to teach a chatterbot the contents of the web page.
The way we see it working is:
1. The spider crawls a page examining the text of each sentence.
2. Then using a grammatic parser it will reformulate that sentence data into possible patterns and responses to be entered as data in the AIML file.
3. Then it will format this into a standard AIML file and allow you to save this code or copy and paste it to another source
This will require someone experience in AIML as well as grammatic sentence parsing.
The idea for this project is to get the stand-alone script, possibilities of this being a desktop VB script. But I hear that existing perl and php extensions may make this easier. Open to other suggestions. Preferably a desktop application to start.