Tuesday, October 21, 2008

Using wget through a proxy

wget --execute=http_proxy=proxy.example.com:8080 --proxy-user="username" --proxy-password="password" web_address

Here proxy.example.com is the address of the proxy, 8080 is its port number, username and password are the proxy credentials, and web_address is the URL to download.

Thursday, October 2, 2008

Automatic Document Summary Generation

Automatic document summary generation is the process of producing a summary of a document without manual intervention. We need systems of this kind because nowadays we deal with huge volumes of data, and it is simply impossible to read through all of it manually.
A summary of a document can be generated in two ways:
  1. Text Extraction: Selecting the sentences from the document that are most important and that represent the document as a whole. While selecting sentences we make sure the summary is free of redundancy. That is, once a sentence has been selected, all other sentences with a similar meaning are excluded.
  2. Natural Language Generation: This method is similar to the way a person writes a summary by hand. A summary of this kind does not reuse sentences from the document; it is written in new words.
The text extraction method is often considered the better approach because it represents the views of the author rather than the understanding of the summary writer.
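
The text extraction idea can be sketched in a few lines of Python. This is only a minimal illustration under my own assumptions, not a reference method: importance is approximated by how frequent a sentence's words are in the document, and "similar meaning" is approximated by word overlap (Jaccard similarity). The names summarize, STOP_WORDS, max_sentences and similarity_threshold are hypothetical and chosen just for this example.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption made for this sketch).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "and", "to", "in", "it", "that", "on"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def content_words(sentence):
    """Lowercased set of the words in a sentence, minus stop words."""
    return {w for w in re.findall(r"[a-z']+", sentence.lower()) if w not in STOP_WORDS}


def jaccard(a, b):
    """Word-overlap similarity between two word sets, from 0.0 to 1.0."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def summarize(text, max_sentences=2, similarity_threshold=0.5):
    """Pick the highest-scoring sentences, skipping near-duplicates."""
    sentences = split_sentences(text)
    word_sets = [content_words(s) for s in sentences]

    # Number of sentences in which each content word appears.
    freq = Counter(w for ws in word_sets for w in ws)

    # Score a sentence by the average frequency of its content words.
    scores = [sum(freq[w] for w in ws) / len(ws) if ws else 0.0 for ws in word_sets]

    # Visit sentences from most to least important.
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)

    chosen = []
    for i in ranked:
        if len(chosen) == max_sentences:
            break
        # Redundancy check: skip a sentence that overlaps too much
        # with any sentence that has already been selected.
        if all(jaccard(word_sets[i], word_sets[j]) < similarity_threshold for j in chosen):
            chosen.append(i)

    # Return the selected sentences in their original document order.
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    sample = "Cats chase mice. Cats chase mice daily. Dogs sleep."
    print(summarize(sample))
```

For the sample text, the second sentence is dropped because it overlaps too much with the first, so the result is the first and third sentences. Word overlap is a crude stand-in for meaning; a real system would probably need something stronger.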

to be continued...