|
|
|
|
|
|
| |
| |
|
|
|
|
| |
| |
|
|
How about the ability to do a search on the newsgroups that are held on the
server.
Because povray.* isn't farmed out into the internet in general, newsgroup
searches such as deja.com are of no use.
I use Outlook Express which does allow me the ability to search the
newsgroups, but to save space I only download headers and only have the
message bodies for those things that I read.
There is a *lot* of very useful information in the newsgroups and I think it
would help if we could perform searches on the newsgroup message contents
and subject lines.
Just my twopenneth worth :)
Lee Brown
--
lee### [at] prodigynet
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On Tue, 20 Jun 2000 15:59:50 -0700, Lee Brown wrote:
>How about the ability to do a search on the newsgroups that are held on the
>server.
That's one of my on-again, off-again projects, but I haven't yet found an
indexing engine that can handle that much data efficiently.
--
Ron Parker http://www2.fwi.com/~parkerr/traces.html
My opinions. Mine. Not anyone else's.
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On 21 Jun 2000 09:11:59 -0400, ron### [at] povrayorg (Ron Parker)
wrote:
>>How about the ability to do a search on the newsgroups that are held on the
>>server.
>
>That's one of my on-again, off-again projects, but I haven't yet found an
>indexing engine that can handle that much data efficiently.
You know the saying, if you can't find one... :)
Peter Popov ICQ : 15002700
Personal e-mail : pet### [at] usanet
TAG e-mail : pet### [at] tagpovrayorg
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On Thu, 22 Jun 2000 01:47:19 +0300, Peter Popov wrote:
>On 21 Jun 2000 09:11:59 -0400, ron### [at] povrayorg (Ron Parker)
>wrote:
>
>>>How about the ability to do a search on the newsgroups that are held on the
>>>server.
>>
>>That's one of my on-again, off-again projects, but I haven't yet found an
>>indexing engine that can handle that much data efficiently.
>
>You know the saying, if you can't find one... :)
Easier said than done. I found one, actually, that seems like it can do
the job, but when it's trying to index the more active groups like this
one it dies when the indexing process reaches 64M of memory. I have 64M
of physical memory and 128M of swap, but it never seems to touch swap.
I haven't figured that out yet.
--
Ron Parker http://www2.fwi.com/~parkerr/traces.html
My opinions. Mine. Not anyone else's.
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On 21 Jun 2000 19:33:33 -0400, Ron Parker wrote:
>On Thu, 22 Jun 2000 01:47:19 +0300, Peter Popov wrote:
>>On 21 Jun 2000 09:11:59 -0400, ron### [at] povrayorg (Ron Parker)
>>wrote:
>>
>>>>How about the ability to do a search on the newsgroups that are held on the
>>>>server.
>>>
>>>That's one of my on-again, off-again projects, but I haven't yet found an
>>>indexing engine that can handle that much data efficiently.
>>
>>You know the saying, if you can't find one... :)
>
>Easier said than done. I found one, actually, that seems like it can do
>the job, but when it's trying to index the more active groups like this
>one it dies when the indexing process reaches 64M of memory. I have 64M
>of physical memory and 128M of swap, but it never seems to touch swap.
>I haven't figured that out yet.
(following up to myself)
I finally got it figured out. I now have word indexes of all of the
newsgroups on this server, including articles on off-topic that are long
gone. The collection is 18 megs, zipped (about 50 megs, unzipped) and
current as of 2 AM EST yesterday morning. I still have to figure out how
to do incremental updates, but that doesn't look too hard.
Now... what do I do with them? I don't know of any free web hosts that
allow custom CGI and will let me host 50 megs of indexes plus the swish-e
executable. Any suggestions? I have DSL, but it's only 128kb upstream
and my ISP doesn't let me run a server, so I'm stuck with external
solutions. For the best results, I'd also have to have a database of
names, dates, subjects, and messageIDs so I could look up info on the
results without having to hit the news server. That's easy enough to
build, but it'd take up lots of space too.
--
Ron Parker http://www2.fwi.com/~parkerr/traces.html
My opinions. Mine. Not anyone else's.
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On 22 Jun 2000 11:34:30 -0400, ron### [at] povrayorg (Ron Parker)
wrote:
>I finally got it figured out. I now have word indexes of all of the
>newsgroups on this server, including articles on off-topic that are long
>gone. The collection is 18 megs, zipped (about 50 megs, unzipped) and
>current as of 2 AM EST yesterday morning. I still have to figure out how
>to do incremental updates, but that doesn't look too hard.
>
>Now... what do I do with them?
Make some stats. Top posters, top repliers, average response time,
longest thread, top posters in the five longest threads etc., whatever
comes to mind.
Peter Popov ICQ : 15002700
Personal e-mail : pet### [at] usanet
TAG e-mail : pet### [at] tagpovrayorg
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On Fri, 23 Jun 2000 01:24:38 +0300, Peter Popov wrote:
>Make some stats. Top posters, top repliers, average response time,
>longest thread, top posters in the five longest threads etc., whatever
>comes to mind.
It's not that kind of an index. It's the kind where you can say "show
me all articles that contained the word ambidextrous and the word lunatic
in the povray.off-topic newsgroup."
--
Ron Parker http://www2.fwi.com/~parkerr/traces.html
My opinions. Mine. Not anyone else's.
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
"Ron Parker" <ron### [at] povrayorg> wrote in message
news:slr### [at] linuxparkerrfwicom...
| On Fri, 23 Jun 2000 01:24:38 +0300, Peter Popov wrote:
| >Make some stats. Top posters, top repliers, average response time,
| >longest thread, top posters in the five longest threads etc., whatever
| >comes to mind.
|
| It's not that kind of an index. It's the kind where you can say "show
| me all articles that contained the word ambidextrous and the word lunatic
| in the povray.off-topic newsgroup."
Most of them, eh?
I'd like to see a sorting based on what I'm thinking at the moment. Not gonna
happen.
Bob
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
"Ron Parker" <ron### [at] povrayorg> wrote in message
news:slr### [at] linuxparkerrfwicom...
[cut]
> (following up to myself)
>
> I finally got it figured out. I now have word indexes of all of the
> newsgroups on this server, including articles on off-topic that are long
> gone. The collection is 18 megs, zipped (about 50 megs, unzipped) and
> current as of 2 AM EST yesterday morning. I still have to figure out how
> to do incremental updates, but that doesn't look too hard.
>
> Now... what do I do with them? I don't know of any free web hosts that
> allow custom CGI and will let me host 50 megs of indexes plus the swish-e
> executable. Any suggestions? I have DSL, but it's only 128kb upstream
> and my ISP doesn't let me run a server, so I'm stuck with external
> solutions. For the best results, I'd also have to have a database of
> names, dates, subjects, and messageIDs so I could look up info on the
> results without having to hit the news server. That's easy enough to
> build, but it'd take up lots of space too.
Do you know if the index file is in some kind of standard format (like *.mdb
or .dbf). It may be possible to write a Client-Server java application that
would run on your machine (don't know if it could take the load though) to
process queries that the client side java code on the loaded page could
transmit. I don't know enough about the java database stuff to know how
easy it would be to do this. I could write some code in Delphi that would
do it, but that would mean that an .exe would have to be downloaded for
people to run on the client-side. Delphi would necessitate a known database
format which is why I asked that originally. This is not a very good
solution I must admit, but the only one that comes to mind.
Lee
--
lee### [at] prodigynet
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
| |
|
|
On Thu, 22 Jun 2000 23:49:44 -0700, Lee Brown wrote:
>Do you know if the index file is in some kind of standard format (like *.mdb
>or .dbf). It may be possible to write a Client-Server java application that
>would run on your machine (don't know if it could take the load though) to
>process queries that the client side java code on the loaded page could
>transmit. I don't know enough about the java database stuff to know how
>easy it would be to do this. I could write some code in Delphi that would
>do it, but that would mean that an .exe would have to be downloaded for
>people to run on the client-side. Delphi would necessitate a known database
>format which is why I asked that originally. This is not a very good
>solution I must admit, but the only one that comes to mind.
Interesting definition of standard you have there. No, the index file is in
swish-e format, because that's what I indexed it with. Server-side code would
have to run on Linux, and client-side would have to be sufficiently portable
to run on, at the very least, Linux, BSD, Windows, MacOS, and BeOS. I'd
rather make it work in a web browser.
Part of the problem is that I don't want to run the server locally, because
I have limited bandwidth available.
Fortunately, a solution may have presented itself. I just need to work out
the details.
--
Ron Parker http://www2.fwi.com/~parkerr/traces.html
My opinions. Mine. Not anyone else's.
Post a reply to this message
|
|
| |
| |
|
|
|
|
| |
|
|