Hi,
I want to set up a newsserver (inn) with the archive of Polish Usenet.
I already have most of archives downloaded (in some weird format I'll convert, sort by date, and feed to inn), it's around 58 million articles (around 100 GB + overview). Seems manageable.
What would be the best spooling and overview method for this?
Right now I'm thinking about creating a file, formatting it as some filesystem (which filesystem? I use ext4 for my everyday needs, but maybe something else is better for this?), tuning its parameters, and using tradspool.
The file would be extended if needed, so the chosen filesystem has to have the capability to do it (ext4 can be extended if the underlying storage is extended, but I don't know about other filesystems).
If it's ext4, I also have a couple of questions on how to best tune it. That's what I want to do -- is it a good idea?
1. Set the bytes per inode ratio to 1536 (very low, but it will give me 69 million inodes per 100 GB)
2. Set the block size to 1024 (not too low?)
3. Set the inode size to 128
4. Set uid16 to disable 32-bit UIDs
5. Disable large_file
6. Set dir_index
7. Set reserved blocks percentage to some low value (is 0% OK?)
Overview would be tradindexed, I think it will suffice.
CNFS would be better if there was a way to throttle a server when it's
about to rotate the buffer (I don't want to lose articles, ever, even if there's some massive flood that would overwhelm my storage, I want to add new buffers and unthrotle server then), but is it even possible?
Plus in case of a flood I'd have a problem with deleting articles from
CNFS that I wouldn't have with tradspool...
Some idea would be to use timecaf, but:
1. It doesn't seem to be widely used, so it's also not very well tested.
Or is it? How stable it is?
2. Is there a way to rotate a .CF file when it's full (262144 articles), instead of relying on arrival time? I want to feed new articles as fast as
I can
3. Maybe there are some tools to initially write the .CF files directly, instead of letting inn handle it? Then I'd just have to build the rest (history, overview)
The server won't accept new articles from readers -- after the initial prefeeding from my archives there will be only a single feed from my main server.--- Synchronet 3.21a-Linux NewsLink 1.2
Suggestions are welcome.
I want to set up a newsserver (inn) with the archive of Polish Usenet.
I already have most of archives downloaded (in some weird format I'll convert, sort by date, and feed to inn), it's around 58 million articles (around 100 GB + overview). Seems manageable.
Overview would be tradindexed, I think it will suffice.
CNFS would be better if there was a way to throttle a server when it's
about to rotate the buffer (I don't want to lose articles, ever, even if there's some massive flood that would overwhelm my storage, I want to add
new buffers and unthrotle server then), but is it even possible?
Some idea would be to use timecaf, but:
1. It doesn't seem to be widely used, so it's also not very well tested.
Or is it? How stable it is?
2. Is there a way to rotate a .CF file when it's full (262144 articles), instead of relying on arrival time? I want to feed new articles as fast as
I can
3. Maybe there are some tools to initially write the .CF files directly, instead of letting inn handle it? Then I'd just have to build the rest (history, overview)
I would use ovsqlite because it may perform a bit faster with millions
of articles in a single newsgroup.
| Sysop: | Amessyroom |
|---|---|
| Location: | Fayetteville, NC |
| Users: | 54 |
| Nodes: | 6 (0 / 6) |
| Uptime: | 17:58:07 |
| Calls: | 742 |
| Files: | 1,218 |
| D/L today: |
4 files (8,203K bytes) |
| Messages: | 184,414 |
| Posted today: | 1 |