Just started self hosting this instance. Nothing on the docs mentioned anything about storage considerations.
How quickly will a lemmy instance eat up storage?
3 years ago by i_lost_my_bagel to c/selfhosted
Just started self hosting this instance. Nothing on the docs mentioned anything about storage considerations.
Considering this is going to be around a 5 user instance at most I think I'll be good for awhile. Thanks!
It's all about how many communities your user(s) subscribe to since your instance basically acts as a mirror for those.
My instance has been running for 23 days, and I am pretty much the only active local user:
7.3G pictrs
5.3G postgres
edit: I may have a slight Reddit Lemmy problem
It won't scale linearly. A lot of those users will be subscribed to subs the instance is already replicating. It would only be new subs that would add to the growth.
Question if you know: does a lemmy instance have to be publically accessable to work? Like, if I make an instance on my homelab can the instance "fetch" content and serve it faster locally? Could I reply to a post and have others see it? Etc
At the end of the day the vast majority of what needs to be saved is text. If media content is embedded, the the server just has to save the path to the file not the file itself.
Feels like this will benefit from some sort of fuzzy deduplication in the pictrs storage. I bet there are a lot of similar pics in there. E.g. if one pic or a gif is very similar to another, say just different quality or size, or compression, it should keep only one copy. It might already do this for the same files uploaded by different people as those can be compared trivially via hashing, but I doubt it does similarity based deduplication.
This is my small instance with way fewer users than lemmy.world.
11G pictrs
5.2G postgres
Out of curiosity, how long has your instance been up? Just want to get a sense of how fast storage is increasing for you.
23 days.
How has your Lemmy experience been on a self hosted instance? I'm currently using lemmy.world and it's very error prone, would self hosting reduce those errors at the expense of anything? Does federation take long or do you find you're getting federated content quickly enough?
The experience has been pretty good, to be honest. No instability, easy updates, etc. I find federated content quite quickly, because I use this script to populate the "All" feed.
I didn't make it! :) I think, @fmstrat@nowsci.com made it.
You won't get any old content, so that's a downside. You'll only get content after you start federating. Unless someone votes or comments on old content.
Other than that the only downside is spending time maintaining and updating it.
476M ./postgres 1.1G ./pictrs
After 3 weeks
I think as long as the original community the post is in doesn't purge the data, it's fine for other instances to purge if necessary.
Exactly, when dealing with big data, you need a strategy to archive old data. You can't just store everything in one DB. Smaller instances may not feel like keeping all the date from all the time. Even big instances should have a mechanism to move old data do different databases.
Depends. If you have a lot of users posting a lot of pictures and you use pictrs out of the box config, then a lot. If you are just running a few users with finite communities being synced then a lot less. The number is going to vary a lot as lemmy grows and gets older so hard to document realistic expectations. But docker images are probably going to take up more disk space than actual contents unless you get quite big. I just threw my PG volume into a tgz to move servers and it's less than a gig.
The lemmy.world admin said above that their instance currently takes up less than 100GB
Though this will accrue over time I suppose.
But for self hosting? You should be good for a long long time. The only pictures stored are the ones you upload, the rest is just text.
lol lemmy died almost immediately after i posted this time to figure out what the hell caused that
it was because i set a damn server icon
lol, yeah, that would crash any instance
(jokes aside, you'll probably need to keep it somewhat low-res, and i'd also recommend cropping it to square. my instance uses a 128x128 icon)
Unless they changed all of the comment and post ids to bigints that'll probably bring the site down before it runs out of storage. In defense of the lemmy developers they have been receptive to feedback, so I don't think it'll take long for that to be fixed if it hasn't already.
My instance dormi.zone has been running for around 3½ weeks now, has a 3-digit amount of users and hosts a community with little more than 1000 subscribers. Here's how much storage it currently takes up:
In the default Ansible configuration, storage will mostly be accumulated by log files that are automatically generated by Docker and deleted whenever you restart the Docker containers.
At least 3. Maybe 4.
I disagree. One big hunk of value of a place like this is being able to look back at old threads. How many times did people say they always put "Reddit" in front of their Google searches to get the information they were looking for? This could be the same.
It's really not, at least for the text part. Text posts and comments take almost nothing and storage continues to get cheaper.
Mainstream platforms are selling out because they've always had others and shareholders who ultimately want to make money.
Info is still useful for people doing google searches. It would be nice to be able to find common troubleshooting tips on Lemmy, etc.
@lemmy.world
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
Rules:
Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam.
Posts here are to be centered around self-hosting. Please ensure it is clear in your post how it relates to self-hosting.
Don't duplicate the full text of your blog or git here. Just post the link for folks to click.
Submission headline should match the article title.
No trolling.
Resources:
Any issues on the community? Report it using the report flag.
Questions? DM the mods!
go to feed...
@lemmy.world
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
Rules:
Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam.
Posts here are to be centered around self-hosting. Please ensure it is clear in your post how it relates to self-hosting.
Don't duplicate the full text of your blog or git here. Just post the link for folks to click.
Submission headline should match the article title.
No trolling.
Resources:
Any issues on the community? Report it using the report flag.
Questions? DM the mods!
go to feed...
This is lemmy.world after 4 weeks:
save