[DISCUSS] Website to monitor Lemmy servers' performance/availability

bahmanm@lemmy.ml · 1 year ago

[DISCUSS] Website to monitor Lemmy servers' performance/availability

glarf@lemmy.world · 1 year ago

I stopped using my preferred instance because I couldn’t tell if it was having problems or it was my Internet. This would be very useful for people like me to sanity check things.

MrCenny@lemmy.world · 1 year ago

There does exist something similar to this: https://lemmy-status.org. It will eventually have an automatic list, but it is not implemented yet. They are currently adding instances in manually. The owner is @[email protected], one of our infra people at lemmy.world. The website is not connected to lemmy.world by any means btw.

Valmond@lemmy.mindoki.com · 1 year ago

lemmy-status.org knows my instance (lemmy.mindoki.com) but when I search for it and selects it, it just shows global fediverse data :-/

bahmanm@lemmy.ml · 1 year ago

Thanks. Yes, lemmy-status.org was where I got the initial idea 💯

automatic list

For the website I’m thinking about, I’d rather keep it exclusively opt-in. I don’t wish to add any extra load since most of the instances are running off of enthusiasts’ pockets.

MrCenny@lemmy.world · edit-2 1 year ago

Oh sorry, didn’t see that 😅

much in the same vein as lemmy-status.org

I was also thinking that an opt-in or something similar would be nice. As overloading small project raspberries with a large monitoring website wouldn’t be that nice…

Valmond@lemmy.mindoki.com · 1 year ago

Even if you ping it once a minute it won’t even be noticeable IMO. When you surf (through) your Lemmy I stance there is a lot of traffic going on.

I imagine the ping would be for uptime? Or would you repeatedly scanlot of stuff? Then just do it rarely.

bahmanm@lemmy.ml · 1 year ago

I still haven’t made up my mind as to what is a good interval. But I think I’ll take a per-endpoint approach, hitting more expensive ones less frequently.

So far I can only think of 4-5 endpoints/URLs that I should hit in every iteration as outlined in the post above.

web/mobile home feed
web/mobile create post/comment
web/mobile search

I think those will cover most of the usecases.

bahmanm@lemmy.ml · 1 year ago

Thanks all for the input 🙏

I did a quick experiment w/ the APIs and I think I have identified the ones I’d need. Obviously, all is open source (GPLv3) available on github: lemmy-clerk

As the next step, I’m going to expose that data to Prometheus for scraping.

[DISCUSS] Website to monitor Lemmy servers' performance/availability

[DISCUSS] Website to monitor Lemmy servers' performance/availability

1 The Idea

1.1 Public-facing monitoring solution external to a cluster

1.2 A set of key endpoints

1.3 Presenting stats visually via graphs

1.4 History

1.5 Notification

2 Questions