this post was submitted on 16 May 2024
54 points (98.2% liked)

Privacy

31981 readers
230 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

Chat rooms

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago
MODERATORS
 

I have my personal blog, made with Hugo and hosted on GitHub pages. Initially I did not turn on any kind of web tracking / web analytics, because I do not like tracking at all. But I want to make my blog better and to achieve it, I need a feedback loop about traffic. For example, what are the most popular publications, or how many people view my blog from mobile devices, etc.

So, my question is, what is the most appropriate (ot the less evil) way to track a web traffic?

An answer "there is no good way to do it without breaking user's privacy" is acceptable too, I did not decide yet turning on the analytics. Instead I'm interested in an opinion of the community.

Thanks in advance!

top 19 comments
sorted by: hot top controversial new old
[–] [email protected] 2 points 6 months ago (1 children)

The folks who're recommending analysis of your site's access logs are correct. However, Github Pages doesn't have any such notion. You might have to go with the recommendations of moving stuff over to a VPS.

[–] [email protected] 1 points 6 months ago

It is always a tradeoff. It could be cool to see some analytics of visits, but I'm not ready to go self hosted for that. My blog is just a hobby, not a work, no monetization, etc.

[–] [email protected] 5 points 6 months ago

Checkout goaccess. No additional tracking needed, it just reads your server logs. https://goaccess.io/

[–] [email protected] 12 points 6 months ago (1 children)
[–] [email protected] 3 points 6 months ago

I was using plausible for one of my projects and had a good experience.

[–] [email protected] 2 points 6 months ago (1 children)
[–] [email protected] 1 points 6 months ago (1 children)

awstats parses logfiles to create statistics, which op probably has not due to the page beeing hosted on github

[–] [email protected] 1 points 6 months ago

Yeah my point is you gotta self host to collect analytics without violating your visitor's privacy.

[–] [email protected] 3 points 6 months ago

Think of it like people walking into a brick and mortar retail store and what they should be able to expect from an honest local business. For most of us, the sensitivities are when your "local store" is collecting data that is used for biased information, price fixing, and manipulation. I don't think you'll find anyone here that boycotts a store because they keep a count of how many customers walk in the front door.

[–] [email protected] 5 points 6 months ago (1 children)
[–] [email protected] 11 points 6 months ago* (last edited 6 months ago) (1 children)

The right way to do this is to self-host your analytics.

I don't know which tools are popular for this nowadays, but something like Matomo On-Premise might be worth a look. I expect you can find more with a web search. Keywords: open-source self-hosted web analytics.

[–] [email protected] 2 points 6 months ago (1 children)

Interesting, thanks! I didn't think about it. But for a personal blog (without any kind of monetization) it is not an option, unfortunately, due to hosting/infra prices 😞

[–] [email protected] 6 points 6 months ago* (last edited 6 months ago)

VPS can be had very cheap: https://lowendstock.com/

Also, it might be worth looking for analytics software that can get its data from web server log files. I have done that with Apache and Nginx in the past. These days, I wouldn't be surprised if such software can ingest the log files created by Amazon's S3 free tier. You wouldn't have to manage a VPS with that approach.

Of course, if you're letting a major data collector like Github (Microsoft), Amazon, or Cloudflare serve your site, it's not particularly good for privacy to begin with.

[–] [email protected] 1 points 6 months ago

I think Matomo is the popular ethical Google analytics alternative IIRC

[–] [email protected] 1 points 6 months ago

How are you on self hosting something? You could give a couple of these a try. Data stays with you so you get to decide how much to track.

https://github.com/awesome-selfhosted/awesome-selfhosted?tab=readme-ov-file#analytics

[–] [email protected] 1 points 6 months ago

I use Ackee, it’s awesome!

https://ackee.electerious.com/

“Self-hosted, Node.js based analytics tool for those who care about privacy. Ackee runs on your own server, analyzes the traffic of your websites and provides useful statistics in a minimal interface.”

[–] [email protected] 1 points 6 months ago

Tracking how many users you have and how many of them visit an article is hardly any breach of privacy compared to what ad companies retain.

Use an open source solution about analytics and minimise data collection if not absolutely necessary.