10

Need some advices on choosing open source web analytics tools, including both of page tagging and log file analysis type. I know some of them: piwik, open web analytics, awstats and more... what are the best ones among them(page tagging type and log file analysis type) ?

HopelessN00b
  • 53,385
  • 32
  • 133
  • 208
Mickey Shine
  • 929
  • 4
  • 16
  • 33

8 Answers8

11

In the log files analyzer domain, these are the most widely used:

  • AWStats
  • Webalizer
  • Analog
  • W3Perl

Analog and Webalizer are written in C and are the faster (10000-20000 lines per second).

As mentioned earlier by @MadHatter Analog has been developed by an ex-Cambridge statistician, that makes Analog a really precise and technical tool, but since the year 2005 it's not developed anymore.

Webalizer is not developed anymore but it's easier then Analog to be used.

AWStat and W3Perl are written in Perl and they are the most active projects, but are faraway slower then Analog and Webalizer (3000/4500 lines per second). They differ from each others for the data produced and the way the data is rendered.

AWStat displays statistics in a really attractive manner, but produce less statistics then W3Perl.

I advice you these links for further info:

http://www.aardal.com/stats/docs/uk/speed.html
http://www.w3perl.com/
http://awstats.sourceforge.net/

My advice is:

  • W3Perl if you want granular statistics and you don't have too many GB of log files
  • AWStat in the case you need don't have too many GB of files and you want a nice graphical representation.
  • Webalizer if you have tons of log files to analyze.
  • Analog if you need really accurate statistics, huge log files and you have C development experience (or you know anybody who can help).

Regarding Page tagging The winner is surelly Google Analytics, has the data collected and produced is better than the other solutions, but as one day may happens that Google will ask money for it...

W3Counter and Xiti are providers that require you, for the free version, to install an image on each web page you want to monitor. Both are for small sites.

Open Web Analytics and Pikwik are great open source solutions. Both are quite mature and stable but they require MySQL database and PHP support.

If you don't need an "home made" solution I would definitively go for Google Analytics and between the open source projects I would choose Open Web Analytics as it's more mature then Piwik.

tmow
  • 1,187
  • 8
  • 20
5

I am very satisfied with piwik. I just miss the possibility to adjust the widgets dependent on the website. I use it to check my drupal sites and there is a module for drupal too.

groovehunter
  • 243
  • 2
  • 7
1

For my money, the best log analysis tool is analog. It's screamingly fast (modulo DNS lookups, which are equally slow for everyone), is written by an ex-Cambridge statistician, and has a very useful page telling you - and your management - what can and cannot properly be known from your web logs, bizarre commercial claims notwithstanding.

I can't comment on page tagging, though; sorry.

MadHatter
  • 78,442
  • 20
  • 178
  • 229
1

This is experimental software, but nevertheless, quite impressive:

Hummingbird lets you see how visitors are interacting with your website in real time. And by “real time” we don’t mean it refreshes every 5 minutes—WebSockets enable Hummingbird to update 20 times per second. Hummingbird is built on top of Node.js, a new javascript web toolkit that can handle large amounts of traffic and many concurrent users.

miku
  • 445
  • 1
  • 3
  • 12
1

I would say (although I'm biased as co-founder ;-) that SnowPlow is the most powerful open source tagging-based web analytics tool out there.

SnowPlow has a loosely coupled, distributed architecture which uses Hadoop and Hive, so it scales to millions or even billions of events - this is something that no MySQL or other RDBMS-based solution can do.

The other big innovation in SnowPlow is that your event data is stored in a clean, immutable, denormalised, atomic "flat file" structure - in other words, an analytics data warehouse. This enables a lot of very sophisticated analyses using Hive, as well as straightforward joins with your third-party data (e.g. CRM or sales data). Again, this is more powerful than other solutions, which tend to collapse atomic data into aggregates, truncate old data or use head-scratching normalised structures which are really hard to query directly or join to other sources.

You can read more about SnowPlow's technical architecture here.

Alex Dean
  • 143
  • 6
0

Boomerang is the best tool available for clientside performance analysis - but its just the js agent - I don't know of any off-the-shelf reporting tool, Pastmon is useful for network stuff, Piwik and AwStats you've already mentioned. Analog is a very similar tool to AwStats - less configurable but much faster processing.

If you're looking to roll your own solution, there's free ip to location data available

symcbean
  • 19,931
  • 1
  • 29
  • 49
0

I know people who use Webalizer for Web site log analysis. Just to mention one more option.

There's also this discussion about the possibilities.

oherrala
  • 215
  • 1
  • 2
  • 8
  • 1
    Webalizer is no longer actively developed, I'm afraid will lose its market share inexorably. – tmow Feb 14 '11 at 09:15
0

After some research on open source web analytics packages I think the clear winners are http://www.openwebanalytics.com and http://piwik.org/

Reason being the ease of use, installation real-time capabilites.

One impressive feature I would like to mention is Heat Map generated by openwebanalytics.

dcode
  • 1