
Sysadmin monitoring and alert tools?

Discussion in 'System Administration' started by Zykov, May 31, 2014.

  1. rdan

    rdan Well-Known Member

    4,549
    1,090
    113
    May 25, 2014
    Ratings:
    +1,591
    Local Time:
    8:26 PM
    Mainline
    10.2
    Let me join the club :D
    upload_2018-3-21_0-20-56.png
     
    • Like x 3
  2. BoostN

    BoostN Active Member

    124
    27
    28
    Aug 19, 2014
    Ratings:
    +40
    Local Time:
    7:26 AM
    1.13.6
    10.0.34
    Here is mine for the past 24 hours:
    upload_2018-3-20_11-19-52.png

    Linode 4096 plan.

    Average ~2000 users daily according to my Google Analytics "Audience Overview"

    17K members,

    530K posts.

    I don't know if this is good or bad honestly. This is the only site (XF 1.5.x) I have on this box right now.

    PHP 7.1.12
    Server version: 10.0.33-MariaDB MariaDB Server
     
    • Like x 2
    • Informative x 1
  3. deltahf

    deltahf Premium Member

    325
    147
    43
    Jun 8, 2014
    Ratings:
    +235
    Local Time:
    8:26 AM
    Wow, really interesting to see how everyone's graphs compare!

    I am drooling at your low PHP times, @eva2000! :eek:

    I bet it's more than just traffic, though... my PHP-FPM config is probably leaving a lot of performance on the table. Another issue is probably WordPress — I'm using a good caching plugin, but it's still a real slog when it comes to performance. :poop:

    For XenForo, though, I wonder what's going on... when it's running at its best, my own forum usually feels faster than the Centminmod Community forums (my own forum actually feels faster than any other XenForo forum I've used).

    As a sanity check, I just compared with WebPageTest and it actually seems to support this. Ignoring the full page render time (because of all my ads), the time-to-first-byte on my forum home page is half that of Centminmod's (218ms vs 533ms), and my page's HTML was received in 257ms compared to CC's 581ms.

    Screen Shot 2018-03-20 at 1.16.16 PM.png

    Screen Shot 2018-03-20 at 1.16.33 PM.png

    This is surprising considering the vast difference that NewRelic/AppOptics shows in PHP application time.

    The only explanation I can think of is network RTT and proximity. My dedicated server is with ReliableSite in their NYC datacenter (I'm not using CloudFlare or a CDN). I'm not sure which Centminmod Community server either I or the Dulles, VA WebPageTest server would be connecting to.
     
  4. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
    The Centmin Mod forum is hosted in Fremont, CA, on the US West Coast, on the opposite side of the country from the Dulles, VA WebPageTest location, so you will see that distance in the WPT TTFB and other numbers. Try testing from a WPT location in California or elsewhere in the US West too :) Probably 60-65% of visitors to the forum come from Asia/US West.

    And for WPT and page load speed, the key metric is Speed Index = visual render time ;) You can have a fast TTFB and visitors can still experience a slower perceived page render :) See WebPerf - PageSpeed - How to use webpagetest.org for page load speed testing
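
    To compare two sites on equal footing, TTFB can also be measured from your own machine in addition to WPT. Here is a minimal Python sketch, assuming only the standard-library `http.client`. The function name `measure_ttfb_ms` and the User-Agent string are made up for illustration, and "first byte" is approximated by the moment the status line and headers have been parsed:

```python
import http.client
import time


def measure_ttfb_ms(host, path="/", port=443, use_tls=True, timeout=10.0):
    """Return (connect_ms, ttfb_ms) for a single GET request.

    connect_ms: time to resolve the host and open the TCP connection
                (plus the TLS handshake when use_tls is True).
    ttfb_ms:    time from the start of the connection attempt until the
                response status line and headers have been received.
    """
    conn_cls = http.client.HTTPSConnection if use_tls else http.client.HTTPConnection
    conn = conn_cls(host, port, timeout=timeout)
    start = time.perf_counter()
    try:
        conn.connect()  # DNS lookup, TCP handshake, TLS handshake for HTTPS
        connected = time.perf_counter()
        conn.request("GET", path, headers={"User-Agent": "ttfb-sketch"})
        response = conn.getresponse()  # returns once status line and headers arrive
        first_byte = time.perf_counter()
        response.read()  # drain the body before closing
    finally:
        conn.close()
    return (connected - start) * 1000.0, (first_byte - start) * 1000.0
```

    For example, `measure_ttfb_ms("example.com")` returns a `(connect_ms, ttfb_ms)` pair, where `example.com` is a placeholder host. What remains after subtracting the connect time is roughly one more network round trip plus server processing, so running this from a machine near each server gives a rough idea of how much of a TTFB gap is distance rather than PHP time.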
     
  5. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
  6. Meirami

    Meirami Member

    130
    17
    18
    Dec 21, 2017
    Ratings:
    +44
    Local Time:
    3:26 PM
    Sorry for the bad picture.
    This is from my Nextcloud VPS, which is only used by my family. One user uploaded 150 pictures with the mobile app and the application graph hit the roof. Should I be worried? So far everything looks OK except that graph.
    The Redis graph's lower line is 5 and the upper one is 10.
    The picture upload took about 1 minute, but the application graph stayed really high for about 40 minutes...

    coll.jpg oa2.jpg
     
  7. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
    Image uploads/downloads through PHP will cause high response times, which is normal.
     
    • Informative x 1
  8. Meirami

    Meirami Member

    130
    17
    18
    Dec 21, 2017
    Ratings:
    +44
    Local Time:
    3:26 PM
    Even so long after upload?
     
  9. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
    Depends on what the PHP web app does after uploading.
     
    • Like x 1
  10. rdan

    rdan Well-Known Member

    4,549
    1,090
    113
    May 25, 2014
    Ratings:
    +1,591
    Local Time:
    8:26 PM
    Mainline
    10.2
  11. deschlong

    deschlong New Member

    20
    5
    3
    Mar 1, 2018
    Ratings:
    +8
    Local Time:
    2:26 PM
    1.15
    10.1
    Hello,

    At the moment I am looking around for a good monitoring tool, mainly for CentOS in the first place, but also for some Debian-based servers.

    I was thinking about the ELK Stack or Graylog, but I don't know much about them, and I would like a centralized monitoring system that is also secure. I have the feeling that adding ELK / Graylog adds another layer of attack surface to my servers.

    So what do you guys use for monitoring multiple servers?

    Thanks!
     
  12. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
    • Like x 1
  13. deschlong

    deschlong New Member

    20
    5
    3
    Mar 1, 2018
    Ratings:
    +8
    Local Time:
    2:26 PM
    1.15
    10.1
    Aww damn.. Could you merge this thread here @eva2000 ?

    Btw, any opinions on ELK / Graylog? Especially when it comes to setting it up across multiple servers? It seems pretty messy compared to Amplify, for example (which btw looks pretty nice, I just configured it on one of my servers).
     
  14. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
    merged :)

    no opinions on ELK/Graylog - why complicate things :D
     
  15. deschlong

    deschlong New Member

    20
    5
    3
    Mar 1, 2018
    Ratings:
    +8
    Local Time:
    2:26 PM
    1.15
    10.1
    Thanks!

    Yeah, I don't know.. it seems overwhelming to me at the moment. Until now I have monitored by SSHing into the servers and checking the logs, processes, etc. So I was looking for something.. 'better'?!

    But all the tools you listed are doing a great job so far and they aren't even that expensive, Hetrix or Nixstats for example. They are easy to set up, with no reading around forums, Stack Overflow and Reddit about how to set up ELK/Graylog and then forward the logs to another server, which makes things PRETTY complicated, for me at least.
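
    For what it's worth, the manual routine of logging in and eyeballing load and disk usage can be scripted before reaching for a full log stack. Here is a minimal Python sketch, assuming a Linux host and only the standard library. The function names and the thresholds (1.5 load per CPU, 90% disk) are arbitrary illustrations, not recommendations from this thread:

```python
import os
import shutil


def read_metrics(path="/"):
    """Collect the 1-minute load average, CPU count and disk usage for `path`."""
    load1, _load5, _load15 = os.getloadavg()  # available on Unix-like systems only
    usage = shutil.disk_usage(path)
    return {
        "load1": load1,
        "cpus": os.cpu_count() or 1,
        "disk_used_pct": usage.used / usage.total * 100.0,
    }


def evaluate(metrics, load_limit_per_cpu=1.5, disk_limit_pct=90.0):
    """Return a list of warning strings; an empty list means all checks passed."""
    warnings = []
    load_per_cpu = metrics["load1"] / metrics["cpus"]
    if load_per_cpu > load_limit_per_cpu:
        warnings.append(f"load per CPU {load_per_cpu:.2f} > {load_limit_per_cpu}")
    if metrics["disk_used_pct"] > disk_limit_pct:
        warnings.append(
            f"disk usage {metrics['disk_used_pct']:.1f}% > {disk_limit_pct}%"
        )
    return warnings
```

    Run from cron, `evaluate(read_metrics())` could feed an email or webhook alert. Hosted tools like the ones mentioned in this thread add history, graphs and multi-server dashboards on top of this kind of check.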
     
  16. Meirami

    Meirami Member

    130
    17
    18
    Dec 21, 2017
    Ratings:
    +44
    Local Time:
    3:26 PM
    How good is Hetrix? Pros/cons?
     
  17. eva2000

    eva2000 Administrator Staff Member

    40,624
    9,016
    113
    May 24, 2014
    Brisbane, Australia
    Ratings:
    +13,883
    Local Time:
    10:26 PM
    Nginx 1.15.x
    MariaDB 5.5/10.x
    It's free to try so take it for a spin :)
     
    • Like x 1