====== Services ====== Technologies: nagios, prometheus basic nagios: * if everything is ok, return: * OK * else: * ERROR message * message should reflect the error and perhaps a solution strategy TODO: alert channel for slack ===== V3 LillySyncService =====
    * recipe: go to server and restart docker container “frontend_instance1” * check: http://index-prelive.picalike.corpex-kunden.de:15672/#/queues if host appears in rabbitmq * check: http://http://sg01.picalike.corpex-kunden.de:5002/by_service host should show OK after about 1 minute (or 2)
In case of errors, a notification is send to the slack alert channel with the prefix 'LillySyncService'. ===== feature extractor ===== ===== top viewed (JL) ===== * health endpoints: * frontend05-hpc:5001/service/health * frontend05-hpc:5002/health ===== top looks (JL) ===== * implemented * health endpoints: * frontend05-hpc:8012/health * frontend05-hpc:8013/health ===== also viewed (JL) ===== * frontend05-hpc:5004/service/health ===== get cat trends ===== * frontend05-hpc:5003/service/health ===== settings provider (JO) ===== ===== netcup postgresql (TS) ===== * a basic monitoring script that reports errors to the 'alert' channel is ready and deployed at dev01 (~/bin/v5_slack_monitoring.py) * implemented as https://git.picalike.corpex-kunden.de/-/snippets/12 ===== shop-conveyor-belt (BZ) ===== * is this service running * number of feeds per state * are all services in zookeeper * feed_import_fastapi * solr_updater * … ===== krawla2feed (HG) ===== * is this service running * implemented * how many products were found by the crawler * how many products were written into the feed ===== feed-import (BZ/JL) ===== * is this service running * how many feeds were imported * how many products were imported ===== osa-report-api (JO) =====
    * running?
      * implemented
    * pg up?
      * implemented
    * last updates ok?
===== solr_updater (*hopefully soon deprecated*) (BZ) =====
    * implemented
===== trend_analyzer (*soon deprecated?*) ===== ===== visualytics_notification_api (mostly for sending emails) ===== ===== similarity_api (fridtjof implementation) (TS) ===== git: https://git.picalike.corpex-kunden.de/picalike/similarity_api ===== get_trend_description_solr ===== ===== product_trend_calculator ===== ===== image-cloud (JO) ===== ===== onsight-analytics (python middleware to php frontend) (JO) ===== - is this service running? ===== osa-cluster (BZ) ===== ===== mapping-service (BZ) ===== ===== witt-reports ===== ===== Sketches API ===== health check via dev01 script is active