nginxinc · qdzlug · Dec 20, 2021 · Dec 9, 2021 · Dec 9, 2021 · Dec 9, 2021
diff --git a/pulumi/aws/README.md b/pulumi/aws/README.md
@@ -28,9 +28,8 @@ vpc - defines and installs the VPC and subnets to use with EKS
             └─logagent - deploys a logging agent (filebeat) to the EKS cluster 
               └─certmgr - deploys the open source cert-manager.io helm chart to the EKS cluster
                 └─prometheus - deploys prometheus server, node exporter, and statsd collector for metrics
-                  └─grafana - deploys the grafana visualization platform
-                    └─observability - deploys the OTEL operator and instantiates a simple collector
-                      └─sirius - deploys the Bank of Sirus application to the EKS cluster
+                  └─observability - deploys the OTEL operator and instantiates a simple collector
+                    └─sirius - deploys the Bank of Sirus application to the EKS cluster
 
 ```
 
@@ -146,15 +145,40 @@ deployment.
 ### Prometheus
 
 Prometheus is deployed and configured to enable the collection of metrics for all components that have
-properties `prometheus.io:scrape: true` set in the annotations
-(along with any other connection information). This includes the prometheus `node-exporter`
-daemonset which is deployed in this step as well.
+a defined service monitor. At installation time, the deployment will instantiate:
+- Node Exporters
+- Kubernetes Service Monitors
+- Grafana preloaded with dashboards and datasources for Kubernetes management
+- The NGINX Ingress Controller
+- Statsd receiver
+
+The former behavior of using the `prometheus.io:scrape: true` property set in the annotations
+indicating pods where metrics should be scraped has been deprecated, and these annotations will
+be removed in the near future.
+
+Also, the standalone Grafana deployment has been removed from the standard deployment scripts, but has been left as 
+a project in the event someone wishes to run this standalone.
+
+Finally, this namespace will hold service monitors created by other projects, for example the Bank of Sirius
+deployment currently deploys a service monitor for each of the postgres monitors that are deployed.
+
+Notes: 
+1. The NGINX IC needs to be configured to expose prometheus metrics; this is currently done by default.
+2. The default address binding of the `kube-proxy` component is set to `127.0.0.1` and as such will cause errors when the 
+canned prometheus scrape configurations are run. The fix is to set this address to `0.0.0.0`. An example manifest
+has been provided in [prometheus/extras](./prometheus/extras) that can be applied against your installation with 
+`kubectl apply -f ./filename`. Please only apply this change once you have verified that it will work with your 
+version of Kubernetes.
+3. The _grafana_ namespace has been maintained in the conifugration file to be used by the prometheus operator deployed
+version of Grafana. This version only accepts a password; you can still specify a username for the admin account but it 
+will be silently ignored.
 
-This also pulls data from the NGINX KIC, provided the KIC is configured to allow prometheus access (which is enabled by
-default).
 
 ### Grafana
 
+**NOTE:** This deployment has been deprecated but the project has been left as an example on how to deploy Grafana in this 
+architecture. 
+
 Grafana is deployed and configured with a connection to the prometheus datasource installed above. At the time of this
 writing, the NGINX Plus KIC dashboard is installed as part of the initial setup. Additional datasources and dashboards
 can be added by the user either in the code, or via the standard Grafana tooling.
@@ -188,7 +212,10 @@ As part of the Bank of Sirius deployment, we deploy a cluster-wide
 [self-signed](https://cert-manager.io/docs/configuration/selfsigned/)
 issuer using the cert-manager deployed above. This is then used by the Ingress object created to enable TLS access to
 the application. Note that this Issuer can be changed out by the user, for example to use the
-[ACME](https://cert-manager.io/docs/configuration/acme/) issuer.
+[ACME](https://cert-manager.io/docs/configuration/acme/) issuer. The use of the ACME issuer has been tested and works 
+without issues, provided the FQDN meets the length requirements. As of this writing the AWS ELB hostname is too long
+to work with the ACME server. Additional work in this area will be undertaken to provide dynamic DNS record creation
+as part of this process so legitimate certificates can be issued.
 
 In order to provide visibility into the Postgres databases that are running as part of the application, the Prometheus
 Postgres data exporter will be deployed into the same namespace as the application and will be configured to be scraped
@@ -204,4 +231,6 @@ provides better tools for hierarchical configuration files.
 
 In order to help enable simple load testing, a script has been provided that uses the
 `kubectl` command to port-forward monitoring and management connections to the local workstation. This command
-is [`test-foward.sh`](./extras/test-forward.sh) and is located in the [`extras`](./extras) directory. 
+is [`test-foward.sh`](./extras/test-forward.sh) and is located in the [`extras`](./extras) directory. 
+
+**NOTE:** This script has been modified to use the new Prometheus Operator based deployment.
diff --git a/pulumi/aws/config/Pulumi.stackname.yaml.example b/pulumi/aws/config/Pulumi.stackname.yaml.example
@@ -178,16 +178,6 @@ config:
   ############################################################################
 
   # Grafana Configuration
-  grafana:chart_name: grafana
-  # Chart name for the helm chart for grafana
-  grafana:chart_version: 6.13.7
-  # Chart version for the helm chart for grafana
-  grafana:helm_repo_name: grafana
-  # Name of the repo to pull the grafana chart from
-  grafana:helm_repo_url: https://grafana.github.io/helm-charts
-  # URL of the chart repo to pull grafana from
-  grafana:adminuser: admin
-  # The username for the grafana installation
   grafana:adminpass: strongpass
   # The password for the grafana installation; note that this is not exposed to the internet
   # and requires kubeproxy to access. However, this should be encrypted which is dependent on
@@ -197,7 +187,7 @@ config:
   ############################################################################
 
   # Prometheus Configuration
-  prometheus:chart_name: prometheus
+  prometheus:chart_name: kube-prometheus-stack
   # Chart name for the helm chart for prometheus
   prometheus:chart_version: 14.6.0
   # Chart version for the helm chart for prometheus

diff --git a/pulumi/aws/destroy.sh b/pulumi/aws/destroy.sh
@@ -91,7 +91,7 @@ if command -v aws > /dev/null; then
   validate_aws_credentials
 fi
 
-k8s_projects=(sirius observability grafana prometheus certmgr logagent logstore kic-helm-chart)
+k8s_projects=(sirius observability prometheus certmgr logagent logstore kic-helm-chart)
 
 # Test to see if EKS has been destroy AND there are still Kubernetes resources
 # that are being managed by Pulumi. If so, we have to destroy the stack for

diff --git a/pulumi/aws/extras/scripts/test-forward.sh b/pulumi/aws/extras/scripts/test-forward.sh
@@ -51,15 +51,15 @@ kubectl port-forward service/elastic-kibana --namespace logstore 5601:5601 &
 echo $! > $PID01
 
 ## Grafana Tunnel
-kubectl port-forward service/grafana --namespace grafana 3000:80 &
+kubectl port-forward service/prometheus-grafana --namespace prometheus 3000:80 &
 echo $! > $PID02
 
 ## Loadgenerator Tunnel
 kubectl port-forward service/loadgenerator --namespace bos 8089:8089 &
 echo $! > $PID03
 
 ## Prometheus Tunnel
-kubectl port-forward service/prometheus-server --namespace prometheus 9090:80 &
+kubectl port-forward service/prometheus-kube-prometheus-prometheus --namespace prometheus 9090:9090 &
 echo $! > $PID04
 
 ## Elasticsearch Tunnel

diff --git a/pulumi/aws/grafana/Pulumi.yaml b/pulumi/aws/grafana/Pulumi.yaml
diff --git a/pulumi/aws/grafana/__main__.py b/pulumi/aws/grafana/__main__.py
diff --git a/pulumi/aws/kic-helm-chart/__main__.py b/pulumi/aws/kic-helm-chart/__main__.py
@@ -51,7 +51,25 @@ def build_chart_values(repository: dict) -> helm.ChartOpts:
             'service': {
                 'annotations': {
                     'co.elastic.logs/module': 'nginx'
-                }
+                },
+                "extraLabels": {
+                    "app": "kic-nginx-ingress"
+                },
+                "customPorts": [
+                    {
+                        "name": "dashboard",
+                        "targetPort": 8080,
+                        "protocol": "TCP",
+                        "port": 8080
+                    },
+                    {
+                        "name": "prometheus",
+                        "targetPort": 9113,
+                        "protocol": "TCP",
+                        "port": 9113
+                    }
+                ]
+
             },
             'pod': {
                 'annotations': {
@@ -62,7 +80,10 @@ def build_chart_values(repository: dict) -> helm.ChartOpts:
         'prometheus': {
             'create': True,
             'port': 9113
-        }
+        },
+        "opentracing-tracer": "/usr/local/lib/libjaegertracing_plugin.so",
+        "opentracing-tracer-config": "{\n    \"service_name\": \"nginx-ingress\",\n    \"propagation_format\": \"w3c\",\n    \"sampler\": {\n        \"type\": \"const\",\n        \"param\": 1\n    },\n    \"reporter\": {\n        \"localAgentHostPort\": \"simplest-collector.observability.svc.cluster.local:9978\"\n    }\n}  \n",
+        "opentracing": True
     }
 
     has_image_tag = 'image_tag' in repository or 'image_tag_alias' in repository
@@ -109,7 +130,10 @@ def build_chart_values(repository: dict) -> helm.ChartOpts:
                             kubeconfig=kubeconfig)
 
 ns = k8s.core.v1.Namespace(resource_name='nginx-ingress',
-                           metadata={'name': 'nginx-ingress'},
+                           metadata={'name': 'nginx-ingress',
+                                    'labels': {
+                                        'prometheus': 'scrape' }
+                                     },
                            opts=pulumi.ResourceOptions(provider=k8s_provider))
 
 chart_values = ecr_repository.apply(build_chart_values)