10 Year Anniversary: www.jasonralph.org

April 18, 2022April 18, 2022 adminLeave a comment

I had not posted too much lately, lots of stuff going on with my work and personal life, my wife and I moved into a new house in 2022, and for work we have been grinding on a large migration. I looked at my blog this morning and noticed that I have had this spare time project running for 10 years.

So for 10 years I have had jasonralph.org up and continuously available, with analytics, it started in my apartment on an old IBM stand alone server, it now runs on a single Rocky Linux 8 VM from linode for 10 dollars a month. I hope to have some new content soon, but for now, I am happy for the 10 year anniversary.

AWS Apache Managed Airflow EMR ModuleNotFoundError: No module named ‘requests’ Bootstrap

November 2, 2021November 9, 2021 adminLeave a comment

I came across another fun one the other day, we are in the process of migrating our on premise elastic map reduce system into the cloud. We are using AWS EMR and have AWS Managed Airflow as the executor (DAG). We came across an odd situation with a pyspark application. When using Airflow with a SparkSubmitHook, the job would bootstrap looking just fine according to the run logs, however it would fail with No module named 'requests' when the application tried to import it. This was very odd since we have this application running from spark-submit just fine when calling it from the master node command line.

I decided to investigate the differences, our bootstrap script for installing python modules via pip which we call from the EMR API RunJobFlow call looks like this:

#!/bin/bash
pip_bin=pip3
${pip_bin} install --user -U pip
${pip_bin} install --user boto3
${pip_bin} install --user boto
${pip_bin} install --user requests
${pip_bin} install --user psycopg2-binary

#!/bin/bash

pip_bin=pip3

${pip_bin} install --user -U pip

${pip_bin} install --user boto3

${pip_bin} install --user boto

${pip_bin} install --user requests

${pip_bin} install --user psycopg2-binary

This is very basic, all it does is upgrade PIP and run PIP install to install each of the modules. When checking the bootstrap log I can see that PIP upgrades and goes out to the repo and installs the packages just fine. So why were we getting the No module named 'requests' error when executing through airflow. After a ton of googling and research I have found the issue and applied a solution that worked. Turns out airflow will run as the root user when bootstrapping, so if you notice we use the --user argument in pip. This will instruct the packages to be installed in the calling users home directory, the kicker is the code is run by the hadoop user on the EMR cluster nodes after executing from airflow. So turns out, the hadoop user is unable to access the requests module since root installed it with --user. I changed the bootstrap script to the following and it all started working, by removing --user and prefixing with sudo, the packages now get installed in a globally available area for all users. I am sure there are better ways to do this, I am still learning and researching, but if you run into this, the change below with get you out of the woods.

#!/bin/bash
sudo python3 -m pip install \
                        boto3 \
	                    boto \
		                requests \
                        psycopg2-binary

#!/bin/bash

sudo python3 -m pip install \

boto3 \

boto \

requests \

psycopg2-binary

After some further research, and testing we decided to utilize a requirements.txt file to be called by the bootstrap shell script in the RunJobFlow call, first create a requirements.txt file, I like to hardcode the versions so nothing changes unexpectedly as you bootstrap a new cluster and it reaches out to PyPy to get the packages.

https://docs.aws.amazon.com/emr/latest/APIReference/API_RunJobFlow.html

Add your desired packages and version numbers to a file called requirements.txt like below:

boto3==1.17.54
boto==2.49.0
requests==2.18.4
psycopg2-binary==2.8.6

boto3==1.17.54

boto==2.49.0

requests==2.18.4

psycopg2-binary==2.8.6

Then you will need to copy this file into a bucket you have access to:

aws s3 cp requirements.txt s3://YOUR_S3_BUCKET_NAME/requirements.txt

1	aws s3 cp requirements.txt s3://YOUR_S3_BUCKET_NAME/requirements.txt

Then create a shell script that has the following, call it bootstrap.sh:

#!/bin/bash

set -x 

echo '-----------RUNNING BOOTSTRAP------------------------'

echo '-----------COPYING REQUIREMENTS FILE LOCALLY--------'

aws s3 cp s3://YOUR_S3_BUCKET_NAME/requirements.txt .

echo '-----------INSTALLING REQUIREMENTS------------------'

sudo python3 -m pip install -r requirements.txt

echo '-----------DONE BOOTSTRAP---------------------------'

#!/bin/bash

set -x

echo '-----------RUNNING BOOTSTRAP------------------------'

echo '-----------COPYING REQUIREMENTS FILE LOCALLY--------'

aws s3 cp s3://YOUR_S3_BUCKET_NAME/requirements.txt .

echo '-----------INSTALLING REQUIREMENTS------------------'

sudo python3 -m pip install -r requirements.txt

echo '-----------DONE BOOTSTRAP---------------------------'

Copy that shell script to your bucket:

aws s3 cp bootstrap.sh s3://YOUR_S3_BUCKET_NAME/bootstrap.sh

1	aws s3 cp bootstrap.sh s3://YOUR_S3_BUCKET_NAME/bootstrap.sh

And execute it via the bootstrap actions in the RunJobFlow EMR API call:

"BootstrapActions": [
    {
      "Name": "string",
      "ScriptBootstrapAction": {
        "Path": "s3://YOUR_S3_BUCKET_NAME/bootstrap.sh"
      }
    }
  ],

"BootstrapActions": [

{

"Name": "string",

"ScriptBootstrapAction": {

"Path": "s3://YOUR_S3_BUCKET_NAME/bootstrap.sh"

}

As you can see the shell script will be executed which will copy the requirements.txt file locally and then run pip -r against it which will install all the packages. If you want to see the log on a running cluster, you can ssh to the master node and view the logs here to see the bootstrapping take place:

/emr/instance-controller/log/bootstrap-actions

1	/emr/instance-controller/log/bootstrap-actions

You should see the stdout log as so:

-----------RUNNING BOOTSTRAP------------------
-----------COPYING REQUIREMENTS FILE LOCALLY--------
Completed 67 Bytes/67 Bytes (629 Bytes/s) with 1 file(s) remaining
download: s3://YOUR_S3_BUCKET_NAME/requirements.txt to ./requirements.txt
-----------INSTALLING REQUIREMENTS------------------
Collecting boto==2.48.0
  Downloading boto-2.48.0-py2.py3-none-any.whl (1.4 MB)
Collecting boto3==1.6.15
  Downloading boto3-1.6.15-py2.py3-none-any.whl (128 kB)
Collecting requests==2.18.4
  Downloading requests-2.18.4-py2.py3-none-any.whl (88 kB)
Collecting psycopg2-binary==2.8.6
  Downloading psycopg2_binary-2.8.6-cp37-cp37m-manylinux1_x86_64.whl (3.0 MB)
Collecting botocore<1.10.0,>=1.9.15
  Downloading botocore-1.9.23-py2.py3-none-any.whl (4.1 MB)
Collecting s3transfer<0.2.0,>=0.1.10
  Downloading s3transfer-0.1.13-py2.py3-none-any.whl (59 kB)
Requirement already satisfied: jmespath<1.0.0,>=0.7.1 in /usr/local/lib/python3.7/site-packages (from boto3==1.6.15->-r jason_requirements.txt (line 2)) (0.10.0)
Collecting urllib3<1.23,>=1.21.1
  Downloading urllib3-1.22-py2.py3-none-any.whl (132 kB)
Collecting certifi>=2017.4.17
  Downloading certifi-2021.10.8-py2.py3-none-any.whl (149 kB)
Collecting idna<2.7,>=2.5
  Downloading idna-2.6-py2.py3-none-any.whl (56 kB)
Collecting chardet<3.1.0,>=3.0.2
  Downloading chardet-3.0.4-py2.py3-none-any.whl (133 kB)
Requirement already satisfied: docutils>=0.10 in /usr/lib/python3.7/site-packages (from botocore<1.10.0,>=1.9.15->boto3==1.6.15->-r jason_requirements.txt (line 2)) (0.14)
Collecting python-dateutil<2.7.0,>=2.1
  Downloading python_dateutil-2.6.1-py2.py3-none-any.whl (194 kB)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.7/site-packages (from python-dateutil<2.7.0,>=2.1->botocore<1.10.0,>=1.9.15->boto3==1.6.15->-r jason_requirements.txt (line 2)) (1.13.0)
Installing collected packages: boto, python-dateutil, botocore, s3transfer, boto3, urllib3, certifi, idna, chardet, requests, psycopg2-binary
  Attempting uninstall: boto
    Found existing installation: boto 2.49.0
    Uninstalling boto-2.49.0:
      Successfully uninstalled boto-2.49.0
Successfully installed boto-2.48.0 boto3-1.6.15 botocore-1.9.23 certifi-2021.10.8 chardet-3.0.4 idna-2.6 psycopg2-binary-2.8.6 python-dateutil-2.6.1 requests-2.18.4 s3transfer-0.1.13 urllib3-1.22
-----------DONE BOOTSTRAP---------------------

-----------RUNNING BOOTSTRAP------------------

-----------COPYING REQUIREMENTS FILE LOCALLY--------

Completed 67 Bytes/67 Bytes (629 Bytes/s) with 1 file(s) remaining

download: s3://YOUR_S3_BUCKET_NAME/requirements.txt to ./requirements.txt

-----------INSTALLING REQUIREMENTS------------------

Collecting boto==2.48.0

Downloading boto-2.48.0-py2.py3-none-any.whl (1.4 MB)

Collecting boto3==1.6.15

Downloading boto3-1.6.15-py2.py3-none-any.whl (128 kB)

Collecting requests==2.18.4

Downloading requests-2.18.4-py2.py3-none-any.whl (88 kB)

Collecting psycopg2-binary==2.8.6

Downloading psycopg2_binary-2.8.6-cp37-cp37m-manylinux1_x86_64.whl (3.0 MB)

Collecting botocore<1.10.0,>=1.9.15

Downloading botocore-1.9.23-py2.py3-none-any.whl (4.1 MB)

Collecting s3transfer<0.2.0,>=0.1.10

Downloading s3transfer-0.1.13-py2.py3-none-any.whl (59 kB)

Requirement already satisfied: jmespath<1.0.0,>=0.7.1 in /usr/local/lib/python3.7/site-packages (from boto3==1.6.15->-r jason_requirements.txt (line 2)) (0.10.0)

Collecting urllib3<1.23,>=1.21.1

Downloading urllib3-1.22-py2.py3-none-any.whl (132 kB)

Collecting certifi>=2017.4.17

Downloading certifi-2021.10.8-py2.py3-none-any.whl (149 kB)

Collecting idna<2.7,>=2.5

Downloading idna-2.6-py2.py3-none-any.whl (56 kB)

Collecting chardet<3.1.0,>=3.0.2

Downloading chardet-3.0.4-py2.py3-none-any.whl (133 kB)

Requirement already satisfied: docutils>=0.10 in /usr/lib/python3.7/site-packages (from botocore<1.10.0,>=1.9.15->boto3==1.6.15->-r jason_requirements.txt (line 2)) (0.14)

Collecting python-dateutil<2.7.0,>=2.1

Downloading python_dateutil-2.6.1-py2.py3-none-any.whl (194 kB)

Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.7/site-packages (from python-dateutil<2.7.0,>=2.1->botocore<1.10.0,>=1.9.15->boto3==1.6.15->-r jason_requirements.txt (line 2)) (1.13.0)

Installing collected packages: boto, python-dateutil, botocore, s3transfer, boto3, urllib3, certifi, idna, chardet, requests, psycopg2-binary

Attempting uninstall: boto

Found existing installation: boto 2.49.0

Uninstalling boto-2.49.0:

Successfully uninstalled boto-2.49.0

Successfully installed boto-2.48.0 boto3-1.6.15 botocore-1.9.23 certifi-2021.10.8 chardet-3.0.4 idna-2.6 psycopg2-binary-2.8.6 python-dateutil-2.6.1 requests-2.18.4 s3transfer-0.1.13 urllib3-1.22

-----------DONE BOOTSTRAP---------------------

Hope this helps.

Node Application Stopped Sending Updates To Slack – can’t identify protocol

June 24, 2021 adminLeave a comment

I wanted to share my experience with a node application that I support. This particular application is an API, it happens to log each and every request it receives to a internal slack channel. Our team uses this channel for many things, to verify when the API is in maintenance, to check that requests are processing, to see status on the overall health of the API etc..

Once in a while out of nowhere we would stop receiving these updates to slack. I set out to troubleshoot why this may be happening, at first we thought that we were hitting the slack rate limits, which is clearly defined here:

https://api.slack.com/docs/rate-limits

However after reading the linked doc, I was skeptical. The API does serve a lot of requests, but not enough to hit their limit. We have 2 servers that send slack messages and process the API requests and when they stopped sending it would be both servers, not just one. Also we have run into this before and restarting the service fixed the issue, so I was sure we did not hit the rate limit. Also trying to send a manual slack update using curl would not work! I knew this had to be something with the linux OS itself, and not the Slack service.

I tried to use netstat to see if we were hitting some type of OS limit, and all looked well. Next I tried one of my favorite tools, LSOF, at first I grepped for deleted to see if something was being held and not released. I did not see anything that stood out, next I grepped for node and low and behold I saw this:

[root@ip-172-x-x-x ~]# lsof | grep node
--SNIP--
node       1794 nodeuser   19u     sock                0,6       0t0     651101 can't identify protocol
node       1794 nodeuser   20w      REG              202,1 209793922     294970 /opt/afs/mc_api_logs/debug.log
node       1794 nodeuser   21w      REG              202,1   2409554     274199 /opt/afs/mc_api_logs/exceptions.log
node       1794 nodeuser   22w      REG              202,1    572278     294971 /opt/afs/mc_api_logs/error.log
node       1794 nodeuser   23w      REG              202,1   2409554     274199 /opt/afs/mc_api_logs/exceptions.log
node       1794 nodeuser   24w      REG              202,1   2258649     294980 /opt/afs/mc_api_logs/warn.log
node       1794 nodeuser   25w      REG              202,1   2409554     274199 /opt/afs/mc_api_logs/exceptions.log
node       1794 nodeuser   26w      REG              202,1         0     294989 /opt/afs/mc_api_logs/info.log
node       1794 nodeuser   27w      REG              202,1   2409554     274199 /opt/afs/mc_api_logs/exceptions.log
node       1794 nodeuser   28u     IPv4              13731       0t0        TCP *:pcsync-https (LISTEN)
node       1794 nodeuser   29u     sock                0,6       0t0     512828 can't identify protocol
node       1794 nodeuser   30u     sock                0,6       0t0      14507 can't identify protocol
node       1794 nodeuser   31u     sock                0,6       0t0      14028 can't identify protocol
node       1794 nodeuser   32u     sock                0,6       0t0      15183 can't identify protocol
node       1794 nodeuser   33u     sock                0,6       0t0      15628 can't identify protocol
node       1794 nodeuser   34u     sock                0,6       0t0      16346 can't identify protocol
node       1794 nodeuser   35u     sock                0,6       0t0      15778 can't identify protocol
node       1794 nodeuser   36u     sock                0,6       0t0      16847 can't identify protocol
node       1794 nodeuser   37u     sock                0,6       0t0      17512 can't identify protocol
node       1794 nodeuser   38u     sock                0,6       0t0      25572 can't identify protocol
node       1794 nodeuser   39u     sock                0,6       0t0      18437 can't identify protocol
--SNIP--

[root@ip-172-x-x-x ~]# lsof | grep node

--SNIP--

node 1794 nodeuser 19u sock 0,6 0t0 651101 can't identify protocol

node 1794 nodeuser 20w REG 202,1 209793922 294970 /opt/afs/mc_api_logs/debug.log

node 1794 nodeuser 21w REG 202,1 2409554 274199 /opt/afs/mc_api_logs/exceptions.log

node 1794 nodeuser 22w REG 202,1 572278 294971 /opt/afs/mc_api_logs/error.log

node 1794 nodeuser 23w REG 202,1 2409554 274199 /opt/afs/mc_api_logs/exceptions.log

node 1794 nodeuser 24w REG 202,1 2258649 294980 /opt/afs/mc_api_logs/warn.log

node 1794 nodeuser 25w REG 202,1 2409554 274199 /opt/afs/mc_api_logs/exceptions.log

node 1794 nodeuser 26w REG 202,1 0 294989 /opt/afs/mc_api_logs/info.log

node 1794 nodeuser 27w REG 202,1 2409554 274199 /opt/afs/mc_api_logs/exceptions.log

node 1794 nodeuser 28u IPv4 13731 0t0 TCP *:pcsync-https (LISTEN)

node 1794 nodeuser 29u sock 0,6 0t0 512828 can't identify protocol

node 1794 nodeuser 30u sock 0,6 0t0 14507 can't identify protocol

node 1794 nodeuser 31u sock 0,6 0t0 14028 can't identify protocol

node 1794 nodeuser 32u sock 0,6 0t0 15183 can't identify protocol

node 1794 nodeuser 33u sock 0,6 0t0 15628 can't identify protocol

node 1794 nodeuser 34u sock 0,6 0t0 16346 can't identify protocol

node 1794 nodeuser 35u sock 0,6 0t0 15778 can't identify protocol

node 1794 nodeuser 36u sock 0,6 0t0 16847 can't identify protocol

node 1794 nodeuser 37u sock 0,6 0t0 17512 can't identify protocol

node 1794 nodeuser 38u sock 0,6 0t0 25572 can't identify protocol

node 1794 nodeuser 39u sock 0,6 0t0 18437 can't identify protocol

--SNIP--

My eyes went right to the “can’t identify protocol”, I opened up a browser and started to research, first hit when searching “can’t identify protocol” was a stack overflow article with the solution.

https://stackoverflow.com/questions/7911840/seeing-too-many-lsof-cant-identify-protocol

When lsof prints “Can’t identify protocol”, this usually relates to sockets (it should also say ‘sock’ in the relevant output lines).

So, somewhere in your code you are probably connecting sockets and not closing them properly (perhaps you need a finally block).

I suggest you step through your code with a debugger (easiest to use your IDE, potentially with a remote debugger, if necesssary), while running lsof side-by-side. You should eventually be able to see which thread / line of code is creating these File Descriptors.

Turns out that the node application was opening file descriptors / sockets and not closing them properly, this caused the system to hit the hard limit on open files / file descriptors. You can view the hard and soft limit like so, switch to the user that application is running as and run:

[nodeuser@ip-172-x-x-x ~]$ ulimit -Hn
4096
[nodeuser@ip-172-x-x-x ~]$ ulimit -Sn
1024

[nodeuser@ip-172-x-x-x ~]$ ulimit -Hn

4096

[nodeuser@ip-172-x-x-x ~]$ ulimit -Sn

1024

So you can see that the nodeuser has a hard limit of 4096 open files, which due to the application not properly closing them, we hit the ceiling. This explains why restarting the server or the process fixed it. It would release the open file descriptors and the system was able to open sockets again. I spoke with the developer and we researched, looks like one of the modules we were using was the cause of the issue, perhaps we were using it wrong? I found this out from this article:
https://stackoverflow.com/questions/24922745/node-js-winston-how-to-safely-drain-a-logger

Question:

I have experimented with instantiating and closing winston loggers as (half) described on https://github.com/flatiron/winston#instantiating-your-own-logger, to no avail. I run into trouble closing file transports of Winston’s – walking through it’s source code, I found that the proper way to close off a logger would seem to be the close method. I expected this to take care of closing the transport file used by the logger – however that turned out to be not so.

Varying in frequency according to node.js server load, winston would still hold on to many transport files, infinitely long after the close method had been called for them, indefinitely long after no new writes were being initiated to them. I observed that through the node.js process file descriptors table (lsof -p). Even though close has been called for a Winston logger, it would indefinitely keep the file descriptor of the log file “in use”, i.e. the log file never gets really closed. Thus leaking file descriptors and eventually making the node.js process bump into the ulimit (-n) limit after my application has been up for long.

Should there be a specific programming pattern for draining a Winston logger such that it can be eventually closed?

Answer:

Create only one logger instance and then derive children from it. In this case, winston will hold only one open file handler. Might also be better for performance.

So that was it, the developers agreed and set out to create a patch, problem solved.

centos8 postgresql-11-check-db-dir[]: is missing or empty

October 30, 2020November 18, 2020 adminLeave a comment

We have been rolling out CENTOS8 in our lower environments for testing, we use a dedicated vmware virtual server with centos8 minimal install, we only apply hardening techniques to these systems other than the main application, which is pg11 here. These systems use a LVM mounted ext4 filesystem for the data directory.

/dev/mapper/vg01-data1				/u02/data1		ext4    defaults, nofail		0 2

1	/dev/mapper/vg01-data1 /u02/data1 ext4 defaults, nofail 0 2

Recently on 3 of the new PG VMS after reboot we noticed that PG did not start, this also seemed intermittent, even though we have enabled the systemd service to start on reboots. So I checked the pg startup log and did not find too much about the issue. So I checked /var/log/messages and found the issue.

postgresql-11-check-db-dir[1038]: "/u02/data1/pg/data11" is missing or empty.

1	postgresql-11-check-db-dir[1038]: "/u02/data1/pg/data11" is missing or empty.

I checked the systemd service file and saw that out of the box postgres had the following:

[Unit]
Description=PostgreSQL 11 database server
Documentation=https://www.postgresql.org/docs/11/static/
After=syslog.target
After=network.target

[Install]
WantedBy=multi-user.target

[Unit]

Description=PostgreSQL 11 database server

Documentation=https://www.postgresql.org/docs/11/static/

After=syslog.target

After=network.target

[Install]

WantedBy=multi-user.target

After=Syslog.target This is a special target unit in systemd and is the standardized name to pull in a syslog implementation.

After=network.target has very little meaning during start-up. It only indicates that the network management stack is up after it has been reached. Whether any network interfaces are already configured when it is reached is undefined.

WantedBy=multi-user.target normally defines a system state where all network services are started up and the system will accept logins, but a local GUI is not started. This is the typical default system state for server systems, which might be rack-mounted headless systems in a remote server room.

Those options above will not ensure that all filesystems in fstab are mounted before postgres starts. So what we were seeing was a classic race condition where postgres started before the data directory was mounted. As I previously mentioned we use a custom PGDATA location. So after some research I found my option that fixed this. You will need to edit the pg11 service and add the following, then reload systemd and reboot and all should work. You can find your LVM mount by running the following:

[root@server ~]# systemctl list-units --type=mount
UNIT                          LOAD   ACTIVE SUB     DESCRIPTION                     
-.mount                       loaded active mounted Root Mount                      
boot-efi.mount                loaded active mounted /boot/efi                       
boot.mount                    loaded active mounted /boot                           
dev-hugepages.mount           loaded active mounted Huge Pages File System          
dev-mqueue.mount              loaded active mounted POSIX Message Queue File System 
run-user-1328029883.mount     loaded active mounted /run/user/1328029883            
sys-fs-fuse-connections.mount loaded active mounted FUSE Control File System        
sys-kernel-config.mount       loaded active mounted Kernel Configuration File System
sys-kernel-debug.mount        loaded active mounted Kernel Debug File System        
u02-data1.mount               loaded active mounted /u02/data1                      

LOAD   = Reflects whether the unit definition was properly loaded.
ACTIVE = The high-level unit activation state, i.e. generalization of SUB.
SUB    = The low-level unit activation state, values depend on unit type.

10 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.

[root@server ~]# systemctl list-units --type=mount

UNIT LOAD ACTIVE SUB DESCRIPTION

-.mount loaded active mounted Root Mount

boot-efi.mount loaded active mounted /boot/efi

boot.mount loaded active mounted /boot

dev-hugepages.mount loaded active mounted Huge Pages File System

dev-mqueue.mount loaded active mounted POSIX Message Queue File System

run-user-1328029883.mount loaded active mounted /run/user/1328029883

sys-fs-fuse-connections.mount loaded active mounted FUSE Control File System

sys-kernel-config.mount loaded active mounted Kernel Configuration File System

sys-kernel-debug.mount loaded active mounted Kernel Debug File System

u02-data1.mount loaded active mounted /u02/data1

LOAD = Reflects whether the unit definition was properly loaded.

ACTIVE = The high-level unit activation state, i.e. generalization of SUB.

SUB = The low-level unit activation state, values depend on unit type.

10 loaded units listed. Pass --all to see loaded but inactive units, too.

To show all installed unit files use 'systemctl list-unit-files'.

You can see my u02-data1.mount in the output, so edit and add the override file with the following, if you have multiple mounts, you can add them as well.
Edit with: systemctl edit postgresql-11

[Unit]
After=local-fs.target u02-data1.mount

[Service]
Environment=PGDATA=/u02/data1/pg/data11

[Unit]

After=local-fs.target u02-data1.mount

[Service]

Environment=PGDATA=/u02/data1/pg/data11

Reload the daemon with: systemctl daemon-reload

After=local-fs.target systemd-fstab-generator(3) automatically adds dependencies of type Before= to all mount units that refer to local mount points for this target unit. In addition, it adds dependencies of type Wants= to this target unit for those mounts listed in /etc/fstab that have the auto mount option set.

AWS CLI Max Concurrent Requests Tuning

January 3, 2020November 11, 2021 admin4 Comments

In this post I would like to go over how I tuned a test server for copying / syncing files from the local filesystem to S3 over the internet. If you ever had the task of doing this, you will notice that as the file count grows, so does the time it takes to upload the files to S3. After some web searching I found out that AWS allows you to tune the config to allow more concurrency than default.
AWS CLI S3 Config

The parameter that we will be playing with is max_concurrent_requests
This has a default value of 10, which allows only 10 requests to the AWS API for S3. Lets see if we can make some changes to that value and get some performance gains. My test setup is as follows:

2 x Intel(R) Xeon(R) CPU E5-2699 v3 @ 2.30GHz
8GB RAM
CentOS release 6.10 (Final)

2 x Intel(R) Xeon(R) CPU E5-2699 v3 @ 2.30GHz

8GB RAM

CentOS release 6.10 (Final)

I have 56 102MB files in the test directory:

-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_7.csv.gz
-rw-r--r-- 1 jasonr domain^users 102M Sep 24 11:44 sample__0_0_53.csv.gz
-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_6.csv.gz
-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_8.csv.gz
-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_55.csv.gz
--snip--
[jasonr@jr-sandbox jason_test]$ ls| wc -l
56

-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_7.csv.gz

-rw-r--r-- 1 jasonr domain^users 102M Sep 24 11:44 sample__0_0_53.csv.gz

-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_6.csv.gz

-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_8.csv.gz

-rw-r--r-- 1 jasonr domain^users 101M Sep 24 11:44 sample__0_0_55.csv.gz

--snip--

[jasonr@jr-sandbox jason_test]$ ls| wc -l

For the first test I am going to run aws s3 sync with no changes, so out of the box it should have 10 max_concurrent_requests. Lets use the Linux time command to gather the time result to copy all 56 files to S3. I will delete the folder on S3 with each iteration to keep the test the same. You can also view the 443 requests via netstat and count them as well to show whats going on. In all the tests my best result was 250. So as you can see you will need to play with the settings to get the best result, these settings will change along with the server specs.

1. 1m25.919s with the default configuration:

[jasonr@jr-sandbox jason_test]$ time aws s3 sync . s3://dev-redshift/jason_sync_test/
upload: ./sample__0_0_0.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_0.csv.gz
upload: ./sample__0_0_10.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_10.csv.gz
upload: ./sample__0_0_11.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_11.csv.gz
upload: ./sample__0_0_12.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_12.csv.gz
upload: ./sample__0_0_13.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_13.csv.gz
--snip--

real	1m25.919s
user	0m35.153s
sys	0m15.879s

[jasonr@jr-sandbox jason_test]$ time aws s3 sync . s3://dev-redshift/jason_sync_test/

upload: ./sample__0_0_0.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_0.csv.gz

upload: ./sample__0_0_10.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_10.csv.gz

upload: ./sample__0_0_11.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_11.csv.gz

upload: ./sample__0_0_12.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_12.csv.gz

upload: ./sample__0_0_13.csv.gz to s3://dev-redshift/jason_sync_test/sample__0_0_13.csv.gz

--snip--

real 1m25.919s

user 0m35.153s

sys 0m15.879s

2. Now lets set the max conqurent requests to 20 and try again, you can do this with the command below, after running we can see a little gain.

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 20
[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config 
[default]
s3 =
    max_concurrent_requests = 20
[root@jr-sandbox ~]# netstat -an| grep 443| wc -l
20

real	1m13.277s
user	0m36.186s
sys	0m16.462s

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 20

[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config

[default]

s3 =

max_concurrent_requests = 20

[root@jr-sandbox ~]# netstat -an| grep 443| wc -l

real 1m13.277s

user 0m36.186s

sys 0m16.462s

3. Bumped up to 50 shows a bit more gain:

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 50
[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config 
[default]
s3 =
    max_concurrent_requests = 50

[root@jr-sandbox ~]# netstat -an| grep 443| wc -l
49
real	1m0.720s
user	0m37.669s
sys	0m19.344s

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 50

[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config

[default]

s3 =

max_concurrent_requests = 50

[root@jr-sandbox ~]# netstat -an| grep 443| wc -l

real 1m0.720s

user 0m37.669s

sys 0m19.344s

4. Bumped up to 100, I start to notice that we lost some speed:

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 100
[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config 
[default]
s3 =
    max_concurrent_requests = 100
[root@jr-sandbox ~]# netstat -an| grep 443| wc -l
95
real	1m4.212s
user	0m39.737s
sys	0m21.847s

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 100

[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config

[default]

s3 =

max_concurrent_requests = 100

[root@jr-sandbox ~]# netstat -an| grep 443| wc -l

real 1m4.212s

user 0m39.737s

sys 0m21.847s

5. Bumped up to 250 we see the best result so far:

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 250
[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config 
[default]
s3 =
    max_concurrent_requests = 250
[root@jr-sandbox ~]# netstat -an| grep 443| wc -l
234
real	0m55.036s
user	0m42.841s
sys	0m21.409s

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 250

[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config

[default]

s3 =

max_concurrent_requests = 250

[root@jr-sandbox ~]# netstat -an| grep 443| wc -l

234

real 0m55.036s

user 0m42.841s

sys 0m21.409s

6. Bumped up to 500, we lose performance, most likely due to the machine resources.

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 500
[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config 
[default]
s3 =
    max_concurrent_requests = 500
[root@jr-sandbox ~]# netstat -an| grep 443| wc -l
465
real	1m16.593s
user	0m50.336s
sys	0m25.806s

[jasonr@jr-sandbox jason_test]$ aws configure set default.s3.max_concurrent_requests 500

[jasonr@jr-sandbox jason_test]$ cat ~/.aws/config

[default]

s3 =

max_concurrent_requests = 500

[root@jr-sandbox ~]# netstat -an| grep 443| wc -l

465

real 1m16.593s

user 0m50.336s

sys 0m25.806s

So to wrap up, you can tune the amount of concurrent requests allowed from the aws cli to s3, you will need to play with this setting to get the best results for your machine.

Postgres Long Running Active Queries Send To Slack

June 27, 2019June 27, 2019 admin2 Comments

I needed a utility to alert our team when any long running queries were running on a production postgres cluster. I came up with the following python code that achieves just that. This would alert slack if an active query exceeds 45 mins. The script takes in user parameters as well, I will demonstrate the way to call it. Hope it helps someone.

CRON CALL:

### postgres long running query check ###
*/15 * * * * /usr/bin/python2.7 /home/postgres/bin/pg_long_running_query.py --database proddb --dbhost proddb01 --user postgres --alert_time_mins 45 >> /home/postgres/pg_long_running_query.log 2>&1

1 2	### postgres long running query check ### /15 * * * /usr/bin/python2.7 /home/postgres/bin/pg_long_running_query.py --database proddb --dbhost proddb01 --user postgres --alert_time_mins 45 >> /home/postgres/pg_long_running_query.log 2>&1

CODE:

#!/usr/bin/python2.7

__author__ = "Jason Ralph"


import psycopg2
import psycopg2.extras
import argparse
import urllib


def send_message_to_slack(text):
    import requests
    import json

    webhook_url = 'https://hooks.slack.com/services/--redacted--'
    slack_data = {'text': "%s" % text}

    response = requests.post(
        webhook_url, data=json.dumps(slack_data),
        headers={'Content-Type': 'application/json'}
    )
    if response.status_code != 200:
        raise ValueError(
            'Request to slack returned an error %s, the response is:\n%s'
            % (response.status_code, response.text)
    )


def get_long_running_queries():
    parser = argparse.ArgumentParser(description='Check long Running '
                                                 'Queries On Postgres '
                                                 'Databases And Alert')
    parser.add_argument('--database', help='target database')
    parser.add_argument('--dbhost', help='target dbhost')
    parser.add_argument('--user', help='database user')
    parser.add_argument('--alert_time_mins', help='alert time in mins: e.g 30')
    args = parser.parse_args()

    conn = psycopg2.connect("dbname='%s' host='%s' user='%s' port=5432" 
                            % (args.database, args.dbhost, args.user))

    sql = ("""SELECT pid, usename,
              now() - pg_stat_activity.query_start AS duration,
              query, state FROM pg_stat_activity 
              WHERE (now() - pg_stat_activity.query_start) > interval
               '"%s" minutes';""") % args.alert_time_mins

    cursor = conn.cursor(cursor_factory=psycopg2.extras.DictCursor)
    cursor.execute(sql)
    count = 0
    while True:
        row = cursor.fetchone()
        if row is None:
            break
        if row['usename'] == 'postgres':
            continue
        if row['state'] == 'idle':
            continue
        count += 1
        pid = row['pid']
        user = row['usename']
        duration = row['duration']
        query = row['query']
        state = row['state']
        msg_items = ['LONG RUNNING QUERY ON HOST: %s\n'
                     % args.dbhost, 'PID: %s\n' % pid,
                     'DURATION: %s\n' % duration,
                     'QUERY: %s\n' % query,
                     'STATE: %s\n' % state,
                     'USER: %s\n' % user,
                     'COUNT: %s\n' % count]                                                      
        msg = ''.join(msg_items)
        send_message_to_slack(msg)
    conn.close()

def main():
    get_long_running_queries()

if __name__ == '__main__':
    main()

#!/usr/bin/python2.7

__author__ = "Jason Ralph"

import psycopg2

import psycopg2.extras

import argparse

import urllib

def send_message_to_slack(text):

import requests

import json

webhook_url = 'https://hooks.slack.com/services/--redacted--'

slack_data = {'text': "%s" % text}

response = requests.post(

webhook_url, data=json.dumps(slack_data),

headers={'Content-Type': 'application/json'}

)

if response.status_code != 200:

raise ValueError(

'Request to slack returned an error %s, the response is:\n%s'

% (response.status_code, response.text)

)

def get_long_running_queries():

parser = argparse.ArgumentParser(description='Check long Running '

'Queries On Postgres '

'Databases And Alert')

parser.add_argument('--database', help='target database')

parser.add_argument('--dbhost', help='target dbhost')

parser.add_argument('--user', help='database user')

parser.add_argument('--alert_time_mins', help='alert time in mins: e.g 30')

args = parser.parse_args()

conn = psycopg2.connect("dbname='%s' host='%s' user='%s' port=5432"

% (args.database, args.dbhost, args.user))

sql = ("""SELECT pid, usename,

now() - pg_stat_activity.query_start AS duration,

query, state FROM pg_stat_activity

WHERE (now() - pg_stat_activity.query_start) > interval

'"%s" minutes';""") % args.alert_time_mins

cursor = conn.cursor(cursor_factory=psycopg2.extras.DictCursor)

cursor.execute(sql)

count = 0

while True:

row = cursor.fetchone()

if row is None:

break

if row['usename'] == 'postgres':

continue

if row['state'] == 'idle':

continue

count += 1

pid = row['pid']

user = row['usename']

duration = row['duration']

query = row['query']

state = row['state']

msg_items = ['LONG RUNNING QUERY ON HOST: %s\n'

% args.dbhost, 'PID: %s\n' % pid,

'DURATION: %s\n' % duration,

'QUERY: %s\n' % query,

'STATE: %s\n' % state,

'USER: %s\n' % user,

'COUNT: %s\n' % count]

msg = ''.join(msg_items)

send_message_to_slack(msg)

conn.close()

def main():

get_long_running_queries()

if __name__ == '__main__':

main()

SLACK MESSAGE:

LONG RUNNING QUERY ON HOST: proddb01
PID: 30270
DURATION: 0:55:02.748624
QUERY: SELECT --redacted--
STATE: active
USER: dbuser
COUNT: 1

LONG RUNNING QUERY ON HOST: proddb01

PID: 30270

DURATION: 0:55:02.748624

QUERY: SELECT --redacted--

STATE: active

USER: dbuser

COUNT: 1

Python Function Execute Subprocess With Timeout

June 23, 2019October 15, 2020 adminLeave a comment

I have a project that rsync’s data from an RPM repository for a local version of this repo. The issue I was faced with was the remote mirror would sometimes stop the rsync due to overloaded network or other unforeseen issues. I wanted to use rsyncs hashing algorithm to have it start right where it left off so I wrote a function to do this. If 900 seconds was hit it usually meant there was an issue with the transfer. I also want to state here that I observed the rsync stop serving issue on many mirrors so it was not just an issue with the TCP network. I use this in production and it logs each iteration or restart. The function below will also kill the current rsync so multiple copies are not running at the same time. I also only wanted to perform 5 iterations of rsync upon error or timeout so I use a while loop here.

Here are the individual rsync commands in the INI configuration.

[rsync_cmds]
rsync01 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/
rsync02 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/
rsync03 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/centosplus/x86_64/ 7/centosplus/x86_64/
rsync04 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/extras/x86_64/ 7/extras/x86_64

[rsync_cmds]

rsync01 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/

rsync02 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/

rsync03 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/centosplus/x86_64/ 7/centosplus/x86_64/

rsync04 = /usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/extras/x86_64/ 7/extras/x86_64

Here is how I call the execute_jobs_timeout() function:

rsync_commands = dict(config.items('rsync_cmds'))
def rsync_data():
    for name, cmds in sorted(rsync_commands.items()):
        execute_jobs_timeout(cmds)

rsync_commands = dict(config.items('rsync_cmds'))

def rsync_data():

for name, cmds in sorted(rsync_commands.items()):

execute_jobs_timeout(cmds)

The function:

def execute_jobs_timeout(cmd):
    iteration = 0
    while iteration < 5:
        proc = subprocess.Popen(shlex.split(cmd),
                                start_new_session=True)
        try:
            logger.info('Start Command: [%s]' % sanitize(cmd))
            stdout_data, stderr_data = proc.communicate(timeout=900)
            if proc.returncode != 0:
                logger.critical(
                    "%r failed, status code %s stdout %r stderr %r" % (
                        sanitize(cmd), proc.returncode,
                        stdout_data, stderr_data))
                iteration += 1
                if iteration == 5:
                    logger.critical('Execute Jobs Failed After 5 Iterations.')
                    break
                continue
            logger.info('Success: [%s]' % sanitize(cmd))
            break
        except (subprocess.TimeoutExpired, subprocess.SubprocessError) as e:
            os.killpg(os.getpgid(proc.pid), signal.SIGKILL)
            logger.warning('[%s]' % e)
            logger.info('Restarting [%s]' % sanitize(cmd))
            iteration += 1
            if iteration == 5:
                logger.critical('Execute Jobs Failed After 5 Iterations.')
                break
            continue

def execute_jobs_timeout(cmd):

iteration = 0

while iteration < 5:

proc = subprocess.Popen(shlex.split(cmd),

start_new_session=True)

try:

logger.info('Start Command: [%s]' % sanitize(cmd))

stdout_data, stderr_data = proc.communicate(timeout=900)

if proc.returncode != 0:

logger.critical(

"%r failed, status code %s stdout %r stderr %r" % (

sanitize(cmd), proc.returncode,

stdout_data, stderr_data))

iteration += 1

if iteration == 5:

logger.critical('Execute Jobs Failed After 5 Iterations.')

break

continue

logger.info('Success: [%s]' % sanitize(cmd))

break

except (subprocess.TimeoutExpired, subprocess.SubprocessError) as e:

os.killpg(os.getpgid(proc.pid), signal.SIGKILL)

logger.warning('[%s]' % e)

logger.info('Restarting [%s]' % sanitize(cmd))

iteration += 1

if iteration == 5:

logger.critical('Execute Jobs Failed After 5 Iterations.')

break

continue

Log Snippet showing each command executing:

2019-05-25 03:15:03,872 - __main__ - INFO - Restarting [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/] - devdbadmin
2019-05-25 03:15:03,875 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/] - devdbadmin
2019-05-25 03:27:53,801 - __main__ - INFO - Success: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/] - devdbadmin
2019-05-25 03:27:53,821 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin
2019-05-25 03:42:53,821 - __main__ - WARNING - [Command '['/usr/local/bin/rsync', '-a', 'rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/', '7/updates/x86_64/']' timed out after 899.9999316609465 seconds] - devdbadmin
2019-05-25 03:42:53,822 - __main__ - INFO - Restarting [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin
2019-05-25 03:42:53,850 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin
2019-05-25 03:57:53,851 - __main__ - WARNING - [Command '['/usr/local/bin/rsync', '-a', 'rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/', '7/updates/x86_64/']' timed out after 899.9999369028956 seconds] - devdbadmin
2019-05-25 03:57:53,852 - __main__ - INFO - Restarting [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin
2019-05-25 03:57:53,854 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin
2019-05-25 04:01:28,522 - __main__ - INFO - Success: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin
2019-05-25 04:01:28,524 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/centosplus/x86_64/ 7/centosplus/x86_64/] - devdbadmin
2019-05-25 04:16:28,527 - __main__ - WARNING - [Command '['/usr/local/bin/rsync', '-a', 'rsync://mirror.cogentco.com/CentOS/7/centosplus/x86_64/', '7/centosplus/x86_64/']' timed out after 899.9999288369436 seconds] - devdbadmin

2019-05-25 03:15:03,872 - __main__ - INFO - Restarting [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/] - devdbadmin

2019-05-25 03:15:03,875 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/] - devdbadmin

2019-05-25 03:27:53,801 - __main__ - INFO - Success: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/os/x86_64/ 7/x86_64/] - devdbadmin

2019-05-25 03:27:53,821 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin

2019-05-25 03:42:53,821 - __main__ - WARNING - [Command '['/usr/local/bin/rsync', '-a', 'rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/', '7/updates/x86_64/']' timed out after 899.9999316609465 seconds] - devdbadmin

2019-05-25 03:42:53,822 - __main__ - INFO - Restarting [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin

2019-05-25 03:42:53,850 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin

2019-05-25 03:57:53,851 - __main__ - WARNING - [Command '['/usr/local/bin/rsync', '-a', 'rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/', '7/updates/x86_64/']' timed out after 899.9999369028956 seconds] - devdbadmin

2019-05-25 03:57:53,852 - __main__ - INFO - Restarting [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin

2019-05-25 03:57:53,854 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin

2019-05-25 04:01:28,522 - __main__ - INFO - Success: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/updates/x86_64/ 7/updates/x86_64/] - devdbadmin

2019-05-25 04:01:28,524 - __main__ - INFO - Start Command: [/usr/local/bin/rsync -a rsync://mirror.cogentco.com/CentOS/7/centosplus/x86_64/ 7/centosplus/x86_64/] - devdbadmin

2019-05-25 04:16:28,527 - __main__ - WARNING - [Command '['/usr/local/bin/rsync', '-a', 'rsync://mirror.cogentco.com/CentOS/7/centosplus/x86_64/', '7/centosplus/x86_64/']' timed out after 899.9999288369436 seconds] - devdbadmin

CENTOS6 Postgres pg_upgrade 9 to 11 – In Place – Link – No Copy – Limited Disk Space

April 17, 2019November 13, 2019 admin1 Comment

I wanted to share my experience with upgrading postgres database server from major version 9 to 11. I am showing the steps that I took to get many servers in dev and production upgraded with limited disk space(not enough space to copy). I am hoping this will help with the problems I faced when testing this procedure. Using the –link parameter has drawbacks as noted in the documentation, however we perform full VM backups of each server so we can always restore from backup if the upgrade fails and we will not need to start the pg9.3 database again.

https://www.postgresql.org/docs/11/pgupgrade.html

-k --link

use hard links instead of copying files to the new cluster If you ran pg_upgrade with --link, the data files are shared between the old and new cluster. If you started the new cluster, the new server has written to those shared files and it is unsafe to use the old cluster.

Before we get started make a backup of the files pg_hba.conf and postgresql.conf for later use, you will need to use them later to reconstruct the pg11 configs.

[root@jr-sandbox ~]# cp /data1/data93/pg_hba.conf /root/
[root@jr-sandbox ~]# cp /data1/data93/postgresql.conf /root/

1 2	[root@jr-sandbox ~]# cp /data1/data93/pg_hba.conf /root/ [root@jr-sandbox ~]# cp /data1/data93/postgresql.conf /root/

Use WGET to grab the RPMS from https://yum.postgresql.org

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-11.2-2PGDG.rhel6.x86_64.rpm
[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-contrib-11.2-2PGDG.rhel6.x86_64.rpm
[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-debuginfo-11.2-2PGDG.rhel6.x86_64.rpm
[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-devel-11.2-2PGDG.rhel6.x86_64.rpm
[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-docs-11.2-2PGDG.rhel6.x86_64.rpm
[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-libs-11.2-2PGDG.rhel6.x86_64.rpm
[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-server-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-contrib-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-debuginfo-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-devel-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-docs-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-libs-11.2-2PGDG.rhel6.x86_64.rpm

[root@jr-sandbox pg_11]# wget https://yum.postgresql.org/11/redhat/rhel-6-x86_64/postgresql11-server-11.2-2PGDG.rhel6.x86_64.rpm

Install the RPMS for postgres11 that we just downloaded

[root@jr-sandbox pg_11]# rpm -ivh postgresql11-*
warning: postgresql11-11.2-2PGDG.rhel6.x86_64.rpm: Header V4 DSA/SHA1 Signature, key ID 442df0f8: NOKEY
Preparing...                ########################################### [100%]
   1:postgresql11-libs      ########################################### [ 14%]
   2:postgresql11           ########################################### [ 29%]
   3:postgresql11-contrib   ########################################### [ 43%]
   4:postgresql11-devel     ########################################### [ 57%]
   5:postgresql11-server    ########################################### [ 71%]
   6:postgresql11-docs      ########################################### [ 86%]
   7:postgresql11-debuginfo ########################################### [100%]

[root@jr-sandbox pg_11]# rpm -ivh postgresql11-*

warning: postgresql11-11.2-2PGDG.rhel6.x86_64.rpm: Header V4 DSA/SHA1 Signature, key ID 442df0f8: NOKEY

Preparing... ########################################### [100%]

1:postgresql11-libs ########################################### [ 14%]

2:postgresql11 ########################################### [ 29%]

3:postgresql11-contrib ########################################### [ 43%]

4:postgresql11-devel ########################################### [ 57%]

5:postgresql11-server ########################################### [ 71%]

6:postgresql11-docs ########################################### [ 86%]

7:postgresql11-debuginfo ########################################### [100%]

We will create the data location for postgres11 where the files will be hardlinked and not copied. You can see the tablespace disk locations and the index locations from the pg9.3 install. Its important to create the new pg11 data directory on the same filesystem since we will be using the –link parameter and it uses hardlinks which will not traverse filesystems.

[root@jr-sandbox ~]# cd /data1/
[root@jr-sandbox data1]# ls -ltr
total 12
drwx------  3 postgres postgres 4096 Apr 16 22:57 ts_index1
drwx------  3 postgres postgres 4096 Apr 16 22:58 ts_data2
drwx------ 16 postgres postgres 4096 Apr 16 23:02 data93
[root@jr-sandbox data1]# mkdir data11
[root@jr-sandbox data1]# chown -R postgres:postgres data11/

[root@jr-sandbox ~]# cd /data1/

[root@jr-sandbox data1]# ls -ltr

total 12

drwx------ 3 postgres postgres 4096 Apr 16 22:57 ts_index1

drwx------ 3 postgres postgres 4096 Apr 16 22:58 ts_data2

drwx------ 16 postgres postgres 4096 Apr 16 23:02 data93

[root@jr-sandbox data1]# mkdir data11

[root@jr-sandbox data1]# chown -R postgres:postgres data11/

We will need to init a postgres database in our new location on disk data11.

[root@jr-sandbox ~]# su - postgres
-bash-4.1$ /usr/pgsql-11/bin/initdb -D /data1/data11
The files belonging to this database system will be owned by user "postgres".
This user must also own the server process.

The database cluster will be initialized with locale "en_US.UTF-8".
The default database encoding has accordingly been set to "UTF8".
The default text search configuration will be set to "english".

Data page checksums are disabled.

fixing permissions on existing directory /data1/data11 ... ok
creating subdirectories ... ok
selecting default max_connections ... 100
selecting default shared_buffers ... 128MB
selecting dynamic shared memory implementation ... posix
creating configuration files ... ok
running bootstrap script ... ok
performing post-bootstrap initialization ... ok
syncing data to disk ... ok

WARNING: enabling "trust" authentication for local connections
You can change this by editing pg_hba.conf or using the option -A, or
--auth-local and --auth-host, the next time you run initdb.

Success. You can now start the database server using:

    /usr/pgsql-11/bin/pg_ctl -D /data1/data11 -l logfile start

-bash-4.1$

[root@jr-sandbox ~]# su - postgres

-bash-4.1$ /usr/pgsql-11/bin/initdb -D /data1/data11

The files belonging to this database system will be owned by user "postgres".

This user must also own the server process.

The database cluster will be initialized with locale "en_US.UTF-8".

The default database encoding has accordingly been set to "UTF8".

The default text search configuration will be set to "english".

Data page checksums are disabled.

fixing permissions on existing directory /data1/data11 ... ok

creating subdirectories ... ok

selecting default max_connections ... 100

selecting default shared_buffers ... 128MB

selecting dynamic shared memory implementation ... posix

creating configuration files ... ok

running bootstrap script ... ok

performing post-bootstrap initialization ... ok

syncing data to disk ... ok

WARNING: enabling "trust" authentication for local connections

You can change this by editing pg_hba.conf or using the option -A, or

--auth-local and --auth-host, the next time you run initdb.

Success. You can now start the database server using:

/usr/pgsql-11/bin/pg_ctl -D /data1/data11 -l logfile start

-bash-4.1$

Now we are ready to stop pg9.3 and check pg_upgrade compatibility. pg_upgrade ships with a –check argument that will check the compatibility of the clusters and be sure the upgrade will work before changing any files. Lets stop pg9.3 and run the pg_upgrade with the –check parameter.

[root@jr-sandbox ~]# /etc/init.d/postgresql-9.3 stop
Stopping postgresql-9.3 service:                           [  OK  ]

[root@jr-sandbox ~]# su - postgres

-bash-4.1$ /usr/pgsql-11/bin/pg_upgrade --link --old-bindir=/usr/pgsql-9.3/bin/ --new-bindir=/usr/pgsql-11/bin/ --old-datadir=/data1/data93/ --new-datadir=/data1/data11/ --check
Performing Consistency Checks
-----------------------------
Checking cluster versions                                   ok
Checking database user is the install user                  ok
Checking database connection settings                       ok
Checking for prepared transactions                          ok
Checking for reg* data types in user tables                 ok
Checking for contrib/isn with bigint-passing mismatch       ok
Checking for invalid "unknown" user columns                 ok
Checking for hash indexes                                   ok
Checking for roles starting with "pg_"                      ok
Checking for incompatible "line" data type                  ok
Checking for presence of required libraries                 ok
Checking database user is the install user                  ok
Checking for prepared transactions                          ok

*Clusters are compatible*

[root@jr-sandbox ~]# /etc/init.d/postgresql-9.3 stop

Stopping postgresql-9.3 service: [ OK ]

[root@jr-sandbox ~]# su - postgres

-bash-4.1$ /usr/pgsql-11/bin/pg_upgrade --link --old-bindir=/usr/pgsql-9.3/bin/ --new-bindir=/usr/pgsql-11/bin/ --old-datadir=/data1/data93/ --new-datadir=/data1/data11/ --check

Performing Consistency Checks

-----------------------------

Checking cluster versions ok

Checking database user is the install user ok

Checking database connection settings ok

Checking for prepared transactions ok

Checking for reg* data types in user tables ok

Checking for contrib/isn with bigint-passing mismatch ok

Checking for invalid "unknown" user columns ok

Checking for hash indexes ok

Checking for roles starting with "pg_" ok

Checking for incompatible "line" data type ok

Checking for presence of required libraries ok

Checking database user is the install user ok

Checking for prepared transactions ok

*Clusters are compatible*

Ok checks have passed and the cluster versions are ready for upgrade, lets run this without the –check parameter and upgrade postgres.

[root@jr-sandbox ~]# su - postgres
-bash-4.1$ /usr/pgsql-11/bin/pg_upgrade --link --old-bindir=/usr/pgsql-9.3/bin/ --new-bindir=/usr/pgsql-11/bin/ --old-datadir=/data1/data93/ --new-datadir=/data1/data11/
Performing Consistency Checks
-----------------------------
Checking cluster versions                                   ok
Checking database user is the install user                  ok
Checking database connection settings                       ok
Checking for prepared transactions                          ok
Checking for reg* data types in user tables                 ok
Checking for contrib/isn with bigint-passing mismatch       ok
Checking for invalid "unknown" user columns                 ok
Checking for roles starting with "pg_"                      ok
Checking for incompatible "line" data type                  ok
Creating dump of global objects                             ok
Creating dump of database schemas
                                                            ok
Checking for presence of required libraries                 ok
Checking database user is the install user                  ok
Checking for prepared transactions                          ok

If pg_upgrade fails after this point, you must re-initdb the
new cluster before continuing.

Performing Upgrade
------------------
Analyzing all rows in the new cluster                       ok
Freezing all rows in the new cluster                        ok
Deleting files from new pg_xact                             ok
Copying old pg_clog to new server                           ok
Setting next transaction ID and epoch for new cluster       ok
Deleting files from new pg_multixact/offsets                ok
Copying old pg_multixact/offsets to new server              ok
Deleting files from new pg_multixact/members                ok
Copying old pg_multixact/members to new server              ok
Setting next multixact ID and offset for new cluster        ok
Resetting WAL archives                                      ok
Setting frozenxid and minmxid counters in new cluster       ok
Restoring global objects in the new cluster                 ok
Restoring database schemas in the new cluster
                                                            ok
Adding ".old" suffix to old global/pg_control               ok

If you want to start the old cluster, you will need to remove
the ".old" suffix from /data1/data93/global/pg_control.old.
Because "link" mode was used, the old cluster cannot be safely
started once the new cluster has been started.

Linking user relation files
                                                            ok
Setting next OID for new cluster                            ok
Sync data directory to disk                                 ok
Creating script to analyze new cluster                      ok
Creating script to delete old cluster                       ok
Checking for hash indexes                                   ok

Upgrade Complete
----------------
Optimizer statistics are not transferred by pg_upgrade so,
once you start the new server, consider running:
    ./analyze_new_cluster.sh

Running this script will delete the old cluster's data files:
    ./delete_old_cluster.sh
-bash-4.1$

[root@jr-sandbox ~]# su - postgres

-bash-4.1$ /usr/pgsql-11/bin/pg_upgrade --link --old-bindir=/usr/pgsql-9.3/bin/ --new-bindir=/usr/pgsql-11/bin/ --old-datadir=/data1/data93/ --new-datadir=/data1/data11/

Performing Consistency Checks

-----------------------------

Checking cluster versions ok

Checking database user is the install user ok

Checking database connection settings ok

Checking for prepared transactions ok

Checking for reg* data types in user tables ok

Checking for contrib/isn with bigint-passing mismatch ok

Checking for invalid "unknown" user columns ok

Checking for roles starting with "pg_" ok

Checking for incompatible "line" data type ok

Creating dump of global objects ok

Creating dump of database schemas

Checking for presence of required libraries ok

Checking database user is the install user ok

Checking for prepared transactions ok

If pg_upgrade fails after this point, you must re-initdb the

new cluster before continuing.

Performing Upgrade

------------------

Analyzing all rows in the new cluster ok

Freezing all rows in the new cluster ok

Deleting files from new pg_xact ok

Copying old pg_clog to new server ok

Setting next transaction ID and epoch for new cluster ok

Deleting files from new pg_multixact/offsets ok

Copying old pg_multixact/offsets to new server ok

Deleting files from new pg_multixact/members ok

Copying old pg_multixact/members to new server ok

Setting next multixact ID and offset for new cluster ok

Resetting WAL archives ok

Setting frozenxid and minmxid counters in new cluster ok

Restoring global objects in the new cluster ok

Restoring database schemas in the new cluster

Adding ".old" suffix to old global/pg_control ok

If you want to start the old cluster, you will need to remove

the ".old" suffix from /data1/data93/global/pg_control.old.

Because "link" mode was used, the old cluster cannot be safely

started once the new cluster has been started.

Linking user relation files

Setting next OID for new cluster ok

Sync data directory to disk ok

Creating script to analyze new cluster ok

Creating script to delete old cluster ok

Checking for hash indexes ok

Upgrade Complete

----------------

Optimizer statistics are not transferred by pg_upgrade so,

once you start the new server, consider running:

./analyze_new_cluster.sh

Running this script will delete the old cluster's data files:

./delete_old_cluster.sh

-bash-4.1$

OK the pg_upgrade code completed successfully and has generated 2 scripts. One to analyze the new pg11 cluster to get stats for the query planner and vacuum. The other to cleanup and remove the old pg9.3 locations on disk. Let’s start pg11, we will need to create an override file to tell pg11 where the data11 data lives, then we should be able to start postgres and check some things and verify our upgrade.

[root@jr-sandbox ~]# cd /etc/sysconfig/pgsql/
[root@jr-sandbox pgsql]# cp postgresql-9.3 postgresql-11
[root@jr-sandbox pgsql]# vim postgresql-11 
[root@jr-sandbox pgsql]# cat postgresql-11 
PGDATA=/data1/data11
PGLOG=/data1/data11/pgstartup.log

[root@jr-sandbox pgsql]# /etc/init.d/postgresql-11 start
Starting postgresql-11 service:                            [  OK  ]

[root@jr-sandbox ~]# cd /etc/sysconfig/pgsql/

[root@jr-sandbox pgsql]# cp postgresql-9.3 postgresql-11

[root@jr-sandbox pgsql]# vim postgresql-11

[root@jr-sandbox pgsql]# cat postgresql-11

PGDATA=/data1/data11

PGLOG=/data1/data11/pgstartup.log

[root@jr-sandbox pgsql]# /etc/init.d/postgresql-11 start

Starting postgresql-11 service: [ OK ]

[root@jr-sandbox pgsql]# su - postgres
-bash-4.1$ ps -ef| grep postgres| head -n 1
postgres 31047     1  0 23:30 ?        00:00:00 /usr/pgsql-11/bin/postmaster -D /data1/data11
-bash-4.1$ psql 
psql (11.2)
Type "help" for help.

postgres=# select spcname
      ,pg_tablespace_location(oid) 
from   pg_tablespace;
  spcname   | pg_tablespace_location 
------------+------------------------
 pg_default | 
 pg_global  | 
 index1     | /data1/ts_index1
 data2      | /data1/ts_data2
(4 rows)

[root@jr-sandbox pgsql]# su - postgres

-bash-4.1$ ps -ef| grep postgres| head -n 1

postgres 31047 1 0 23:30 ? 00:00:00 /usr/pgsql-11/bin/postmaster -D /data1/data11

-bash-4.1$ psql

psql (11.2)

Type "help" for help.

postgres=# select spcname

,pg_tablespace_location(oid)

from pg_tablespace;

spcname | pg_tablespace_location

------------+------------------------

pg_default |

pg_global |

index1 | /data1/ts_index1

data2 | /data1/ts_data2

(4 rows)

OK we can see we have pg11 running and we can run the generated scripts to cleanup, but lets take a look at the data and index directories to see what the upgrade produced.

[root@jr-sandbox ~]# cd /data1/
[root@jr-sandbox data1]# ls
data11  data93  ts_data2  ts_index1
[root@jr-sandbox data1]# cd data11/
[root@jr-sandbox data11]# ls -l
total 132
drwx------ 5 postgres postgres  4096 Apr 16 23:19 base
-rw------- 1 postgres postgres    30 Apr 16 23:30 current_logfiles
drwx------ 2 postgres postgres  4096 Apr 16 23:32 global
drwx------ 2 postgres postgres  4096 Apr 16 23:10 log
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_commit_ts
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_dynshmem
-rw------- 1 postgres postgres  4513 Apr 16 23:09 pg_hba.conf
-rw------- 1 postgres postgres  1636 Apr 16 23:09 pg_ident.conf
drwx------ 4 postgres postgres  4096 Apr 16 23:35 pg_logical
drwx------ 4 postgres postgres  4096 Apr 16 23:19 pg_multixact
drwx------ 2 postgres postgres  4096 Apr 16 23:30 pg_notify
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_replslot
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_serial
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_snapshots
-rw------- 1 postgres postgres   469 Apr 16 23:30 pgstartup.log
drwx------ 2 postgres postgres  4096 Apr 16 23:30 pg_stat
drwx------ 2 postgres postgres  4096 Apr 16 23:45 pg_stat_tmp
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_subtrans
drwx------ 2 postgres postgres  4096 Apr 16 23:19 pg_tblspc
drwx------ 2 postgres postgres  4096 Apr 16 23:09 pg_twophase
-rw------- 1 postgres postgres     3 Apr 16 23:09 PG_VERSION
drwx------ 3 postgres postgres  4096 Apr 16 23:19 pg_wal
drwx------ 2 postgres postgres  4096 Apr 16 23:19 pg_xact
-rw------- 1 postgres postgres    88 Apr 16 23:09 postgresql.auto.conf
-rw------- 1 postgres postgres 23863 Apr 16 23:09 postgresql.conf
-rw------- 1 postgres postgres    48 Apr 16 23:30 postmaster.opts
-rw------- 1 postgres postgres    95 Apr 16 23:30 postmaster.pid
[root@jr-sandbox data11]# cd ../ts_index1/
[root@jr-sandbox ts_index1]# ls -l
total 8
drwx------ 2 postgres postgres 4096 Apr 16 23:19 PG_11_201809051
drwx------ 2 postgres postgres 4096 Apr 16 22:57 PG_9.3_201306121
[root@jr-sandbox ts_index1]# cd ../ts_data2/
You have mail in /var/spool/mail/root
[root@jr-sandbox ts_data2]# ls -l
total 8
drwx------ 2 postgres postgres 4096 Apr 16 23:19 PG_11_201809051
drwx------ 2 postgres postgres 4096 Apr 16 22:58 PG_9.3_201306121
<strong>

[root@jr-sandbox ~]# cd /data1/

[root@jr-sandbox data1]# ls

data11 data93 ts_data2 ts_index1

[root@jr-sandbox data1]# cd data11/

[root@jr-sandbox data11]# ls -l

total 132

drwx------ 5 postgres postgres 4096 Apr 16 23:19 base

-rw------- 1 postgres postgres 30 Apr 16 23:30 current_logfiles

drwx------ 2 postgres postgres 4096 Apr 16 23:32 global

drwx------ 2 postgres postgres 4096 Apr 16 23:10 log

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_commit_ts

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_dynshmem

-rw------- 1 postgres postgres 4513 Apr 16 23:09 pg_hba.conf

-rw------- 1 postgres postgres 1636 Apr 16 23:09 pg_ident.conf

drwx------ 4 postgres postgres 4096 Apr 16 23:35 pg_logical

drwx------ 4 postgres postgres 4096 Apr 16 23:19 pg_multixact

drwx------ 2 postgres postgres 4096 Apr 16 23:30 pg_notify

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_replslot

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_serial

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_snapshots

-rw------- 1 postgres postgres 469 Apr 16 23:30 pgstartup.log

drwx------ 2 postgres postgres 4096 Apr 16 23:30 pg_stat

drwx------ 2 postgres postgres 4096 Apr 16 23:45 pg_stat_tmp

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_subtrans

drwx------ 2 postgres postgres 4096 Apr 16 23:19 pg_tblspc

drwx------ 2 postgres postgres 4096 Apr 16 23:09 pg_twophase

-rw------- 1 postgres postgres 3 Apr 16 23:09 PG_VERSION

drwx------ 3 postgres postgres 4096 Apr 16 23:19 pg_wal

drwx------ 2 postgres postgres 4096 Apr 16 23:19 pg_xact

-rw------- 1 postgres postgres 88 Apr 16 23:09 postgresql.auto.conf

-rw------- 1 postgres postgres 23863 Apr 16 23:09 postgresql.conf

-rw------- 1 postgres postgres 48 Apr 16 23:30 postmaster.opts

-rw------- 1 postgres postgres 95 Apr 16 23:30 postmaster.pid

[root@jr-sandbox data11]# cd ../ts_index1/

[root@jr-sandbox ts_index1]# ls -l

total 8

drwx------ 2 postgres postgres 4096 Apr 16 23:19 PG_11_201809051

drwx------ 2 postgres postgres 4096 Apr 16 22:57 PG_9.3_201306121

[root@jr-sandbox ts_index1]# cd ../ts_data2/

You have mail in /var/spool/mail/root

[root@jr-sandbox ts_data2]# ls -l

total 8

drwx------ 2 postgres postgres 4096 Apr 16 23:19 PG_11_201809051

drwx------ 2 postgres postgres 4096 Apr 16 22:58 PG_9.3_201306121

We can view the shell scripts that pg_upgrade produced and cleanup the old pg9.3 references and run the analyze vacuums.

[root@jr-sandbox ~]# su - postgres
-bash-4.1$ ls
11  9.3  analyze_new_cluster.sh  delete_old_cluster.sh
-bash-4.1$ cat delete_old_cluster.sh 
#!/bin/sh

rm -rf '/data1/data93'
rm -rf '/data1/ts_index1/PG_9.3_201306121'
rm -rf '/data1/ts_data2/PG_9.3_201306121'
-bash-4.1$ cat analyze_new_cluster.sh 
#!/bin/sh

echo 'This script will generate minimal optimizer statistics rapidly'
echo 'so your system is usable, and then gather statistics twice more'
echo 'with increasing accuracy.  When it is done, your system will'
echo 'have the default level of optimizer statistics.'
echo

echo 'If you have used ALTER TABLE to modify the statistics target for'
echo 'any tables, you might want to remove them and restore them after'
echo 'running this script because they will delay fast statistics generation.'
echo

echo 'If you would like default statistics as quickly as possible, cancel'
echo 'this script and run:'
echo '    "/usr/pgsql-11/bin/vacuumdb" --all --analyze-only'
echo

"/usr/pgsql-11/bin/vacuumdb" --all --analyze-in-stages
echo

echo 'Done'

[root@jr-sandbox ~]# su - postgres

-bash-4.1$ ls

11 9.3 analyze_new_cluster.sh delete_old_cluster.sh

-bash-4.1$ cat delete_old_cluster.sh

#!/bin/sh

rm -rf '/data1/data93'

rm -rf '/data1/ts_index1/PG_9.3_201306121'

rm -rf '/data1/ts_data2/PG_9.3_201306121'

-bash-4.1$ cat analyze_new_cluster.sh

#!/bin/sh

echo 'This script will generate minimal optimizer statistics rapidly'

echo 'so your system is usable, and then gather statistics twice more'

echo 'with increasing accuracy. When it is done, your system will'

echo 'have the default level of optimizer statistics.'

echo

echo 'If you have used ALTER TABLE to modify the statistics target for'

echo 'any tables, you might want to remove them and restore them after'

echo 'running this script because they will delay fast statistics generation.'

echo

echo 'If you would like default statistics as quickly as possible, cancel'

echo 'this script and run:'

echo ' "/usr/pgsql-11/bin/vacuumdb" --all --analyze-only'

echo

"/usr/pgsql-11/bin/vacuumdb" --all --analyze-in-stages

echo

echo 'Done'

This looks good, lets execute them and cleanup any pg9.3 references as well as remove the pg9.3 rpms.

[root@jr-sandbox data1]# su - postgres
-bash-4.1$ ./delete_old_cluster.sh 
-bash-4.1$ ./analyze_new_cluster.sh 
This script will generate minimal optimizer statistics rapidly
so your system is usable, and then gather statistics twice more
with increasing accuracy.  When it is done, your system will
have the default level of optimizer statistics.

If you have used ALTER TABLE to modify the statistics target for
any tables, you might want to remove them and restore them after
running this script because they will delay fast statistics generation.

If you would like default statistics as quickly as possible, cancel
this script and run:
    "/usr/pgsql-11/bin/vacuumdb" --all --analyze-only

vacuumdb: processing database "postgres": Generating minimal optimizer statistics (1 target)
vacuumdb: processing database "template1": Generating minimal optimizer statistics (1 target)
vacuumdb: processing database "postgres": Generating medium optimizer statistics (10 targets)
vacuumdb: processing database "template1": Generating medium optimizer statistics (10 targets)
vacuumdb: processing database "postgres": Generating default (full) optimizer statistics
vacuumdb: processing database "template1": Generating default (full) optimizer statistics

Done
-bash-4.1$

[root@jr-sandbox data1]# su - postgres

-bash-4.1$ ./delete_old_cluster.sh

-bash-4.1$ ./analyze_new_cluster.sh

This script will generate minimal optimizer statistics rapidly

so your system is usable, and then gather statistics twice more

with increasing accuracy. When it is done, your system will

have the default level of optimizer statistics.

If you have used ALTER TABLE to modify the statistics target for

any tables, you might want to remove them and restore them after

running this script because they will delay fast statistics generation.

If you would like default statistics as quickly as possible, cancel

this script and run:

"/usr/pgsql-11/bin/vacuumdb" --all --analyze-only

vacuumdb: processing database "postgres": Generating minimal optimizer statistics (1 target)

vacuumdb: processing database "template1": Generating minimal optimizer statistics (1 target)

vacuumdb: processing database "postgres": Generating medium optimizer statistics (10 targets)

vacuumdb: processing database "template1": Generating medium optimizer statistics (10 targets)

vacuumdb: processing database "postgres": Generating default (full) optimizer statistics

vacuumdb: processing database "template1": Generating default (full) optimizer statistics

Done

-bash-4.1$

Remove the pg9.3 rpms and references, set the new data location in the .pgsql_profile.

[root@jr-sandbox ~]# yum remove postgresql93*
Loaded plugins: fastestmirror
Setting up Remove Process
Resolving Dependencies
--> Running transaction check
---> Package postgresql93.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
---> Package postgresql93-contrib.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
---> Package postgresql93-debuginfo.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
---> Package postgresql93-devel.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
---> Package postgresql93-docs.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
---> Package postgresql93-libs.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
---> Package postgresql93-server.x86_64 0:9.3.24-1PGDG.rhel6 will be erased
--> Finished Dependency Resolution

Dependencies Resolved

===================================================================================================================================================================================================================
 Package                                                  Arch                                     Version                                              Repository                                            Size
===================================================================================================================================================================================================================
Removing:
 postgresql93                                             x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                   5.3 M
 postgresql93-contrib                                     x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                   1.7 M
 postgresql93-debuginfo                                   x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                    28 M
 postgresql93-devel                                       x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                   6.8 M
 postgresql93-docs                                        x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                    31 M
 postgresql93-libs                                        x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                   632 k
 postgresql93-server                                      x86_64                                   9.3.24-1PGDG.rhel6                                   @affinity6-prod-db                                    16 M

Transaction Summary
===================================================================================================================================================================================================================
Remove        7 Package(s)

Installed size: 89 M
Is this ok [y/N]: y
Downloading Packages:
Running rpm_check_debug
Running Transaction Test
Transaction Test Succeeded
Running Transaction
Warning: RPMDB altered outside of yum.
  Erasing    : postgresql93-debuginfo-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                1/7 
  Erasing    : postgresql93-devel-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                    2/7 
  Erasing    : postgresql93-server-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                   3/7 
  Erasing    : postgresql93-contrib-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                  4/7 
  Erasing    : postgresql93-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                          5/7 
  Erasing    : postgresql93-libs-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                     6/7 
  Erasing    : postgresql93-docs-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                     7/7 
  Verifying  : postgresql93-debuginfo-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                1/7 
  Verifying  : postgresql93-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                          2/7 
  Verifying  : postgresql93-docs-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                     3/7 
  Verifying  : postgresql93-contrib-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                  4/7 
  Verifying  : postgresql93-server-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                   5/7 
  Verifying  : postgresql93-devel-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                    6/7 
  Verifying  : postgresql93-libs-9.3.24-1PGDG.rhel6.x86_64                                                                                                                                                     7/7 

Removed:
  postgresql93.x86_64 0:9.3.24-1PGDG.rhel6          postgresql93-contrib.x86_64 0:9.3.24-1PGDG.rhel6     postgresql93-debuginfo.x86_64 0:9.3.24-1PGDG.rhel6     postgresql93-devel.x86_64 0:9.3.24-1PGDG.rhel6    
  postgresql93-docs.x86_64 0:9.3.24-1PGDG.rhel6     postgresql93-libs.x86_64 0:9.3.24-1PGDG.rhel6        postgresql93-server.x86_64 0:9.3.24-1PGDG.rhel6       

Complete!
[root@jr-sandbox ~]#

[root@jr-sandbox ~]# yum remove postgresql93*

Loaded plugins: fastestmirror

Setting up Remove Process

Resolving Dependencies

--> Running transaction check

---> Package postgresql93.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

---> Package postgresql93-contrib.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

---> Package postgresql93-debuginfo.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

---> Package postgresql93-devel.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

---> Package postgresql93-docs.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

---> Package postgresql93-libs.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

---> Package postgresql93-server.x86_64 0:9.3.24-1PGDG.rhel6 will be erased

--> Finished Dependency Resolution

Dependencies Resolved

===================================================================================================================================================================================================================

Package Arch Version Repository Size

Removing:

postgresql93 x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 5.3 M

postgresql93-contrib x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 1.7 M

postgresql93-debuginfo x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 28 M

postgresql93-devel x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 6.8 M

postgresql93-docs x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 31 M

postgresql93-libs x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 632 k

postgresql93-server x86_64 9.3.24-1PGDG.rhel6 @affinity6-prod-db 16 M

Transaction Summary

Remove 7 Package(s)

Installed size: 89 M

Is this ok [y/N]: y

Downloading Packages:

Running rpm_check_debug

Running Transaction Test

Transaction Test Succeeded

Running Transaction

Warning: RPMDB altered outside of yum.

Erasing : postgresql93-debuginfo-9.3.24-1PGDG.rhel6.x86_64 1/7

Erasing : postgresql93-devel-9.3.24-1PGDG.rhel6.x86_64 2/7

Erasing : postgresql93-server-9.3.24-1PGDG.rhel6.x86_64 3/7

Erasing : postgresql93-contrib-9.3.24-1PGDG.rhel6.x86_64 4/7

Erasing : postgresql93-9.3.24-1PGDG.rhel6.x86_64 5/7

Erasing : postgresql93-libs-9.3.24-1PGDG.rhel6.x86_64 6/7

Erasing : postgresql93-docs-9.3.24-1PGDG.rhel6.x86_64 7/7

Verifying : postgresql93-debuginfo-9.3.24-1PGDG.rhel6.x86_64 1/7

Verifying : postgresql93-9.3.24-1PGDG.rhel6.x86_64 2/7

Verifying : postgresql93-docs-9.3.24-1PGDG.rhel6.x86_64 3/7

Verifying : postgresql93-contrib-9.3.24-1PGDG.rhel6.x86_64 4/7

Verifying : postgresql93-server-9.3.24-1PGDG.rhel6.x86_64 5/7

Verifying : postgresql93-devel-9.3.24-1PGDG.rhel6.x86_64 6/7

Verifying : postgresql93-libs-9.3.24-1PGDG.rhel6.x86_64 7/7

Removed:

postgresql93.x86_64 0:9.3.24-1PGDG.rhel6 postgresql93-contrib.x86_64 0:9.3.24-1PGDG.rhel6 postgresql93-debuginfo.x86_64 0:9.3.24-1PGDG.rhel6 postgresql93-devel.x86_64 0:9.3.24-1PGDG.rhel6

postgresql93-docs.x86_64 0:9.3.24-1PGDG.rhel6 postgresql93-libs.x86_64 0:9.3.24-1PGDG.rhel6 postgresql93-server.x86_64 0:9.3.24-1PGDG.rhel6

Complete!

[root@jr-sandbox ~]#

[root@jr-sandbox ~]# cd /etc/sysconfig/pgsql/
You have new mail in /var/spool/mail/root
[root@jr-sandbox pgsql]# ls
postgresql-11  postgresql-9.3
[root@jr-sandbox pgsql]# rm -f postgresql-9.3 
[root@jr-sandbox pgsql]# su - postgres
-bash-4.1$ ls
11  9.3  analyze_new_cluster.sh  delete_old_cluster.sh
-bash-4.1$ rm -rf 9.3

[root@jr-sandbox ~]# cd /etc/sysconfig/pgsql/

You have new mail in /var/spool/mail/root

[root@jr-sandbox pgsql]# ls

postgresql-11 postgresql-9.3

[root@jr-sandbox pgsql]# rm -f postgresql-9.3

[root@jr-sandbox pgsql]# su - postgres

-bash-4.1$ ls

11 9.3 analyze_new_cluster.sh delete_old_cluster.sh

-bash-4.1$ rm -rf 9.3

You can now view the pg_hba.conf and postgresql.conf you saved in /root and add whats needed to the new pg11 configs.

That’s it!!

SINOPIA NPM allow connections to GITHUB as well as the NPM registry

October 11, 2018October 11, 2018 adminLeave a comment

SINOPIA LINK HERE
We use SINOPIA as a proxy on our internal network behind the firewall to allow users to install NODE packages without an internet connection. We basically run sinopia on a machine that has access to the internet and the clients point to the server to install packages that are not locally available. We have been running into issues where installs that needed access to github would fail with something like this:

[15:29:07] user1@sb-user1:~/app/mc_api/lib/reports $ npm install --save slack/client --loglevel verbose
npm info it worked if it ends with ok
npm verb cli [ '/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/node',
npm verb cli   '/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/npm',
npm verb cli   'install',
npm verb cli   '--save',
npm verb cli   'slack/client',
npm verb cli   '--loglevel',
npm verb cli   'verbose' ]
npm info using npm@6.4.1
npm info using node@v8.9.4
npm verb npm-session f40f30f7bf0339f9
npm timing stage:rollbackFailedOptional Completed in 1ms
npm timing stage:runTopLevelLifecycles Completed in 1114ms
npm verb stack Error: exited with error code: 128
npm verb stack     at ChildProcess.<anonymous> (/home/user1/node_local_install/.nvm/versions/node/v8.9.4/lib/node_modules/npm/node_modules/pacote/lib/util/finished.js:12:19)
npm verb stack     at emitTwo (events.js:126:13)
npm verb stack     at ChildProcess.emit (events.js:214:7)
npm verb stack     at maybeClose (internal/child_process.js:925:16)
npm verb stack     at Socket.stream.socket.on (internal/child_process.js:346:11)
npm verb stack     at emitOne (events.js:116:13)
npm verb stack     at Socket.emit (events.js:211:7)
npm verb stack     at Pipe._handle.close [as _onclose] (net.js:554:12)
npm verb cwd /home/user1/app/mc_api/lib/reports
npm verb Linux 2.6.32-754.3.5.el6.x86_64
npm verb argv "/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/node" "/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/npm" "install" "--save" "slack/client" "--loglevel" "verbose"
npm verb node v8.9.4
npm verb npm  v6.4.1
npm ERR! Error while executing:
npm ERR! /usr/bin/git ls-remote -h -t ssh://git@github.com/slack/client.git
npm ERR!
npm ERR! ssh: connect to host github.com port 22: Connection refused
npm ERR! fatal: Could not read from remote repository.
npm ERR!
npm ERR! Please make sure you have the correct access rights
npm ERR! and the repository exists.
npm ERR!
npm ERR! exited with error code: 128
npm verb exit [ 1, true ]
npm timing npm Completed in 1497ms

npm ERR! A complete log of this run can be found in:
npm ERR!     /home/user1/.npm/_logs/2018-10-10T19_34_06_306Z-debug.log

[15:29:07] user1@sb-user1:~/app/mc_api/lib/reports $ npm install --save slack/client --loglevel verbose

npm info it worked if it ends with ok

npm verb cli [ '/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/node',

npm verb cli '/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/npm',

npm verb cli 'install',

npm verb cli '--save',

npm verb cli 'slack/client',

npm verb cli '--loglevel',

npm verb cli 'verbose' ]

npm info using npm@6.4.1

npm info using node@v8.9.4

npm verb npm-session f40f30f7bf0339f9

npm timing stage:rollbackFailedOptional Completed in 1ms

npm timing stage:runTopLevelLifecycles Completed in 1114ms

npm verb stack Error: exited with error code: 128

npm verb stack at ChildProcess.<anonymous> (/home/user1/node_local_install/.nvm/versions/node/v8.9.4/lib/node_modules/npm/node_modules/pacote/lib/util/finished.js:12:19)

npm verb stack at emitTwo (events.js:126:13)

npm verb stack at ChildProcess.emit (events.js:214:7)

npm verb stack at maybeClose (internal/child_process.js:925:16)

npm verb stack at Socket.stream.socket.on (internal/child_process.js:346:11)

npm verb stack at emitOne (events.js:116:13)

npm verb stack at Socket.emit (events.js:211:7)

npm verb stack at Pipe._handle.close [as _onclose] (net.js:554:12)

npm verb cwd /home/user1/app/mc_api/lib/reports

npm verb Linux 2.6.32-754.3.5.el6.x86_64

npm verb argv "/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/node" "/home/user1/node_local_install/.nvm/versions/node/v8.9.4/bin/npm" "install" "--save" "slack/client" "--loglevel" "verbose"

npm verb node v8.9.4

npm verb npm v6.4.1

npm ERR! Error while executing:

npm ERR! /usr/bin/git ls-remote -h -t ssh://git@github.com/slack/client.git

npm ERR!

npm ERR! ssh: connect to host github.com port 22: Connection refused

npm ERR! fatal: Could not read from remote repository.

npm ERR!

npm ERR! Please make sure you have the correct access rights

npm ERR! and the repository exists.

npm ERR!

npm ERR! exited with error code: 128

npm verb exit [ 1, true ]

npm timing npm Completed in 1497ms

npm ERR! A complete log of this run can be found in:

npm ERR! /home/user1/.npm/_logs/2018-10-10T19_34_06_306Z-debug.log

As you can see, we are getting choked at:

npm ERR! ssh: connect to host github.com port 22: Connection refused
npm ERR! fatal: Could not read from remote repository.

1 2	npm ERR! ssh: connect to host github.com port 22: Connection refused npm ERR! fatal: Could not read from remote repository.

To get around this we need to change the config.yml on the server to allow proxies to github, here is the final configuration. Hope this helps other users as we had a fun time trying to figure it out. Pay attention to the uplinks section and the proxy requests where github is defined.

#
# This is the default config file. It allows all users to do anything,
# so don't use it on production systems.
#
# Look here for more config file examples:
# https://github.com/rlidwka/sinopia/tree/master/conf
#

# path to a directory with all packages
storage: ./storage

auth:
  htpasswd:
    file: ./htpasswd
    # Maximum amount of users allowed to register, defaults to "+inf".
    # You can set this to -1 to disable registration.
    #max_users: 1000

# a list of other known repositories we can talk to
uplinks:
  npmjs:
    url: https://registry.npmjs.org/
  github:
    url: https://github.com/

packages:
  '@*/*':
    # scoped packages
    access: $all
    publish: $authenticated
    proxy:
      - npmjs
      - github


  '*':
    # allow all users (including non-authenticated users) to read and
    # publish all packages
    #
    # you can specify usernames/groupnames (depending on your auth plugin)
    # and three keywords: "$all", "$anonymous", "$authenticated"
    access: $all

    # allow all known users to publish packages
    # (anyone can register by default, remember?)
    publish: $authenticated

    # if package is not available locally, proxy requests to 'npmjs' registry
    proxy:
      - npmjs
      - github
# log settings
logs:
  #- {type: stdout, format: pretty, level: http}
  - {type: file, path: sinopia.log, level: debug}

#Bind Address
listen:
  - 0.0.0.0:4873
#

# This is the default config file. It allows all users to do anything,

# so don't use it on production systems.

# Look here for more config file examples:

# https://github.com/rlidwka/sinopia/tree/master/conf

# path to a directory with all packages

storage: ./storage

auth:

htpasswd:

file: ./htpasswd

# Maximum amount of users allowed to register, defaults to "+inf".

# You can set this to -1 to disable registration.

#max_users: 1000

# a list of other known repositories we can talk to

uplinks:

npmjs:

url: https://registry.npmjs.org/

github:

url: https://github.com/

packages:

'@*/*':

# scoped packages

access: $all

publish: $authenticated

proxy:

- npmjs

- github

'*':

# allow all users (including non-authenticated users) to read and

# publish all packages

# you can specify usernames/groupnames (depending on your auth plugin)

# and three keywords: "$all", "$anonymous", "$authenticated"

access: $all

# allow all known users to publish packages

# (anyone can register by default, remember?)

publish: $authenticated

# if package is not available locally, proxy requests to 'npmjs' registry

proxy:

- npmjs

- github

# log settings

logs:

#- {type: stdout, format: pretty, level: http}

- {type: file, path: sinopia.log, level: debug}

#Bind Address

listen:

- 0.0.0.0:4873

PSQL Connect To AWS Redshift From Windows 10 PowerShell

March 16, 2018December 20, 2019 admin2 Comments

Coming from a completely Linux background, I was tasked with connecting to a aws redshift cluster or a postgres cluster via Windows powershell and PSQL. I knew it was possible and searching the internet came up with CMD prompt solutions, when I attempted via powershell, I was faced with the following error below, you will need to install postgres on windows10 to get access to the psql binary, you can get it here:
https://www.postgresql.org/download/windows/

PS C:\WINDOWS\system32> psql.exe -h afs-rs-dev02.us-east-1.redshift.amazonaws.com  -p 5439 -U awsmaster benchmark01
Password for user awsmaster:
psql: FATAL:  invalid value for parameter "client_encoding": "WIN1252"

PS C:\WINDOWS\system32> psql.exe -h afs-rs-dev02.us-east-1.redshift.amazonaws.com -p 5439 -U awsmaster benchmark01

Password for user awsmaster:

psql: FATAL: invalid value for parameter "client_encoding": "WIN1252"

Turns out a colleague of mine and I figured out you will need to set the variable PGCLIENTENCODING via the powershell command line. This was expected but we could not nail down the syntax, we found it.

PS C:\WINDOWS\system32> $env:PGCLIENTENCODING='utf-8';
PS C:\WINDOWS\system32> psql.exe -h afs-rs-dev02.us-east-1.redshift.amazonaws.com  -p 5439 -U awsmaster benchmark01
Password for user awsmaster:
psql (10.1, server 8.0.2)
WARNING: Console code page (437) differs from Windows code page (1252)
         8-bit characters might not work correctly. See psql reference
         page "Notes for Windows users" for details.
SSL connection (protocol: TLSv1.2, cipher: ECDHE-RSA-AES256-GCM-SHA384, bits: 256, compression: off)
Type "help" for help.

benchmark01=#

PS C:\WINDOWS\system32> $env:PGCLIENTENCODING='utf-8';

PS C:\WINDOWS\system32> psql.exe -h afs-rs-dev02.us-east-1.redshift.amazonaws.com -p 5439 -U awsmaster benchmark01

Password for user awsmaster:

psql (10.1, server 8.0.2)

WARNING: Console code page (437) differs from Windows code page (1252)

8-bit characters might not work correctly. See psql reference

page "Notes for Windows users" for details.

SSL connection (protocol: TLSv1.2, cipher: ECDHE-RSA-AES256-GCM-SHA384, bits: 256, compression: off)

Type "help" for help.

benchmark01=#

Once this is set, you can connect to PG as normal.

Jason R. Ralph

Linux All Day Everyday

Category: General Code

10 Year Anniversary: www.jasonralph.org

AWS Apache Managed Airflow EMR ModuleNotFoundError: No module named ‘requests’ Bootstrap

Node Application Stopped Sending Updates To Slack – can’t identify protocol

centos8 postgresql-11-check-db-dir[]: is missing or empty

AWS CLI Max Concurrent Requests Tuning

Postgres Long Running Active Queries Send To Slack

Python Function Execute Subprocess With Timeout

CENTOS6 Postgres pg_upgrade 9 to 11 – In Place – Link – No Copy – Limited Disk Space

SINOPIA NPM allow connections to GITHUB as well as the NPM registry

PSQL Connect To AWS Redshift From Windows 10 PowerShell