Understanding user management in PgBouncer

June 15, 2020

PgBouncer is a popular connection proxy and pooler for PostgreSQL. As PgBouncer presents a PostgreSQL protocol interface to client applications, it also handles client authentication. For that, it maintains its own directory of users and passwords. That is sometimes a source of confusion, so in this blog post I want to try to describe how this works.

auth_file

There are several ways in which users and passwords can be defined in PgBouncer. Let’s start with the simplest. You can define users and passwords in a text file. The location of this file is set by the configuration parameter auth_file. It is typically named userlist.txt, but you can use any name of course. This file has the following format:

"username1" "password1"
"username2" "password2"
...

This file essentially fulfills the same purpose as the system catalog pg_authid (formerly pg_shadow (formerly pg_user)) in the PostgreSQL backend. In fact, PostgreSQL used to keep its own user database in a file like this in the data directory, and the original intent of this option was that you could point PgBouncer directly at that file, to share users directly between PostgreSQL and PgBouncer. But this no longer works, because PostgreSQL no longer works that way, so for all intents and purposes this is now a textual configuration file. One way to keep this keep this file in sync with the users defined in PostgreSQL is to query pg_authid and dump the results into this file in the appropriate format. The script mkauth.py that comes with PgBouncer can be used for that.

The passwords in this file may be plain-text passwords or passwords encrypted using MD5 or SCRAM, depending on the authentication method to be used. The details of these authentication methods are not covered here. That said, if you want to store encrypted passwords, it is probably useful to decide on a way to maintain this information in an automatic or scripted way and not edit it by hand.

Note that the access permissions of this file should be restricted, typically to 0600 mode.

auth_query, auth_user

Another way to define users is to have PgBouncer query the PostgreSQL backend directly when needed. This is set by the configuration parameter auth_user, which can be set either globally or per database. When this is set, PgBouncer uses that user to connect to the PostgreSQL backend and runs the query defined by the setting auth_query to look up a user and password. If the auth_user itself needs a password for that connection, it needs to be set in userlist.txt. But you might also set it up that auth_user doesn’t need a password, perhaps using TLS certificates. (For more information on setting up TLS with PgBouncer, see this webinar.)

The default value of auth_query is

SELECT usename, passwd FROM pg_shadow WHERE usename=$1

This means that the auth_user needs to be a superuser on the PostgreSQL side. For example, you could do

[databases]
mydata = host=myhost dbname=mydata auth_user=postgres

However, it’s preferable to write SECURITY DEFINER function in the backend that can look up users and is callable only by a specific user. This blog post has an extensive discussion about how to set that up properly.

Also note that with auth_user/auth_query you can have different users and passwords per database, whereas auth_file defines users for the entire PgBouncer instance.

PAM

Finally, if you use PAM by setting auth_type = pam, then the system works differently. auth_file and auth_user are ignored. All the user and password information comes from the PAM system.

Forced users

Note that in each case, the user and password information maintained by PgBouncer is used both for authenticating connections coming in from the client as well as supplying authentication information for connections going out to the server. This is a fundamental design decision: PgBouncer wants to be a more or less transparent connection proxy as far as authentication is concerned. If you want to log in to PgBouncer with different user names or passwords than are used for the PostgreSQL backend, this will be more complicated to set up and maintain.

There is a way to assign a specific user in a database definition, like

[databases]
mydata = host=myhost dbname=mydata user=foo password=secret

This is internally called a “forced user”. This user exists only for outgoing connections to the backend server and does not really exist otherwise in the PgBouncer user database.

Which to use?

Which one of these you use depends on various considerations about your whole setup. If your PostgreSQL server itself uses PAM or perhaps LDAP for authentication, then using PAM in PgBouncer also makes sense. (PgBouncer does not support a native LDAP authentication method, but LDAP can be used via PAM.) Otherwise, auth_user/auth_query is more elegant and requires less maintenance in the long run, but it requires a more careful setup initially. auth_file is the more traditional setup that requires more maintenance if users and passwords change a lot, but it’s simpler and probably more robust and less surprising.

Share this

Relevant Blogs

Random Data

This post continues from my report on Random Numbers. I have begun working on a random data generator so I want to run some tests to see whether different random...
December 03, 2020

More Blogs

Full-text search since PostgreSQL 8.3

Welcome to the third – and last – part of this blog series, exploring how the PostgreSQL performance evolved over the years. The first part looked at OLTP workloads, represented...
November 05, 2020

Números aleatorios

He estado trabajando gradualmente en el desarrollo desde cero de herramientas para probar el rendimiento de los sistemas de bases de datos de código abierto. Uno de los componentes de...
November 04, 2020