patroni.postgresql.rewind module
- class patroni.postgresql.rewind.REWIND_STATUS(*values)
Bases:
IntEnum- CHECK = 2
- CHECKPOINT = 1
- FAILED = 6
- INITIAL = 0
- NEED = 3
- NOT_NEED = 4
- SUCCESS = 5
- class patroni.postgresql.rewind.Rewind(postgresql: Postgresql)
Bases:
object- __checkpoint(task: CriticalTask, wakeup: Callable[[...], Any]) None
- __init__(postgresql: Postgresql) None
- _archive_ready_wals() None
Try to archive WALs that have .ready files just in case archive_mode was not set to ‘always’ before promote, while after it the WALs were recycled on the promoted replica. With this we prevent the entire loss of such WALs and the consequent old leader’s start failure.
- _build_archiver_command(command: str, wal_filename: str) str
Replace placeholders in the given archiver command’s template. Applicable for archive_command and restore_command. Can also be used for archive_cleanup_command and recovery_end_command, however %r value is always set to 000000010000000000000001.
- _check_timeline_and_lsn(leader: Leader | RemoteMember) None
- _conn_kwargs(member: Leader | RemoteMember, auth: Dict[str, Any]) Dict[str, Any]
- _fetch_missing_wal(restore_command: str, wal_filename: str) bool
- _find_missing_wal(data: bytes) str | None
- _get_checkpoint_end(timeline: int, lsn: int) int
Get the end of checkpoint record from WAL.
Note
The checkpoint record size in WAL depends on postgres major version and platform (memory alignment). Hence, the only reliable way to figure out where it ends, is to read the record from file with the help of
pg_waldumpand parse the output.We are trying to read two records, and expect that it will fail to read the second record with message:
fatal: error in WAL record at 0/182E220: invalid record length at 0/182E298: wanted 24, got 0; or
fatal: error in WAL record at 0/182E220: invalid record length at 0/182E298: expected at least 24, got 0
The error message contains information about LSN of the next record, which is exactly where checkpoint ends.
- Parameters:
timeline – the checkpoint timeline from
pg_controldata.lsn – the checkpoint location as
intfrompg_controldata.
- Returns:
the end of checkpoint record as
intor0if failed to parsepg_waldumpoutput.
- _get_local_timeline_lsn() Tuple[bool | None, int | None, int | None]
- _get_local_timeline_lsn_from_controldata() Tuple[bool | None, int | None, int | None]
- static _log_primary_history(history: List[Tuple[int, int, str]], i: int) None
- _maybe_clean_pg_replslot() None
Clean pg_replslot directory if pg version is less then 11 (pg_rewind deletes $PGDATA/pg_replslot content only since pg11).
- archive_shutdown_checkpoint_wal(archive_cmd: str) None
Archive WAL file with the shutdown checkpoint.
- Parameters:
archive_cmd – archiver command to use
- property can_rewind: bool
check if pg_rewind executable is there and that pg_controldata indicates we have either wal_log_hints or checksums turned on
- property can_rewind_or_reinitialize_allowed: bool
- static check_leader_has_run_checkpoint(conn_kwargs: Dict[str, Any]) str | None
- static check_leader_is_not_in_recovery(conn_kwargs: Dict[str, Any]) bool | None
- checkpoint_after_promote() bool
- cleanup_archive_status() None
- static configuration_allows_rewind(data: Dict[str, str]) bool
- property enabled: bool
- ensure_checkpoint_after_promote(wakeup: Callable[[...], Any]) None
After promote issue a CHECKPOINT from a new thread and asynchronously check the result. In case if CHECKPOINT failed, just check that timeline in pg_control was updated.
- ensure_clean_shutdown() bool | None
- execute(leader: Leader | RemoteMember) bool | None
- property executed: bool
- property failed: bool
- get_archive_command() str | None
Get
archive_commandGUC value if defined and archiving is enabled.- Returns:
archive_commanddefined in the Postgres configuration or None.
- property is_needed: bool
- pg_rewind(conn_kwargs: Dict[str, Any]) bool
Do pg_rewind.
Note
If
pg_rewinddoesn’t support--restore-target-walparameter and exited with non zero code, Patroni will parse stderr/stdout to figure out if it failed due to a missing WAL file and will repeat an attempt after downloading the missing file usingrestore_command.- Parameters:
conn_kwargs –
dictobject with connection parameters.- Returns:
Trueifpg_rewindfinished successfully,Falseotherwise.
- read_postmaster_opts() Dict[str, str]
returns the list of option names/values from postgres.opts, Empty dict if read failed or no file
- reset_state() None
- rewind_or_reinitialize_needed_and_possible(leader: Leader | RemoteMember | None) bool
- property should_remove_data_directory_on_diverged_timelines: bool
- single_user_mode(communicate: Dict[str, Any] | None = None, options: Dict[str, str] | None = None) int | None
run a given command in a single-user mode. If the command is empty - then just start and stop
- trigger_check_diverged_lsn() None