Loading Test Data into PostgreSQL
To efficiently load test data into PostgreSQL, you can use a simple query like the following:
INSERT INTO new.students (fname, fuid, lname, dob, joined) VALUES ( (array['Oswald', 'Henry', 'Bob', 'Vennie', 'Ivan', 'Stefan'])[floor(random() * 6 + 1)], uuid_generate_v4(), (array['Leo', 'Jack', 'Den', 'Daisy', 'Woody', 'Ivanov'])[floor(random() * 6 + 1)], '1980-01-01'::date + trunc(random() * 366 * 12)::int, generate_series('1/1/1990'::date, '12/01/2024'::date, '1 day') );
This query will insert random data into the students
table. However, before running the query, you’ll need to prepare your database.
Step-by-Step Setup
1. Prepare the Database
First, create the database and switch to it:
CREATE DATABASE dba; \c dba
Next, create a schema for the data:
CREATE SCHEMA new;
2. Create the Students Table
Now, create the table to store the student data:
CREATE TABLE new.students ( id serial PRIMARY KEY, fuid UUID NOT NULL, fname VARCHAR(20), lname VARCHAR(20), dob DATE NOT NULL, joined DATE );
3. Install the UUID Extension
To use uuid_generate_v4()
for generating unique IDs in the fuid
column, install the uuid-ossp
extension:
CREATE EXTENSION IF NOT EXISTS "uuid-ossp";
4. Automate Data Insertion with a Bash Script
To streamline the data insertion process, you can use a bash script. Here’s an example script that inserts 10,000 rows:
#!/bin/bash
count=10000
for i in $(seq $count); do
psql dba -c "INSERT INTO new.students (fname,fuid,lname,dob ,joined) VALUES ( (array['Oswald', 'Henry', 'Bob', 'Vennie','Ivan','Stefan'])[floor(random() * 6 + 1)],uuid_generate_v4(),(array['Leo', 'Jack', 'Den', 'Daisy' ,'Woody', 'Ivanov'])[floor(random() * 6 + 1)], '1980-01-01'::date + trunc(random() * 366 * 12)::int, generate_series('1/1/1990'::date, '12/01/2024'::date, '1 day'));"
echo "$i"
done
5. Switch to the postgres
User
To run the script as the postgres
user, execute:
su - postgres
6. Create and Make the Script Executable
Create the script file and make it executable:
vim loaddata.sh
chmod +x loaddata.sh
7. Execute the Script
Now, run the script to insert the data:
./loaddata.sh
Results
After running the script, you can verify the number of rows inserted by running:
SELECT count(*) FROM new.students;
Output:
count --------- 127540000
Example Data Output
Here’s an example of what the data might look like in your students
table:
sql
Copy code
id | fuid | fname | lname | dob | joined
----+--------------------------------------+--------+--------+------------+------------
1 | 3ed467cc-a54a-4488-98c0-7539eb927204 | Ivan | Ivanov | 1982-10-20 | 1990-01-01
2 | 3a1f1276-9374-4a55-972f-da8d0be0cf4d | Vennie | Den | 1980-01-31 | 1990-01-02
3 | ac5e2f19-c220-4156-acef-4055386ac6f8 | Oswald | Ivanov | 1984-08-23 | 1990-01-03
4 | 29a270d7-669f-4bfd-ba4c-6928a6cb8698 | Stefan | Woody | 1988-12-11 | 1990-01-04
5 | 178e6828-0ad9-4d2d-a145-2c8ef831fcdf | Henry | Woody | 1983-04-25 | 1990-01-05
6 | a1acaee4-afae-4e52-b0e8-afd37e95cf30 | Bob | Daisy | 1983-12-16 | 1990-01-06
7 | b6fed3d8-6249-4419-8137-31c1eb06442a | Bob | Woody | 1980-01-20 | 1990-01-07
8 | ca603c46-9c1e-40d4-bc99-b9d59bd313c6 | Vennie | Ivanov | 1985-05-30 | 1990-01-08
9 | 8623ab68-929c-45ae-81ec-551a3611a11e | Henry | Ivanov | 1985-04-03 | 1990-01-09
10 | dc082b37-617b-4b87-8c94-bfcf3459f812 | Henry | Jack | 1988-04-03 | 1990-01-10
Table Size
After the data has been loaded, check the table size:
\dt+ new.students
Output:
Schema | Name | Type | Owner | Persistence | Access method | Size | Description
--------+----------+-------+----------+-------------+---------------+---------+-------------
new | students | table | postgres | permanent | heap | 8601 MB |
By following these steps, you’ll have a fully prepared PostgreSQL table with a large volume of test data, ready for performance or load testing!
Last modified on 2025-02-01