Generating 1 Million PDFs in 10 Minutes • Erik Steiger

In the last year, I saw two companies struggling with generating documents in the form of PDFs. Both old, with a legacy tech stack that makes every new grad software dev wince.

One of these companies in the finance industry was forced to change as their new system couldn’t keep up with growing demands. While the old system was on-prem on servers in their basement, the new system was to be deployed in the cloud on AWS Lambda to reap the benefits of “infinite scaling” (or so they were promised). The project was supposed to be finished and go live in a matter of weeks—or at least that’s what management thought.

More than half a year later, with 5 full-time engineers on the project, the legacy system is still running full-time, with no end in sight.

I always found this project interesting. On the one hand due to its initial simplicity task to just render a pdf, but also because of the performance requirement that came along. After I heard how inefficient the new system is being implemented I was just thinking to myself “this can be implemented so much more efficiently”. And so I did.

This is the result—a guide on how to implement a high-performant & cost-efficient PDF rendering pipeline and deploy it.

If you are interested about the tech stack before you continue, here it is:

Rust, Terraform, AWS [SQS, S3, Lambda, API Gateway]

Huge shoutout to Typst, the underlying PDF typesetting engine for making this project possible.

You can find the code for the whole project on github: papermake-aws

So let’s get going 🚴🏻

Making millions in minutes, why?

In the financial industry, generating personalized reports at scale is a common challenge. Quarterly statements, tax documents, and trade confirmations often need to be generated for millions of customers within tight time windows. Waiting days for your trade confirmation is not only extremely inconvenient for customers but also not allowed by regulations. Eventually, customers will complain to the BaFin (German Federal Financial Supervisory Authority), and your company will get fined.

So let’s try to prevent these issues from happening.

Setting the stage

Our imaginary client, MoneyBank, needs to generate end-of-day trade confirmations for all transactions processed during market hours. With approximately 1 million trades per day, they need PDF generation to complete within minutes after market close (we will target 10 minutes because that’s sounds good). Preferably the infrastructure is cost-effective and scales with demand.

Just to make the point clear: generating 1 million PDFs in 10 minutes is no joke. That’s 1,667 PDFs per second, or ~0.6ms per PDF. With common PDF generators that take around 1 second each, we would need 11.5 compute days. Or 10 minutes times 1667 vCPUs—assuming they scale nicely.

Claude isn't that optimistic about the success of this project

Using EC2 On-Demand c6i.32xlarge instances (128 vCPUs each):

Need 13 instances
Cost: 6.208€ per hour per instance (eu-central-1)
Total: 13,45€ for 10 Minutes

We are going to shave that down to 0.35€, reducing the cost by 97%. But let’s start with the foundation.

Architecture Decisions

There are many technical architectures to make this happen. As mentioned in the introduction we will base our solution upon AWS Lambda, not because I think this is the easiest or most cost-efficient way, but to give an example how it could be done. Having decided on that the rest of the puzzle pieces come naturally.

The system consists of:

API Gateway: Entry-point to our rendering service.
SQS: Managing rendering jobs.
Lambda: Coordinating incoming requests as well as doing the actual PDF rendering.
S3: Holding our document templates and the final PDFs.

New Rendering Technology

You saw earlier that Claude estimated the latency of rendering a PDF at around ~1 second, or 100ms at best. Since I have neither the intention nor the budget to compensate for this slow performance with massive computational resources, we need a faster way.

Throughout my career, I’ve seen multiple PDF rendering implementations in companies. All of them would be too slow for our goal:

Puppeteer: ~1-2sec, due to headless browser startup overhead.
Crystal Reports: ~750-900ms, even worse has legacy windows dependencies (I wouldn’t touch this shit again).
LaTeX: ~500-800ms for compilation plus rendering.

You can get LaTeX a bit faster, but the bigger issue is that LaTeX comes with a huge compilation package and can be rather memory-hungry. These aren’t the best attributes for deploying to AWS Lambda.

The big benefit of using an actual typesetter like LaTeX over Crystal Reports is that your documents will always be laid out correctly. In Crystal Reports, you move around boxes with a fixed size where your data will be placed. If your customer’s name is longer than the box, it will simply be cut off.

Three years ago, I would have chosen LaTeX myself to build such a PDF rendering service, but since then we’ve gotten Typst—a very promising new typesetting system that is both fast and provides very helpful error messages.

I started working on Papermake, a PDF rendering library that makes use of Typst, and extends it with data based rendering and schema validation, and this guide makes use of it.

Creating the template

To begin with we have to design our template. I quickly designed a trade confirmation for our imaginary MoneyBank

The template definition looks as follows

1
---
2
id: trade_confirmation
3
---
4
#let trade_confirmation(data) = {
5
  // Set document properties
6
  set document(title: "Trade Confirmation", author: data.company.name)
7
  set page(
8
    paper: "a4",
9
    margin: 2.2cm
189 collapsed lines
10
  )
11

12
  // Typography settings
13
  set text(font: "Crimson Pro", size: 10pt)
14
  show heading: set text(font: "Source Sans Pro", weight: "bold")
15

16
  // Header with company information
17
  grid(
18
    columns: (auto, 1fr),
19
    gutter: 1em,
20
    if data.company.logo != none {
21
      image(data.company.logo, width: 4cm)
22
    } else {
23
      box(
24
        width: 6cm,
25
        inset: 10pt,
26
        fill: rgb("#000000"),
27
        text(weight: "bold", size: 20pt, fill: white)[#data.company.name]
28
      )
29
    },
30
    align(right)[
31
      #text(weight: "bold", size: 14pt)[#data.company.name] \
32
      #text(style: "italic")[#data.company.address] \
33
      #text(weight: "bold")[Tel: #data.company.phone] \
34
      #text(size: 8pt)[Banking License: BaFin-ID 54397]
35
    ]
36
  )
37

38
  // Title
39
  box(
40
    width: 100%,
41
    inset: (y: 8pt),
42
    fill: rgb("#000").lighten(90%),
43
    align(center)[
44
      #text(weight: "bold", size: 18pt)[TRADE CONFIRMATION]
45
    ]
46
  )
47

48
  v(0.5cm)
49

50
  // Customer information
51
  grid(
52
    columns: (1fr, 1fr),
53
    gutter: 1em,
54
    box(
55
      width: 100%,
56
      inset: 10pt,
57
      fill: rgb("#f5f5f5"),
58
      [
59
        #text(weight: "bold")[Customer Information:] \
60
        #data.customer.name \
61
        #data.customer.address \
62
        #text(style: "italic")[Email: #data.customer.email]
63
      ]
64
    ),
65
    box(
66
      width: 100%,
67
      inset: 10pt,
68
      fill: rgb("#f5f5f5"),
69
      [
70
        #text(weight: "bold")[Transaction Details:] \
71
        *Date:* #data.transaction.date \
72
        *Reference No:* #data.transaction.reference \
73
        *Currency:* #data.transaction.currency \
74
        *Client Account:* #data.transaction.client_code \
75
        *Commission Rate:* #data.transaction.commission_percent% \
76
        *Minimum Fee:* €#data.transaction.minimum_fee
77
      ]
78
    )
79
  )
80

81
  v(0.5cm)
82

83
  // Transaction Details Table
84
  box(
85
    width: 100%,
86
    fill: white,
87
    inset: 1pt,
88
    [
89
      #table(
90
        columns: (3fr, 1fr, 1fr, 1fr, 1.5fr, 1.5fr),
91
        inset: 8pt,
92
        align: (left, center, center, right, right, right),
93
        fill: (col, row) => if row == 0 { rgb("#000") } else if calc.odd(row) { rgb("#f5f5f5") } else { white },
94
        stroke: (x, y) => (
95
          if y == 0 { (bottom: 0.5pt + rgb("#0a3d62")) }
96
          else { (bottom: 0.2pt + rgb("#cccccc")) }
97
        ),
98

99
        [#text(fill: white, weight: "bold")[Security Name]],
100
        [#text(fill: white, weight: "bold")[Lots]],
101
        [#text(fill: white, weight: "bold")[Shares]],
102
        [#text(fill: white, weight: "bold")[Price €]],
103
        [#text(fill: white, weight: "bold")[Buy Amount €]],
104
        [#text(fill: white, weight: "bold")[Sell Amount €]],
105

106
        ..data.details.flatten().map(row => (
107
          [#row.stock],
108
          [#row.lots],
109
          [#row.shares],
110
          [#row.price],
111
          [#row.buy_amount],
112
          [#row.sell_amount]
113
        )).flatten(),
114
      )
115
    ]
116
  )
117

118
  v(0.5cm)
119

120
  // Transaction Summary
121
  grid(
122
    columns: (1fr, 1fr),
123
    gutter: 1em,
124
    [
125
      #text(size: 9pt)[
126
        #text(weight: "bold")[Trade Date:] #data.transaction.date \
127
        #text(weight: "bold")[Settlement Date:] 17 April 2025 \
128
        #text(weight: "bold")[Trading Venue:] XETRA \
129
        #text(weight: "bold")[Exchange Rate:] N/A (EUR)
130
      ]
131
    ],
132
    box(
133
      width: 100%,
134
      fill: rgb("#f5f5f5"),
135
      stroke: (left: 2pt + rgb("#000")),
136
      radius: (right: 4pt),
137
      inset: (x: 10pt, y: 8pt),
138
      [
139
        #table(
140
          columns: (auto, auto),
141
          inset: (left: 0pt, right: 0pt, top: 2pt, bottom: 2pt),
142
          align: (left, right),
143
          stroke: none,
144

145
          [Gross Amount:], [€ #data.summary.gross_amount],
146
          [Brokerage Fee:], [€ #data.summary.brokerage_fee],
147
          [VAT on Brokerage Fee (19%):], [€ #data.summary.vat_brokerage_fee],
148
          [Total Charges:], [€ #data.summary.total_charges],
149
          [Sales Tax:], [€ #data.summary.sales_tax],
150
          [Withholding Tax (25%):], [€ #data.summary.withholding_tax],
151
          table.hline(stroke: 0.5pt + gray),
152
          [#text(weight: "bold")[Total Amount:]],  [#text(weight: "bold")[€ #data.total_amount]],
153
          [#text(weight: "bold", fill: rgb("#555"))[Due Amount:]],  [#text(weight: "bold", fill: rgb("#555"))[€ #data.due_amount]],
154
        )
155
      ]
156
    )
157
  )
158

159
  v(0.8cm)
160

161
  box(
162
        width: 100%,
163
        fill: rgb("#f5f5f5").lighten(20%),
164
        inset: 8pt,
165
        [
166
          #text(size: 9pt, weight: "bold")[Settlement Information:] \
167
          #text(size: 8pt)[
168
            Please ensure sufficient funds are available in your account for settlement.
169
            All transactions are subject to MoneyBank's General Terms and Conditions.
170
            For inquiries, contact our trading desk at +49 30 8765 4322 or trading\@moneybank.eu
171
          ]
172
        ]
173
      )
174

175
      box(
176
        width: 100%,
177
        fill: rgb("#f5f5f5").lighten(20%),
178
        inset: 8pt,
179
        [
180
          #text(size: 9pt, weight: "bold")[Important Notes:] \
181
          #text(size: 8pt)[
182
            Transaction details available online at secure.moneybank.eu.
183
            Custody account statements are issued quarterly.
184
            Portfolio valuation available upon request
185
          ]
186
        ]
187
      )
188

189
  v(1fr)
190

191
  // Disclaimer
192
  box(
193
    width: 100%,
194
    inset: 8pt,
195
    fill: rgb("#f8f8f8"),
196
    radius: 2pt,
197
    text(size: 7.5pt)[
198
      Contents of this statement will be considered correct if no discrepancy is reported within 24 hours.
199
      Purchase or sales of rights could be canceled by MoneyBank at its discretion in accordance with regulatory requirements.
200
      This report is generated automatically and valid without signature per §126b BGB. MoneyBank is regulated by BaFin
201
      under license number 54397. All trades executed according to EU MiFID II regulations.
202
      MoneyBank AG • Kantstraße 123 • 10623 Berlin • Germany • Commercial Register: HRB 123456 B • VAT ID: DE987654321
203
    ]
204
  )
205
}
206

207
#let data = json.decode(sys.inputs.data)
208
#trade_confirmation(data)

Two things to notice

The template definition is split into frontmatter that Papermake expects with some metadata and the typst markdown
Notice the usage of variables like #data.customer.name. This will be interpolated by Papermake with the data we are providing it in the request.

For example the document above is rendered with the data

{
  "company": {
    "logo": null,
    "name": "MoneyBank",
    "address": "Kantstraße 123, 10623 Berlin, Germany",
    "phone": "+49 30 8765 4321"
  },
  "customer": {
    "name": "Anneliese Süßebier",
    "address": "Rochus-Klingelhöfer-Platz 6, 95807 Tuttlingen, Germany",
    "email": "a.süßebier@hotmail.de"
  },
  "transaction": {
    "date": "05 April 2025",
    "reference": "MB-TR-25040562",
    "currency": "EUR",
    "client_code": "MB-C74985",
    "commission_percent": "0.19",
    "minimum_fee": "8.27"
33 collapsed lines
  },
  "details": [
    {
      "stock": "BASF SE (BAS.DE)",
      "lots": "2",
      "shares": "75",
      "price": "53,71",
      "buy_amount": "4.028,58",
      "sell_amount": ""
    },
    {
      "stock": "Siemens AG (SIE.DE)",
      "lots": "3",
      "shares": "125",
      "price": "177,08",
      "buy_amount": "",
      "sell_amount": "22.134,51"
    },
    {
      "stock": "Daimler AG (DAI.DE)",
      "lots": "3",
      "shares": "75",
      "price": "76,17",
      "buy_amount": "5.712,79",
      "sell_amount": ""
    },
    {
      "stock": "Allianz SE (ALV.DE)",
      "lots": "1",
      "shares": "125",
      "price": "228,33",
      "buy_amount": "28.541,83",
      "sell_amount": ""
    }
  ],
  "summary": {
    "gross_amount": "-16.148,68",
    "brokerage_fee": "30,68",
    "vat_brokerage_fee": "5,83",
    "total_charges": "36,51",
    "sales_tax": "0,00",
    "withholding_tax": "0,00"
  },
  "total_amount": "-16.185,20",
  "due_amount": "-16.185,20"
}

Papermake allows for schema validation, ensuring all required data fields are present, which is definitely advisable in production, but I am going to omit that here.

Implementing our two lambda functions

We implement both of our lambdas in Rust using cargo-lambda. Rust compiles to a native binary so there are no dependencies to a runtime and no coldstart wait times with for example JVM.

The following code examples are not complete. I only included the main logic, that I wanted to highlight. You can find the complete source code on github.

Request handler

The request-handler lambda has only one job: Receive a request from API Gateway with a batch of render definitions and pass them to SQS.

1
use aws_lambda_events::{apigw::{ApiGatewayProxyRequest, ApiGatewayProxyResponse}, encodings::Body};
2
use serde::{Deserialize, Serialize};
3
use serde_json::json;
4
use uuid::Uuid;
5
use lambda_runtime::{service_fn, LambdaEvent, Error, run};
6

7
#[derive(Deserialize)]
8
struct RenderRequest {
9
    template_id: String,
10
    data: serde_json::Value,
11
}
12

13
#[derive(Serialize)]
14
struct RenderJob {
15
    job_id: String,
16
    template_id: String,
17
    data: serde_json::Value,
18
}
19

20
#[tokio::main]
21
async fn main() -> Result<(), Error> {
22
    tracing_subscriber::fmt()
23
        .with_ansi(false)
24
        .without_time()
25
        .with_max_level(tracing::Level::INFO)
26
        .init();
27

28
    run(service_fn(function_handler)).await
29
}
30

31
async fn function_handler(event: LambdaEvent<ApiGatewayProxyRequest>) -> Result<ApiGatewayProxyResponse, Error> {
32
    // Parse request
33
    let body = event.payload.body.unwrap();
34
    let request: RenderRequest = serde_json::from_str(body.as_str())?;
35

36
    let queue_url = std::env::var("QUEUE_URL").expect("QUEUE_URL must be set");
37

38
    // Generate job ID
39
    let job_id = Uuid::new_v4().to_string();
40

41
    // Create job and send to SQS
42
    let job = RenderJob {
43
        job_id: job_id.clone(),
44
        template_id: request.template_id.clone(),
45
        data: request.data.clone(),
46
    };
47

48
    let config = aws_config::load_from_env().await;
49
    let sqs_client = aws_sdk_sqs::Client::new(&config);
50

51
    // Send to SQS and return immediately
52
    sqs_client.send_message()
53
        .queue_url(&queue_url)
54
        .message_body(serde_json::to_string(&job)?)
55
        .send()
56
        .await?;
57

58
    // Return job ID immediately
59
    Ok(ApiGatewayProxyResponse {
60
        status_code: 202, // Accepted
61
        body: Some(Body::Text(json!({"job_id": job_id, "status": "queued"}).to_string())),
62
        is_base64_encoded: false,
63
        ..Default::default()
64
    })
65
}

I had a serialization error of the LambdaEvent and couldn’t figure that out for hours. Turns out you should keep your crates, especially aws_lambda_events up to date.

Renderer

The Lambda function to render PDF first get’s the render from SQS, renders the PDF using papermake and then uploads the PDF to S3

1
use aws_lambda_events::sqs::SqsEvent;
2
use lambda_runtime::{run, service_fn, Error, LambdaEvent};
3
use serde::{Deserialize, Serialize};
4
use std::env;
5
use thiserror::Error;
6

7
#[derive(Debug, Deserialize, Serialize)]
8
struct RenderJob {
9
    job_id: String,
10
    template_id: String,
11
    data: serde_json::Value,
12
}
13

14
#[derive(Error, Debug)]
15
pub enum RenderError {
16
    #[error("Failed to parse job: {0}")]
17
    JobParseError(String),
18
    #[error("Failed to render PDF: {0}")]
19
    RenderingError(String),
20
    #[error("S3 operation failed: {0}")]
21
    S3Error(String),
22
    #[error("Environment variable not found: {0}")]
23
    EnvVarError(String),
24
}
25

26
async fn function_handler(event: LambdaEvent<SqsEvent>) -> Result<(), Error> {
27
    let templates_bucket = env::var("TEMPLATES_BUCKET")
28
        .map_err(|_| RenderError::EnvVarError("TEMPLATES_BUCKET".to_string()))?;
29
    let results_bucket = env::var("RESULTS_BUCKET")
30
        .map_err(|_| RenderError::EnvVarError("RESULTS_BUCKET".to_string()))?;
31

32
    // Create S3 client
33
    let config = aws_config::load_from_env().await;
34
    let s3_client = aws_sdk_s3::Client::new(&config);
35

36
    // Process each message from SQS
37
    for record in event.payload.records {
38
        let message_body = record.body.as_ref()
39
            .ok_or_else(|| RenderError::JobParseError("Empty message body".to_string()))?;
40

41
        // Parse the job from the message
42
        let job: RenderJob = match serde_json::from_str(message_body) {
43
            Ok(job) => job,
44
            Err(e) => {
45
                eprintln!("Failed to parse job: {}", e);
46
                continue; // Skip this message and move to the next one
47
            }
48
        };
49

50
        println!("Processing job {}: template={}", job.job_id, job.template_id);
51

52
        // Get template from S3
53
        let template_result = s3_client
54
            .get_object()
55
            .bucket(&templates_bucket)
56
            .key(&job.template_id)
57
            .send()
58
            .await;
59

60
        let template = match template_result {
61
            Ok(t) => t,
62
            Err(e) => {
63
                eprintln!("Failed to fetch template {}: {}", job.template_id, e);
64
                continue;
65
            }
66
        };
67

68
        let template_data = match template.body.collect().await {
69
            Ok(data) => data.to_vec(),
70
            Err(e) => {
71
                eprintln!("Failed to read template data: {}", e);
72
                continue;
73
            }
74
        };
75

76
        // Render PDF using papermake
77
        let render_result = match render_pdf(
78
            &job.template_id,
79
            &template_data.as_slice(),
80
            &job.data,
81
        ) {
82
            Ok(result) => result,
83
            Err(e) => {
84
                eprintln!("Rendering error: {}", e);
85
                continue;
86
            }
87
        };
88

89
        if let None = render_result.pdf {
90
            eprintln!("Rendering result is None for job {}", job.job_id);
91
            continue;
92
        }
93

94
        let pdf = render_result.pdf.unwrap();
95

96
        // Upload PDF to S3
97
        match s3_client
98
            .put_object()
99
            .bucket(&results_bucket)
100
            .key(format!("{}.pdf", job.job_id))
101
            .body(pdf.into())
102
            .send()
103
            .await
104
        {
105
            Ok(_) => println!("Successfully uploaded PDF for job {}", job.job_id),
106
            Err(e) => eprintln!("Failed to upload PDF for job {}: {}", job.job_id, e),
107
        }
108
    }
109

110
    // Return OK to acknowledge processing of all messages
111
    Ok(())
112
}
113

114
#[tokio::main]
115
async fn main() -> Result<(), Error> {
116
    tracing_subscriber::fmt()
117
        .with_ansi(false)
118
        .without_time()
119
        .with_max_level(tracing::Level::INFO)
120
        .init();
121

122
    run(service_fn(function_handler)).await
123
}
124

125
// Helper function to render PDF using papermake
126
fn render_pdf(
127
    id: &str,
128
    template_data: &[u8],
129
    data: &serde_json::Value,
130
) -> Result<papermake::render::RenderResult, Box<dyn std::error::Error>> {
131
    // Initialize papermake renderer
132
    let template_data = String::from_utf8(template_data.to_vec())?;
133
    let template = papermake::Template::from_file_content(id, &template_data)?;
134

135
    // Render PDF
136
    let result = papermake::render_pdf(&template, data, None)?;
137

138
    Ok(result)
139
}

Both lambdas are compiled to arm64 using the release flag and then zipped.

1
cargo lambda build --release --arm64

Don’t forget to add the needed fonts to the pdf-renderer.

Terraform definition

Having the whole stack as IaC is a pure blessing. Not only did it allow me to iterate rather quickly but I can also easily tear the whole infrastructure down on a single click—without leaving a mess of things in AWS that eat up your budget.

Let’s look at the terraform definition that creates all the needed infrastructure. The terraform module takes care of creating the S3 buckets, SQS queue, Lambda functions and API Gateway.

1
module "pdf_renderer" {
2
  source = "../../modules/pdf_renderer"
3

4
  environment = "dev"
5
  project_name = "papermake-pdf"
6

7
  # S3 bucket configurations
8
  templates_bucket_name = "papermake-templates-dev"
9
  results_bucket_name = "papermake-results-dev"
10

11
  # SQS queue configuration
12
  queue_name = "pdf-render-queue-dev"
13

14
  # Lambda configurations
15
  render_lambda_memory = 256 # can be increased if needed
16
  render_lambda_timeout = 300  # 5 minutes
17

18
  # API Gateway configuration
19
  api_name = "pdf-renderer-api-dev"
20
  api_stage = "v1"
21
}

The Lambda function uses a custom runtime based on Amazon Linux 2023 and runs on ARM64 architecture for better cost efficiency

1
locals {
2
  common_tags = {
3
    Environment = var.environment
4
    Project     = var.project_name
5
    ManagedBy   = "terraform"
6
  }
7
}
8

9
# S3 Buckets
10
resource "aws_s3_bucket" "templates" {
11
  bucket = var.templates_bucket_name
12
  tags   = local.common_tags
13
}
14

15
resource "aws_s3_bucket" "results" {
16
  bucket = var.results_bucket_name
17
  tags   = local.common_tags
18
}
19

20
# SQS Queue
21
resource "aws_sqs_queue" "render_queue" {
22
  name                       = var.queue_name
23
  visibility_timeout_seconds = 900  # 15 minutes
24
  message_retention_seconds  = 1209600  # 14 days
25
  tags                       = local.common_tags
26
}
27

28
# Request Handler Lambda Function
29
resource "aws_lambda_function" "request_handler" {
30
  filename         = "../../../lambda_functions/request_handler/pdf_request_handler.zip"
31
  function_name    = "${var.project_name}-request-handler-${var.environment}"
32
  role             = aws_iam_role.request_handler_role.arn
33
  handler          = "bootstrap"
34
  architectures    = ["arm64"]
35
  runtime          = "provided.al2023"
36
  memory_size      = var.request_handler_memory
37
  timeout          = var.request_handler_timeout
38
  source_code_hash = filebase64sha256("../../../lambda_functions/request_handler/pdf_request_handler.zip")
39

40
  environment {
41
    variables = {
42
      QUEUE_URL = aws_sqs_queue.render_queue.url
43
    }
44
  }
45

46
  tags = local.common_tags
47
}
48

49
# Renderer Lambda Function
50
resource "aws_lambda_function" "renderer" {
51
  filename         = "../../../lambda_functions/renderer/pdf_renderer.zip"
52
  function_name    = "${var.project_name}-renderer-${var.environment}"
53
  role             = aws_iam_role.renderer_role.arn
54
  handler          = "bootstrap"
55
  architectures    = ["arm64"]
56
  runtime          = "provided.al2023"
57
  memory_size      = var.renderer_memory
58
  timeout          = var.renderer_timeout
59
  source_code_hash = filebase64sha256("../../../lambda_functions/renderer/pdf_renderer.zip")
60

61
  environment {
62
    variables = {
63
      TEMPLATES_BUCKET = aws_s3_bucket.templates.id
64
      RESULTS_BUCKET   = aws_s3_bucket.results.id
65
      FONTS_DIR        = "fonts"
66
    }
67
  }
68

69
  tags = local.common_tags
70
}
71

72
# API Gateway
73
resource "aws_apigatewayv2_api" "main" {
74
  name          = var.api_name
75
  protocol_type = "HTTP"
76
  description   = "PDF Renderer API"
77
}
78

79
resource "aws_apigatewayv2_stage" "main" {
80
  api_id      = aws_apigatewayv2_api.main.id
81
  name        = var.api_stage
82
  auto_deploy = true
83
}
84

85
# API Gateway Integration with Request Handler Lambda
86
resource "aws_apigatewayv2_integration" "request_handler" {
87
  api_id           = aws_apigatewayv2_api.main.id
88
  integration_type = "AWS_PROXY"
89

90
  connection_type     = "INTERNET"
91
  description         = "Request Handler Lambda integration"
92
  integration_method  = "POST"
93
  integration_uri     = aws_lambda_function.request_handler.invoke_arn
94
}
95

96
# API Gateway Route
97
resource "aws_apigatewayv2_route" "render_pdf" {
98
  api_id    = aws_apigatewayv2_api.main.id
99
  route_key = "POST /render"
100
  target    = "integrations/${aws_apigatewayv2_integration.request_handler.id}"
101
}
102

103
# Request Handler Lambda Permission for API Gateway
104
resource "aws_lambda_permission" "api_gw" {
105
  statement_id  = "AllowAPIGatewayInvoke"
106
  action        = "lambda:InvokeFunction"
107
  function_name = aws_lambda_function.request_handler.function_name
108
  principal     = "apigateway.amazonaws.com"
109
  source_arn    = "${aws_apigatewayv2_api.main.execution_arn}/*/*"
110
}
111

112
# Lambda Event Source Mapping for Renderer
113
resource "aws_lambda_event_source_mapping" "sqs_trigger" {
114
  event_source_arn = aws_sqs_queue.render_queue.arn
115
  function_name    = aws_lambda_function.renderer.arn
116
  batch_size       = 1
117
}

After deploying the whole stack using terraform apply we can test if with a sample request:

curl --request POST \
  --url https://xxxxxxxxx.execute-api.eu-central-1.amazonaws.com/v1/render \
  --header 'Content-Type: application/json' \
  --data '{
  "jobs": [
    {
      "template_id": "trade_confirmation",
      "data": {
        "company": {
          "logo": null,
          "name": "MoneyBank 2",
          "address": "Kantstraße 123, 10623 Berlin, Germany",
          "phone": "+49 30 8765 4321"
        },
        ...
        "summary": {
          "gross_amount": "16,441.40",
          "brokerage_fee": "27.87",
          "vat_brokerage_fee": "5.30",
          "total_charges": "33.17",
          "sales_tax": "0.00",
          "withholding_tax": "45.62"
        },
        "total_amount": "16,362.61",
        "due_amount": "16,362.61"
      }
    }
  ]
}'

Getting back

HTTP/1.1 202 Accepted
Apigw-Requestid: Jal4uiuOFiAEJ7Q=
Connection: close
Content-Length: 70
Content-Type: text/plain; charset=utf-8
Date: Tue, 22 Apr 2025 08:15:52 GMT

{"job_ids":["556393ce-74c6-4115-87d7-ed9ad6856d2a"],"status":"queued"}

Looking into the logs of our rendere we see the whole lambda with fetching the template and rendering it took 141ms. Not bad, but also not good enough. We still have some work to do.

Performance Tuning

To achieve our goal of 1 million PDFs in under 10 minutes without breaking the bank, we needed to optimize several aspects of the system:

1. Lambda Concurrency

AWS Lambda has default concurrency limits that need to be increased. I deployed all of this on a brand new AWS account, so I had a limit of only 10 unreserved concurrent invocations. But a performance-ready configuration would look like this:

1
# Auto-scaling for the renderer Lambda based on SQS queue metrics
2
resource "aws_appautoscaling_target" "lambda_target" {
3
  max_capacity       = 1000  # Maximum number of concurrent Lambda instances
4
  min_capacity       = 5     # Minimum number of concurrent Lambda instances
5
  resource_id        = "function:${aws_lambda_function.renderer.function_name}:${aws_lambda_function.renderer.version}"
6
  scalable_dimension = "lambda:function:ProvisionedConcurrency"
7
  service_namespace  = "lambda"
8
}
9

10
# Scale up policy - Add more provisioned concurrency when queue depth increases
11
resource "aws_appautoscaling_policy" "scale_up" {
12
  name               = "scale-up-${var.environment}"
13
  policy_type        = "TargetTrackingScaling"
14
  resource_id        = aws_appautoscaling_target.lambda_target.resource_id
15
  scalable_dimension = aws_appautoscaling_target.lambda_target.scalable_dimension
16
  service_namespace  = aws_appautoscaling_target.lambda_target.service_namespace
17

18
  target_tracking_scaling_policy_configuration {
19
    predefined_metric_specification {
20
      predefined_metric_type = "LambdaProvisionedConcurrencyUtilization"
21
    }
22

23
    target_value       = 0.75  # Try to keep utilization around 75%
24
    scale_in_cooldown  = 120   # Wait 2 minutes before scaling in
25
    scale_out_cooldown = 30    # Only wait 30 seconds before scaling out
26
  }
27
}

2. Caching

We already have very fast cold-starts due to Rust compiling to a native binary, but creating a connection to S3 or SQS still takes some double-digit milliseconds that we could cache between hot invocations.

Not only that, assuming we are rendering a large amount of the same template just with different data (as we are doing), we can also cache the template and can save on the roundtrip to S3. And now the final ingredient: We can also cache part of the compilation of the template (referred to as world caching in the code).

1
// Shared resources across invocations
2
#[derive(Debug)]
3
struct SharedResources {
4
    s3_client: aws_sdk_s3::Client,
5
    templates_bucket: String,
6
    results_bucket: String,
7
    template_cache: RwLock<HashMap<String, Vec<u8>>>,
8
    world_cache: RwLock<HashMap<String, Arc<Mutex<TypstWorld>>>>,
9
}
10

11
static RESOURCES: OnceCell<Arc<SharedResources>> = OnceCell::const_new();
12

13
// Initialize resources asynchronously
14
async fn initialize_resources() -> Arc<SharedResources> {
15
    // Read environment variables
16
    let templates_bucket = env::var("TEMPLATES_BUCKET")
17
        .expect("TEMPLATES_BUCKET environment variable not set");
18
    let results_bucket = env::var("RESULTS_BUCKET")
19
        .expect("RESULTS_BUCKET environment variable not set");
20

21
    // Initialize AWS client
22
    let config = aws_config::load_from_env().await;
23
    let s3_client = aws_sdk_s3::Client::new(&config);
24

25
    // Create and return resources
26
    Arc::new(SharedResources {
27
        s3_client,
28
        templates_bucket,
29
        results_bucket,
30
        template_cache: RwLock::new(HashMap::new()),
31
        world_cache: RwLock::new(HashMap::new()),
32
    })
33
}

3. Batching

The network latency from sending 1 million requests to API Gateway is already high, which would be a major bottleneck when performance testing this setup. Additionally, SQS has the ability to send batches of jobs to our rendering lambda.

1
// Group data files by template to reduce template loading overhead
2
fn group_data_by_template(data_files: &[String]) -> HashMap<String, Vec<String>> {
3
    let mut groups = HashMap::new();
4

5
    for data_file in data_files {
6
        // Extract template identifier from data filename pattern
7
        let template_key = extract_template_key(data_file);
8
        groups.entry(template_key).or_default().push(data_file.clone());
9
    }
10

11
    groups
12
}

Results

With all those improvements implemented, we get the following preliminary results:

Requests: 1000
Total processing time: 11 seconds
Throughput: 91 PDFs / second

The total processing time is measured until all PDFs are uploaded to S3. Note, that this is still with only 10 unreserved concurrent invocations that both lambdas share.

We are still below our target of 1,667 PDFs/second, but this should easily scale to our target as soon as AWS increases my quota limit.

Knowing that our renderer takes roughly 35ms per PDF (hot-started, template cached, world cached), we would only need 60 concurrent invocations to get to 0.6ms per PDF.

Cost calculation

For 1 million Lambda invocations with:

Memory: 256 MB (0.25 GB)
Billed Duration: 35 ms per invocation

This results in:

Total GB-seconds: 8,750
Compute cost: $0.15
Invocation cost: $0.20
Total cost without free tier: $0.35

You could see this as a pessimistic calculation as we ignored the batching which would result in fewer invocations and the same GB-seconds. Pretty cheap. All the testing I did was still covered by the free tier. The first free tier quota that got exceeded was S3, with 2,000 requests (us saving PDFs).

Next Steps

Honestly I would love to battle test this with actually rendering 1 million PDFs, but for now I am waiting for the quota increase on concurrent invocations.

A real production PDF pipeline at a company would probably require additional bells and whistles like

SQS routing based on template id, given you have many different templates to make use of caching without overwhelming it.
Adding real-time monitoring, alerting for rendering failures, and retry logic
Multi-region deployment for disaster recovery
Implementing document signing and encryption layers

This project was meant to demonstrate the power of Rust-based PDF generation with a serverless architecture.

Want to build something similar? Check out the Papermake library I am working on, powered by the amazing Typst typesetting engine. You can also explore the complete source code for this implementation.