fix: add error handling to OpenAI-compatible serve endpoint by markstur · Pull Request #774 · generative-computing/mellea

markstur · 2026-04-01T22:29:16Z

Misc PR

Type of PR

Bug Fix
New Feature
Documentation
Other

Description

Link to Issue: Part of Create OpenAI API-compatible HTTP interface for mellea #521

Add proper exception handling to the chat completion endpoint in cli/serve/app.py to prevent unhandled exceptions from crashing the server. Returns appropriate HTTP status codes and error messages.

Changes:

Wrap endpoint logic in try-except block with specific handlers
AttributeError → 500 (missing output attributes)
ValueError → 400 (validation/input errors)
Exception → 500 (catch-all for unexpected errors)
Add comprehensive test coverage (5 unit tests, 100% error path coverage)

Fixes ensure the server returns OpenAI-compatible error responses instead of crashing on exceptions.

Testing

Tests added to the respective file if code was changed
New code has 100% coverage if code as added
Ensure existing tests and github automation passes (a maintainer will kick off the github automation when the rest of the PR is populated)

github-actions · 2026-04-01T22:29:27Z

The PR description has been updated. Please fill out the template for your PR to be reviewed.

ajbozarth · 2026-04-01T23:07:49Z

Claude found a few issues to look at:

Unused import (will fail ruff check) — cli/serve/app.py

Request is imported but never used:

from fastapi import FastAPI, Request  # Request is unused

Test isolation — routes accumulate on the global app — test/cli/test_serve_errors.py

Each test calls app.add_api_route(...) on the shared module-level app object without cleanup. Routes pile up across test runs, which can cause 422 conflicts or ordering-dependent failures. Each test should use a fresh FastAPI()
instance or remove the route in teardown.

Nits

Duplicate error handlers — cli/serve/app.py

AttributeError and the catch-all Exception produce identical responses (500 server_error). The AttributeError branch can be dropped — Exception already covers it and the distinction adds nothing for callers.

response_model=None — minor type safety loss; Union[ChatCompletion, OpenAIErrorResponse] would preserve schema generation, though it won't change runtime behavior in this pattern.

jakelorocco

This seems reasonable to me. I have a few concerns that maybe aren't valid / might not need to be addressed (I don't know if I'm completely up to date on the purpose of this OpenAI server, so these might have already been discussed):

Are these errors helpful to the end user of the server? For instance, if the ValueError is raised by having some non-conforming field / input, that makes sense to me.
Will these errors leak anything that an end user might not want leaked? This may be out of scope, but I'm not sure if we have any expectations that the names of requirements, internal mellea functionality doesn't get leaked through these endpoints. Maybe thats on the end user to implement if they need that obfuscation?

ajbozarth

This LGTM now, but I id discuss @jakelorocco concerns with Claude and this was what we came to, probably worth addressing in a comment before merging if Claude is correct that m serve is not designed for prod, if it it we should address it (could be in a follow up imho):

Helpfulness — The ValueError → 400 path is genuinely useful since it reflects bad input the caller can fix. The 500 path is less useful to an end user but fine for a local dev server like m serve.

Information leakage — This is the more interesting concern. The current code passes str(e) directly into the error response body. For a 500, that could expose internal details like mellea class names, attribute paths, or module
structure. For a production-facing deployment that would be a real issue, but m serve is a local developer tool, so leaking implementation details to localhost is low risk. Worth the author explicitly acknowledging that — "this
is a local dev server, not intended for production exposure" — so the reviewer knows it was considered rather than overlooked.

Add proper exception handling to the chat completion endpoint in cli/serve/app.py to prevent unhandled exceptions from crashing the server. Implements OpenAI API error format for the `m serve` endpoint to ensure compatibility with OpenAI client libraries and tools. Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

* remove unused import * fix FastAPI app route accumulation * remove duplicate error handler * add types for response_model Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

markstur · 2026-04-02T20:02:02Z

rebased and resolved conflicts

For the success case, return None for usage not a mock object. Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

markstur requested a review from a team as a code owner April 1, 2026 22:29

github-actions bot added the bug Something isn't working label Apr 1, 2026

markstur marked this pull request as draft April 1, 2026 22:36

markstur force-pushed the serve_error_handling branch from 7646265 to 907fb78 Compare April 1, 2026 22:48

markstur marked this pull request as ready for review April 1, 2026 22:50

jakelorocco reviewed Apr 2, 2026

View reviewed changes

ajbozarth approved these changes Apr 2, 2026

View reviewed changes

markstur added 2 commits April 2, 2026 12:34

fix: fixes for pr review comments

6c91522

* remove unused import * fix FastAPI app route accumulation * remove duplicate error handler * add types for response_model Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

markstur force-pushed the serve_error_handling branch from e172da1 to 6c91522 Compare April 2, 2026 20:00

fix: test_server_errors mock fix for failed CI

4777f5b

For the success case, return None for usage not a mock object. Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

psschwei enabled auto-merge April 3, 2026 15:21

psschwei added this pull request to the merge queue Apr 3, 2026

Merged via the queue into generative-computing:main with commit ecc15a6 Apr 3, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add error handling to OpenAI-compatible serve endpoint#774

fix: add error handling to OpenAI-compatible serve endpoint#774
psschwei merged 3 commits intogenerative-computing:mainfrom
markstur:serve_error_handling

markstur commented Apr 1, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

ajbozarth commented Apr 1, 2026

Uh oh!

jakelorocco left a comment

Uh oh!

ajbozarth left a comment

Uh oh!

markstur commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

markstur commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Misc PR

Type of PR

Description

Testing

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

ajbozarth commented Apr 1, 2026

Nits

Uh oh!

jakelorocco left a comment

Choose a reason for hiding this comment

Uh oh!

ajbozarth left a comment

Choose a reason for hiding this comment

Uh oh!

markstur commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

markstur commented Apr 1, 2026 •

edited

Loading