Secure Worker Bridge

OpenAI-kompatible Local-AI API

Nutze Julians lokale LM-Studio-Pipeline ueber eine sichere REST API - mit OpenAI-aehnlichen Chat-Completions und Responses.

Live testen Code kopieren

Clientjulian-neumann.org/apiWorker PullLM Studio lokal

3-Minuten-Test

Schnellstart

Base URLhttps://julian-neumann.org/api/openai.php/v1

AuthAuthorization: Bearer YOUR_API_KEY

FallbackX-API-Key: YOUR_API_KEY

Die API ist eine sichere Bridge. LM Studio und Gateway bleiben lokal auf 127.0.0.1; externe Clients sprechen nur mit julian-neumann.org.

Contract

Endpunkte

GET/openai.php/v1/modelsOpenAI-aehnliche Modellliste

POST/openai.php/v1/chat/completionsChat Completions mit Job/Wait Bridge

POST/openai.php/v1/responsesResponses API kompatible Textausgabe

POST/v1/chat.phpLegacy Chat Job API

POST/v1/summarize.phpLegacy Text Summary Job API

POST/v1/pdf-summarize.phpLegacy PDF Summary Job API

Try it

Interaktive Live-Konsole

Der API-Key wird nur im Browser fuer den Request verwendet. Speichern ist optional und nur in sessionStorage.

Bereit 0 ms HTTP -

Antworten erscheinen hier.

Request

{}

Response

{}

Copy/Paste

OpenAI-kompatible Beispiele

curl https://julian-neumann.org/api/openai.php/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "local/gemma-4-26b-a4b-it",
    "messages": [
      {"role": "system", "content": "Du antwortest kurz auf Deutsch."},
      {"role": "user", "content": "Was ist die Neumann AI API?"}
    ],
    "temperature": 0.2
  }'

$apiKey = "YOUR_API_KEY"
$body = @{
  model = "local/gemma-4-26b-a4b-it"
  messages = @(
    @{ role = "system"; content = "Du antwortest kurz auf Deutsch." },
    @{ role = "user"; content = "Was ist die Neumann AI API?" }
  )
  temperature = 0.2
} | ConvertTo-Json -Depth 10

Invoke-RestMethod "https://julian-neumann.org/api/openai.php/v1/chat/completions" `
  -Method POST `
  -Headers @{ Authorization = "Bearer $apiKey" } `
  -ContentType "application/json" `
  -Body $body

import requests

api_key = "YOUR_API_KEY"
response = requests.post(
    "https://julian-neumann.org/api/openai.php/v1/chat/completions",
    headers={"Authorization": f"Bearer {api_key}", "Content-Type": "application/json"},
    json={
        "model": "local/gemma-4-26b-a4b-it",
        "messages": [
            {"role": "system", "content": "Du antwortest kurz auf Deutsch."},
            {"role": "user", "content": "Was ist die Neumann AI API?"},
        ],
        "temperature": 0.2,
    },
    timeout=30,
)
print(response.json()["choices"][0]["message"]["content"])

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://julian-neumann.org/api/openai.php/v1",
)

response = client.chat.completions.create(
    model="local/gemma-4-26b-a4b-it",
    messages=[
        {"role": "system", "content": "Du antwortest kurz auf Deutsch."},
        {"role": "user", "content": "Was ist die Neumann AI API?"},
    ],
)

print(response.choices[0].message.content)

Responses

Responses API Beispiel

curl

curl https://julian-neumann.org/api/openai.php/v1/responses \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"local/gemma-4-26b-a4b-it","input":"Schreibe einen kurzen Test auf Deutsch."}'

Formate

Antwortformate

Chat Completions

Liefert object: chat.completion, choices[0].message.content und geschaetzte Usage.

Responses

Liefert object: response, output_text und output[0].content.

Async fallback

Wenn die lokale KI nicht innerhalb des Wait-Fensters fertig ist, kommt 202 mit Status- und Result-URL.

Security

Lokale KI bleibt lokal

LM Studio ist nicht oeffentlich. Der lokale Worker pollt ausgehend, spricht Gateway und LM Studio auf 127.0.0.1 an und schreibt Ergebnisse zur Domain zurueck. Speichere API-Keys im Browser nur bewusst temporaer.

Fehler

Typische Statuscodes

401invalid_api_keyKey fehlt oder ist falsch.

403insufficient_scopeKey hat nicht den passenden Scope.

202neumann.async_jobJob laeuft asynchron weiter.

429rate_limit_exceededZu viele Requests im Zeitfenster.

503runtime_unavailableLokaler Stack/Gateway/Worker nicht bereit.

400streaming_not_supportedStreaming ist fuer v1 bewusst deaktiviert.