DSP Experiments

The DSPy library (Declarative Self-improving Python, referred to as DSP in this chapter) represents an interesting shift in how we design, test, and deploy prompts for language models. Instead of treating prompt engineering as an ad hoc, trial-and-error process, DSP provides a structured framework that lets developers express prompts, model interfaces, and evaluation strategies declaratively. This design puts reproducibility and modularity first: users define what a model should do rather than getting lost in the procedural details of how to make it happen. At its core, DSP treats prompts, model parameters, and outputs as composable objects that can be versioned, tested, and reused across different workflows or projects.

One of DSP’s most powerful aspects is its flexibility in integrating with different model backends. Whether you’re working with OpenAI or Gemini APIs, local inference servers, or fine-tuned models, DSP abstracts away the complexities of each environment. This is where Ollama becomes particularly compelling—it provides a seamless interface for running large language models locally with remarkable efficiency. Using DSP with Ollama, developers can declaratively specify prompts that interface directly with local models, allowing for private, offline experimentation while maintaining the same declarative patterns used with cloud-hosted models. The combination of DSP’s prompt modularity and Ollama’s local inference capabilities creates a powerful workflow for developers who want fine-grained control over both their model logic and their execution environment.

DSP Uses Pydantic for Type Signatures - a First Ollama Example

The following Ollama DSP code leverages Pydantic-style typing to enforce structured input and output validation for prompts and model responses. In DSP, a Signature class such as MathProblem declares its expected input and output schema through typed field annotations, while the class docstring supplies the task instructions. The same schema can also be written inline as a string, for example "question -> answer: list[float]". Because dspy.Signature builds on Pydantic models, each field (e.g., question, answer: list[float]) gains type checking, serialization, and conversion. This ensures that when a model produces an output, DSP can validate and coerce the response into the correct Python type: here, a list[float] for answer.

By doing this, DSP tightly integrates language model reasoning with Python’s data model, allowing structured validation and predictable data flow across model calls. Pydantic typing not only helps catch mismatched or ill-formed responses but also provides self-documenting clarity for developers—making each prompt specification both executable and strongly typed. This makes DSP code more robust and maintainable, particularly in complex prompt pipelines or when integrating multiple model components.
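The validation and coercion described above can be seen with Pydantic on its own, without calling a model. The following is a minimal sketch, not DSP's internal code: the MathAnswer class and the validate_answer helper are hypothetical names introduced for illustration, and the sketch assumes the pydantic package is installed.

```python
from pydantic import BaseModel, ValidationError


class MathAnswer(BaseModel):
    """Hypothetical schema mirroring 'question -> answer: list[float]'."""

    question: str
    answer: list[float]


def validate_answer(raw_values):
    """Return the values as a list of floats, or None if validation fails."""
    try:
        return MathAnswer(question="demo", answer=raw_values).answer
    except ValidationError as exc:
        print(f"rejected {raw_values!r}: {len(exc.errors())} error(s)")
        return None


# Numeric strings and ints are coerced to floats
print(validate_answer(["0.5", 2]))
# A non-numeric string cannot be coerced, so validation fails
print(validate_answer(["seven"]))
```

The same idea applies when a model returns text: values that can be converted to the declared type are accepted, and anything else is reported rather than silently passed along.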

Here is the documentation for DSP type signatures: https://dspy.ai/learn/programming/signatures/. The example is in file Ollama_in_Action_Book/source-code/DSP/ollama_test.py:

import os
import sys
from pathlib import Path

# Make the parent directory importable so that ollama_config can be found
ROOT = Path(__file__).resolve().parents[1]
if str(ROOT) not in sys.path:
    sys.path.insert(0, str(ROOT))

import dspy

from ollama_config import get_model

_model_name = get_model()

# Use the hosted Ollama service when CLOUD is set, otherwise a local server
if os.environ.get("CLOUD"):
    api_key = os.environ.get("OLLAMA_API_KEY", "")
    lm = dspy.LM(
        f"ollama_chat/{_model_name}",
        api_base="https://ollama.com",
        api_key=api_key,
        temperature=1.0,
        max_tokens=4096,
    )
else:
    lm = dspy.LM(
        f"ollama_chat/{_model_name}",
        api_base="http://localhost:11434",
        api_key="ollama",
        temperature=1.0,
        max_tokens=4096,
    )

dspy.configure(lm=lm)

class MathProblem(dspy.Signature):
    """Answer the math question with a list of numbers."""
    # In a class-based signature the docstring is the task instruction;
    # the typed fields below define the input and the output (list[float]).
    question: str = dspy.InputField()
    answer: list[float] = dspy.OutputField()

class ChainOfThoughtMath(dspy.Module):
    def __init__(self):
        super().__init__()
        # Use dspy.ChainOfThought to implement the MathProblem signature
        self.prog = dspy.ChainOfThought(MathProblem)

    def forward(self, question):
        return self.prog(question=question)

# The inline string signature declares the same fields as MathProblem;
# math_model = ChainOfThoughtMath() could be used here instead.
math_model = dspy.ChainOfThought("question -> answer: list[float]")
question_text = "Two dice are tossed. Give me a list of the three most probable rolls."
prediction = math_model(question=question_text)

print(f"Question: {question_text}")
print(f"Reasoning: {prediction.reasoning}")
print(f"Answer: {prediction.answer}")

In this code DSP uses Pydantic typing to define and enforce a structured interface between the prompt and the model’s response. The signature specifies that each prompt must include a question and that the model must return an answer of type list[float]. When the prediction runs, DSP checks the model’s output against this schema, coercing or flagging values as needed. This structured approach keeps outputs consistent, catches malformed responses early, and makes it easier to compose reliable model pipelines, such as when generating a list of numeric results from reasoning-based prompts.
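To make the inline form "question -> answer: list[float]" concrete, here is a toy parser that splits such a string into input and output fields. It is only an illustration of the notation, not DSP's actual parser: the parse_signature function is a hypothetical name, and the sketch assumes that a field without a type annotation defaults to str.

```python
def parse_signature(spec: str) -> tuple[dict[str, str], dict[str, str]]:
    """Split 'a, b: int -> c: list[float]' into input and output field maps.

    Toy limitation: fields are split on commas, so types that themselves
    contain commas (for example dict[str, float]) are not handled.
    """
    left, arrow, right = spec.partition("->")
    if not arrow:
        raise ValueError("a signature must contain '->'")

    def parse_fields(part: str) -> dict[str, str]:
        fields = {}
        for item in part.split(","):
            name, _, type_name = item.partition(":")
            # Assumption: an untyped field is treated as a string
            fields[name.strip()] = type_name.strip() or "str"
        return fields

    return parse_fields(left), parse_fields(right)


inputs, outputs = parse_signature("question -> answer: list[float]")
print("inputs:", inputs)
print("outputs:", outputs)
```

Reading the string this way shows why the example needs only one line to declare both the question input and the typed answer output.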

Here is sample output:

$ uv run ollama_test.py
Question: Two dice are tossed. Give me a list of the three most probable rolls.
Reasoning: When two dice are tossed, there are 36 possible outcomes. The sum of the dice (roll) with the highest probability is 7, which has 6 combinations. The next most probable sums are 6 and 8, each with 5 combinations. Thus, the three most probable rolls are 7, 6, and 8. The probabilities are calculated as (number of combinations)/36.
Answer: [0.16666666666666666, 0.1388888888888889, 0.1388888888888889]
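Note that the model returned the probabilities of the three most likely sums (7, 6, and 8) rather than the sums themselves, which still satisfies the list[float] schema. The numbers can be checked independently by enumerating all 36 outcomes. This is a small verification sketch using only the standard library; the most_probable_sums function is a hypothetical helper name.

```python
from collections import Counter
from itertools import product


def most_probable_sums(n: int) -> list[tuple[int, float]]:
    """Return the n most probable sums of two dice with their probabilities."""
    # Count how many of the 36 equally likely outcomes produce each sum
    counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))
    # Sort by count (descending), breaking ties by the smaller sum first
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(total, ways / 36) for total, ways in ranked[:n]]


for total, probability in most_probable_sums(3):
    print(f"sum {total}: {probability}")
```

The probabilities for sums 7, 6, and 8 are 6/36, 5/36, and 5/36, matching the values in the sample answer.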

Optional Practice Problems

Text Classification Signature. Create a new DSPy signature class called TextClassifier where the input is a news article text and the output is a category: str (e.g. Sports, Tech, Business) and a sentiment: float from -1.0 to 1.0. Run a script using this signature to classify a few test paragraphs.
Sequential Multi-stage Pipeline. Implement a two-step DSPy pipeline. Step 1 should take a user question and output a list of keywords. Step 2 should take the original question and the keywords generated in Step 1 to produce the final detailed answer. Verify that the outputs match the structured signatures.
Predict vs. ChainOfThought. Modify ollama_test.py to compare standard dspy.Predict and dspy.ChainOfThought on a riddle or logical reasoning problem. Print both predictions to the console and compare them. Note whether the answer format is cleaner or the reasoning is more accurate with ChainOfThought.
Structured JSON List Output. Define a DSPy signature that takes a paragraph describing a meeting and outputs a structured list of action items, where each action item contains an owner: str and a description: str. Verify that DSPy enforces the output schema correctly when calling your local model.
