Please update the teleprompters.md documentation in <

🚀 Here's the PR! <a href="https://gith

Comments (1)

sweep-ai commented on June 21, 2024

🚀 Here's the PR! #62

See Sweep's progress at the progress dashboard!

💎 Sweep Pro: I'm using GPT-4. You have unlimited GPT-4 tickets. (tracking ID: f34240e60d)

Actions (click)

↻ Restart Sweep

Sandbox Execution ✓

Here are the sandbox execution logs prior to making any changes:

Sandbox logs for 94d03ce

Checking docs/teleprompters/teleprompters.md for syntax errors... ✅ docs/teleprompters/teleprompters.md has no syntax errors! 1/1 ✓
Checking docs/teleprompters/teleprompters.md for syntax errors...
✅ docs/teleprompters/teleprompters.md has no syntax errors!

Sandbox passed on the latest main, so sandbox checks will be enabled for this issue.

Step 1: 🔎 Searching

I found the following snippets in your repository. I will now analyze these snippets and come up with a plan.

Some code snippets I think are relevant in decreasing order of relevance (click to expand). If some file is missing from here, you can mention the path in the ticket description.

dspy/docs/teleprompters/teleprompters.md

Lines 2 to 281 in 94d03ce

    
           Teleprompters are powerful optimizers (included in DSPy) that can learn to bootstrap and select effective prompts for the modules of any program. (The "tele-" in the name means "at a distance", i.e., automatic prompting at a distance.) 
        
           This documentation provides an overview of the DSPy Teleprompters. 
        
           ## Teleprompters 
        
           | Module | Jump To | 
        
           | --- | --- | 
        
           | LabeledFewShot | [LabeledFewShot Section](#telepromptlabeledfewshot) | 
        
           | BootstrapFewShot | [BootstrapFewShot Section](#telepromptbootstrapfewshot) | 
        
           | Ensemble | [Ensemble Section](#telepromptensemble) | 
        
           | BootstrapFewShotWithRandomSearch | [BootstrapFewShotWithRandomSearch Section](#telepromptbootstrapfewshotwithrandomsearch) | 
        
           | BootstrapFinetune | [BootstrapFinetune Section](#telepromptbootstrapfinetune) | 
        
           ## teleprompt.LabeledFewShot 
        
           ### Constructor 
        
           The constructor initializes the `LabeledFewShot` class and sets up its attributes, particularly defining `k` number of samples to be used by the predictor. 
        
           ```python 
        
           class LabeledFewShot(Teleprompter): 
        
               def __init__(self, k=16): 
        
                   self.k = k 
        
           ``` 
        
           **Parameters:** 
        
           - `k` (_int_): Number of samples to be used for each predictor. Defaults to 16. 
        
           ### Method 
        
           #### `compile(self, student, *, trainset)` 
        
           This method compiles the `LabeledFewShot` instance by configuring the `student` predictor. It assigns subsets of the `trainset` in each student's predictor's `demos` attribute. If the `trainset` is empty, the method returns the original `student`. 
        
           **Parameters:** 
        
           - `student` (_Teleprompter_): Student predictor to be compiled. 
        
           - `trainset` (_list_): Training dataset for compiling with student predictor. 
        
           **Returns:** 
        
           - The compiled `student` predictor with assigned training samples for each predictor or the original `student` if the `trainset` is empty. 
        
           ### Example 
        
           ```python 
        
           import dspy 
        
           #Assume defined trainset 
        
           class RAG(dspy.Module): 
        
               def __init__(self, num_passages=3): 
        
                   super().__init__() 
        
                   #declare retrieval and predictor modules 
        
                   self.retrieve = dspy.Retrieve(k=num_passages) 
        
                   self.generate_answer = dspy.ChainOfThought(GenerateAnswer) 
        
               #flow for answering questions using predictor and retrieval modules 
        
               def forward(self, question): 
        
                   context = self.retrieve(question).passages 
        
                   prediction = self.generate_answer(context=context, question=question) 
        
                   return dspy.Prediction(context=context, answer=prediction.answer) 
        
           #Define teleprompter 
        
           teleprompter = LabeledFewShot() 
        
           # Compile! 
        
           compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset) 
        
           ``` 
        
           ## teleprompt.BootstrapFewShot 
        
           ### Constructor 
        
           The constructor initializes the `BootstrapFewShot` class and sets up parameters for bootstrapping. 
        
           ```python 
        
           class BootstrapFewShot(Teleprompter): 
        
               def __init__(self, metric=None, teacher_settings={}, max_bootstrapped_demos=4, max_labeled_demos=16, max_rounds=1): 
        
                   self.metric = metric 
        
                   self.teacher_settings = teacher_settings 
        
                   self.max_bootstrapped_demos = max_bootstrapped_demos 
        
                   self.max_labeled_demos = max_labeled_demos 
        
                   self.max_rounds = max_rounds 
        
           ``` 
        
           **Parameters:** 
        
           - `metric` (_callable_, _optional_): Metric function to evaluate examples during bootstrapping. Defaults to `None`. 
        
           - `teacher_settings` (_dict_, _optional_): Settings for teacher predictor. Defaults to empty dictionary. 
        
           - `max_bootstrapped_demos` (_int_, _optional_): Maximum number of bootstrapped demonstrations per predictor. Defaults to 4. 
        
           - `max_labeled_demos` (_int_, _optional_): Maximum number of labeled demonstrations per predictor. Defaults to 16. 
        
           - `max_rounds` (_int_, _optional_): Maximum number of bootstrapping rounds. Defaults to 1. 
        
           ### Method 
        
           #### `compile(self, student, *, teacher=None, trainset, valset=None)` 
        
           This method compiles the BootstrapFewShot instance by performing bootstrapping to refine the student predictor. 
        
           This process includes preparing the student and teacher predictors, which involves creating predictor copies, verifying the student predictor is uncompiled, and compiling the teacher predictor with labeled demonstrations via LabeledFewShot if the teacher predictor hasn't been compiled. 
        
           The next stage involves preparing predictor mappings by validating that both the student and teacher predictors have the same program structure and the same signatures but are different objects. 
        
           The final stage is performing the bootstrapping iterations. 
        
           **Parameters:** 
        
           - `student` (_Teleprompter_): Student predictor to be compiled. 
        
           - `teacher` (_Teleprompter_, _optional_): Teacher predictor used for bootstrapping. Defaults to `None`. 
        
           - `trainset` (_list_): Training dataset used in bootstrapping. 
        
           - `valset` (_list_, _optional_): Validation dataset used in compilation. Defaults to `None`. 
        
           **Returns:** 
        
           - The compiled `student` predictor after bootstrapping with refined demonstrations. 
        
           ### Example 
        
           ```python 
        
           #Assume defined trainset 
        
           #Assume defined RAG class 
        
           ... 
        
           #Define teleprompter and include teacher 
        
           teacher = dspy.OpenAI(model='gpt-3.5-turbo', api_key = openai.api_key, api_provider = "openai", model_type = "chat") 
        
           teleprompter = BootstrapFewShot(teacher_settings=dict({'lm': teacher})) 
        
           # Compile! 
        
           compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset) 
        
           ``` 
        
           ## teleprompt.Ensemble 
        
           ### Constructor 
        
           The constructor initializes the `Ensemble` class and sets up its attributes. This teleprompter is designed to create ensembled versions of multiple programs, reducing various outputs from different programs into a single output. 
        
           ```python 
        
           class Ensemble(Teleprompter): 
        
               def __init__(self, *, reduce_fn=None, size=None, deterministic=False): 
        
           ``` 
        
           **Parameters:** 
        
           - `reduce_fn` (_callable_, _optional_): Function used to reduce multiple outputs from different programs into a single output. A common choice is `dspy.majority`. Defaults to `None`. 
        
           - `size` (_int_, _optional_): Number of programs to randomly select for ensembling. If not specified, all programs will be used. Defaults to `None`. 
        
           - `deterministic` (_bool_, _optional_): Specifies whether ensemble should operate deterministically. Currently, setting this to `True` will raise an error as this feature is pending implementation. Defaults to `False`. 
        
           ### Method 
        
           #### `compile(self, programs)` 
        
           This method compiles an ensemble of programs into a single program that when run, can either randomly sample a subset of the given programs to produce outputs or use all of them. The multiple outputs can then be reduced into a single output using the `reduce_fn`. 
        
           **Parameters:** 
        
           - `programs` (_list_): List of programs to be ensembled. 
        
           **Returns:** 
        
           - `EnsembledProgram` (_Module_): An ensembled version of the input programs. 
        
           ### Example 
        
           ```python 
        
           import dspy 
        
           from dspy.teleprompt import Ensemble 
        
           # Assume a list of programs 
        
           programs = [program1, program2, program3, ...] 
        
           # Define Ensemble teleprompter 
        
           teleprompter = Ensemble(reduce_fn=dspy.majority, size=2) 
        
           # Compile to get the EnsembledProgram 
        
           ensembled_program = teleprompter.compile(programs) 
        
           ``` 
        
           ## teleprompt.BootstrapFewShotWithRandomSearch 
        
           ### Constructor 
        
           The constructor initializes the `BootstrapFewShotWithRandomSearch` class and sets up its attributes. It inherits from the `BootstrapFewShot` class and introduces additional attributes for the random search process. 
        
           ```python 
        
           class BootstrapFewShotWithRandomSearch(BootstrapFewShot): 
        
               def __init__(self, metric, teacher_settings={}, max_bootstrapped_demos=4, max_labeled_demos=16, max_rounds=1, num_candidate_programs=16, num_threads=6): 
        
                   self.metric = metric 
        
                   self.teacher_settings = teacher_settings 
        
                   self.max_rounds = max_rounds 
        
                   self.num_threads = num_threads 
        
                   self.min_num_samples = 1 
        
                   self.max_num_samples = max_bootstrapped_demos 
        
                   self.num_candidate_sets = num_candidate_programs 
        
                   self.max_num_traces = 1 + int(max_bootstrapped_demos / 2.0 * self.num_candidate_sets) 
        
                   self.max_bootstrapped_demos = self.max_num_traces 
        
                   self.max_labeled_demos = max_labeled_demos 
        
                   print("Going to sample between", self.min_num_samples, "and", self.max_num_samples, "traces per predictor.") 
        
                   print("Going to sample", self.max_num_traces, "traces in total.") 
        
                   print("Will attempt to train", self.num_candidate_sets, "candidate sets.") 
        
           ``` 
        
           **Parameters:** 
        
           - `metric` (_callable_, _optional_): Metric function to evaluate examples during bootstrapping. Defaults to `None`. 
        
           - `teacher_settings` (_dict_, _optional_): Settings for teacher predictor. Defaults to empty dictionary. 
        
           - `max_bootstrapped_demos` (_int_, _optional_): Maximum number of bootstrapped demonstrations per predictor. Defaults to 4. 
        
           - `max_labeled_demos` (_int_, _optional_): Maximum number of labeled demonstrations per predictor. Defaults to 16. 
        
           - `max_rounds` (_int_, _optional_): Maximum number of bootstrapping rounds. Defaults to 1. 
        
           - `num_candidate_programs` (_int_): Number of candidate programs to generate during random search. 
        
           - `num_threads` (_int_): Number of threads used for evaluation during random search. 
        
           ### Method 
        
           Refer to [teleprompt.BootstrapFewShot](#telepromptbootstrapfewshot) documentation. 
        
           ## Example 
        
           ```python 
        
           #Assume defined trainset 
        
           #Assume defined RAG class 
        
           ... 
        
           #Define teleprompter and include teacher 
        
           teacher = dspy.OpenAI(model='gpt-3.5-turbo', api_key = openai.api_key, api_provider = "openai", model_type = "chat") 
        
           teleprompter = BootstrapFewShotWithRandomSearch(teacher_settings=dict({'lm': teacher})) 
        
           # Compile! 
        
           compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset) 
        
           ``` 
        
           ## teleprompt.BootstrapFinetune 
        
           ### Constructor 
        
           ### `__init__(self, metric=None, teacher_settings={}, multitask=True)` 
        
           The constructor initializes a `BootstrapFinetune` instance and sets up its attributes. It defines the teleprompter as a `BootstrapFewShot` instance for the finetuning compilation. 
        
           ```python 
        
           class BootstrapFinetune(Teleprompter): 
        
               def __init__(self, metric=None, teacher_settings={}, multitask=True): 
        
           ``` 
        
           **Parameters:** 
        
           - `metric` (_callable_, _optional_): Metric function to evaluate examples during bootstrapping. Defaults to `None`. 
        
           - `teacher_settings` (_dict_, _optional_): Settings for teacher predictor. Defaults to empty dictionary. 
        
           - `multitask` (_bool_, _optional_): Enable multitask fine-tuning. Defaults to `True`. 
        
           ### Method 
        
           #### `compile(self, student, *, teacher=None, trainset, valset=None, target='t5-large', bsize=12, accumsteps=1, lr=5e-5, epochs=1, bf16=False)` 
        
           This method first compiles for bootstrapping with the `BootstrapFewShot` teleprompter. It then prepares fine-tuning data by generating prompt-completion pairs for training and performs finetuning. After compilation, the LMs are set to the finetuned models and the method returns a compiled and fine-tuned predictor. 
        
           **Parameters:** 
        
           - `student` (_Predict_): Student predictor to be fine-tuned. 
        
           - `teacher` (_Predict_, _optional_): Teacher predictor to help with fine-tuning. Defaults to `None`. 
        
           - `trainset` (_list_): Training dataset for fine-tuning. 
        
           - `valset` (_list_, _optional_): Validation dataset for fine-tuning. Defaults to `None`. 
        
           - `target` (_str_, _optional_): Target model for fine-tuning. Defaults to `'t5-large'`. 
        
           - `bsize` (_int_, _optional_): Batch size for training. Defaults to `12`. 
        
           - `accumsteps` (_int_, _optional_): Gradient accumulation steps. Defaults to `1`. 
        
           - `lr` (_float_, _optional_): Learning rate for fine-tuning. Defaults to `5e-5`. 
        
           - `epochs` (_int_, _optional_): Number of training epochs. Defaults to `1`. 
        
           - `bf16` (_bool_, _optional_): Enable mixed-precision training with BF16. Defaults to `False`. 
        
           **Returns:** 
        
           - `compiled2` (_Predict_): A compiled and fine-tuned `Predict` instance. 
        
           ### Example 
        
           ```python 
        
           #Assume defined trainset 
        
           #Assume defined RAG class 
        
           ... 
        
           #Define teleprompter 
        
           teleprompter = BootstrapFinetune(teacher_settings=dict({'lm': teacher})) 
        
           # Compile!

dspy/dspy/teleprompt/teleprompt.py

Lines 7 to 9 in 94d03ce

    
           class Teleprompter: 
        
               def __init__(self):

dspy/dspy/teleprompt/vanilla.py

Lines 6 to 27 in 94d03ce

    
           class LabeledFewShot(Teleprompter): 
        
               def __init__(self, k=16): 
        
                   self.k = k 
        
               def compile(self, student, *, trainset, sample=True): 
        
                   self.student = student.reset_copy() 
        
                   self.trainset = trainset 
        
                   if len(self.trainset) == 0: 
        
                       return self.student 
        
                   rng = random.Random(0) 
        
                   for predictor in self.student.predictors(): 
        
                       if sample: 
        
                           predictor.demos = rng.sample(self.trainset, min(self.k, len(self.trainset))) 
        
                       else: 
        
                           predictor.demos = self.trainset[:min(self.k, len(self.trainset))] 
        
                   return self.student

dspy/dspy/teleprompt/finetune.py

Lines 47 to 166 in 94d03ce

    
           class BootstrapFinetune(Teleprompter): 
        
               def __init__(self, metric=None, teacher_settings={}, multitask=True): 
        
                   self.metric = metric 
        
                   self.teacher_settings = teacher_settings 
        
                   self.multitask = multitask 
        
                   metric = metric or (lambda *args: True) 
        
                   self.teleprompter = BootstrapFewShot(metric=metric, 
        
                                                        max_bootstrapped_demos=999999, 
        
                                                        max_labeled_demos=0,  # FIXME: TODO: Make this zero? or param, with default as 16 or 0? 
        
                                                        teacher_settings=teacher_settings) 
        
               def compile(self, student, *, teacher=None, trainset, valset=None, 
        
                           target='t5-large', bsize=12, accumsteps=1, lr=5e-5, epochs=1, bf16=False, int8=False, peft=False, path_prefix=None): 
        
                   # It's usually better to supply a few-shot teacher, rather than uncompiled module (the student). 
        
                   if teacher is None: 
        
                       print("WARNING: Using a vanilla teacher. " 
        
                             "Are you sure you want to use BootstrapFinetune without a compiled teacher?") 
        
                   teachers = teacher if isinstance(teacher, list) else [teacher] 
        
                   finetune_data = {} 
        
                   for teacher in teachers: 
        
                       # Dummy compilation to get bootstraps. 
        
                       compiled = self.teleprompter.compile(student, teacher=teacher, trainset=trainset) 
        
                       multitask = self.multitask 
        
                       # Prepare finetune <prompt, completion> pairs. 
        
                       for name, predictor in compiled.named_predictors(): 
        
                           name_ = 'all' if multitask else name 
        
                           finetune_data[name_] = [] if name_ not in finetune_data else finetune_data[name_] 
        
                           for demo in predictor.demos: 
        
                               demo = dict(demo) 
        
                               # TODO: FIXME: generalize. 
        
                               completion = demo.pop(predictor.signature.fields[-1].output_variable) 
        
                               prompt = predictor.signature.query(dsp.Example(demos=[], **demo)).strip() 
        
                               finetune_data[name_].append(dict(prompt=prompt, completion=completion)) 
        
                   for name_ in finetune_data: 
        
                       random.Random(0).shuffle(finetune_data[name_]) 
        
                       print(name_, len(finetune_data[name_])) 
        
                   # 
        
                   # Dump as files. 
        
                   #  
        
                   finetune_paths = {} 
        
                   for name in finetune_data: 
        
                       data = finetune_data[name] 
        
                       hashed_name = name + '.' + Hasher.hash(data) 
        
                       output_path = os.path.join(training_data_directory, f'{hashed_name}.jsonl') 
        
                       print(output_path) 
        
                       with open(output_path, 'w') as f: 
        
                           for line in data: 
        
                               f.write(ujson.dumps(line) + '\n') 
        
                       finetune_paths[name] = output_path 
        
                   # 
        
                   # Train! 
        
                   # 
        
                   import string 
        
                   compiler_config = { 
        
                       'save': ''.join(random.Random(time.time()).choices(string.ascii_uppercase + string.digits, k=13)), # https://stackoverflow.com/a/2257449/1493011 
        
                       'peft': peft, 
        
                       'fp16': False, 
        
                       'bf16': bf16, 
        
                       'int8': int8, 
        
                       'fid': False, 
        
                       'rationale': False, 
        
                       'batch_size': bsize, 
        
                       'epochs': epochs, 
        
                       'gradient_accumulation_steps': accumsteps, # 2, 
        
                       'lr': lr 
        
                   } 
        
                   compiler_config['save'] = os.path.join(path_prefix, compiler_config['save']) if path_prefix else compiler_config['save'] 
        
                   from dsp.modules.finetuning import finetune_hf 
        
                   target = target 
        
                   finetune_models = {} 
        
                   for name in finetune_data: 
        
                       training_data_path = finetune_paths[name] 
        
                       compiler_config_ = dict(compiler_config) 
        
                       compiler_config_['save'] = compiler_config['save'] + '.' + name 
        
                       best_ckpt_path = finetune_hf(training_data_path, target, compiler_config_) 
        
                       print(f"#> Best checkpoint path: {best_ckpt_path} for {name}") 
        
                       finetune_models[name] = dsp.HFModel(model=target, checkpoint=best_ckpt_path) # best_ckpt_path 
        
                   # 
        
                   # Set the LMs to the finetuned ones, per module 
        
                   # 
        
                   compiled2 = compiled.reset_copy() 
        
                   assert len(compiled.named_predictors()) == len(compiled2.named_predictors()) 
        
                   for (name, predictor), (name2, predictor2) in zip(compiled.named_predictors(), compiled2.named_predictors()): 
        
                       assert name == name2 
        
                       name = 'all' if multitask else name 
        
                       # TODO: FIXME: When we assign .lm, the Predict.forward will also set only_query=True. 
        
                       # This is correct for here but we may want to make it more explicitly restricted to finetuned models. 
        
                       print(f"Assigning the LM of predictor {name}.") 
        
                       predictor2.lm = finetune_models[name] 
        
                       assert predictor2.demos == []

dspy/dspy/teleprompt/ensemble.py

Lines 10 to 39 in 94d03ce

    
           class Ensemble(Teleprompter): 
        
               def __init__(self, *, reduce_fn=None, size=None, deterministic=False): 
        
                   """A common reduce_fn is dspy.majority.""" 
        
                   assert deterministic is False, "TODO: Implement example hashing for deterministic ensemble." 
        
                   self.reduce_fn = reduce_fn 
        
                   self.size = size 
        
                   self.deterministic = deterministic 
        
               def compile(self, programs): 
        
                   size = self.size 
        
                   reduce_fn = self.reduce_fn 
        
                   import dspy 
        
                   class EnsembledProgram(dspy.Module): 
        
                       def __init__(self): 
        
                           super().__init__() 
        
                           self.programs = programs 
        
                       def forward(self, *args, **kwargs): 
        
                           programs = random.sample(self.programs, size) if size else self.programs 
        
                           outputs = [prog(*args, **kwargs) for prog in programs] 
        
                           if reduce_fn: 
        
                               return reduce_fn(outputs) 
        
                           return outputs

Step 2: ⌨️ Coding

Modify docs/teleprompters/teleprompters.md ✓ e344da5 Edit

Modify docs/teleprompters/teleprompters.md with contents:
• Review the code for each class in the `/dspy/teleprompt/*` directory.
• Update the description of each class in the `teleprompters.md` file to accurately reflect the current functionality of the class. This includes the `LabeledFewShot`, `BootstrapFewShot`, `Ensemble`, `BootstrapFewShotWithRandomSearch`, and `BootstrapFinetune` classes.
• Update the description of the constructor for each class. This includes the purpose of the constructor and the parameters it accepts.
• Update the description of the methods for each class. This includes the purpose of the method, the parameters it accepts, and what it returns.
• Update the examples for each class to ensure they accurately demonstrate how to use the class and its methods.
• Ensure that the documentation is clear, concise, and easy to understand.

--- 
+++ 
@@ -18,7 +18,7 @@
 
 ### Constructor
 
-The constructor initializes the `LabeledFewShot` class and sets up its attributes, particularly defining `k` number of samples to be used by the predictor.
+The constructor initializes the `LabeledFewShot` class with the specified number of demos `k` to be used for each predictor. If `sample` is `True`, this number of demos will be chosen randomly from the `trainset`. Otherwise, the first `k` demos from the `trainset` will be selected. to be used by the predictor.
 
 ```python
 class LabeledFewShot(Teleprompter):
@@ -33,11 +33,12 @@
 
 #### `compile(self, student, *, trainset)`
 
-This method compiles the `LabeledFewShot` instance by configuring the `student` predictor. It assigns subsets of the `trainset` in each student's predictor's `demos` attribute. If the `trainset` is empty, the method returns the original `student`.
+This method compiles the `LabeledFewShot` instance by preparing the `student` module with demo samples from the `trainset` for each of the student's predictors. It decides whether to sample randomly from the training demoes or to take the first `k` demoes based on the `sample` parameter. `k` denotes the limit on the number of demoes to use, which was set during the construction of the `LabeledFewShot` instance. It assigns subsets of the `trainset` in each student's predictor's `demos` attribute. If the `trainset` is empty, the method returns the original `student`.
 
 **Parameters:**
 - `student` (_Teleprompter_): Student predictor to be compiled.
-- `trainset` (_list_): Training dataset for compiling with student predictor.
+- `trainset` (_list_): A list of example objects to be used as training demos.
+- `sample` (_bool_, optional): Determines if the demos should be randomly sampled from the `trainset`. Defaults to `True`.
 
 **Returns:**
 - The compiled `student` predictor with assigned training samples for each predictor or the original `student` if the `trainset` is empty.
@@ -121,9 +122,9 @@
 #Assume defined RAG class
 ...
 
-#Define teleprompter and include teacher
+
 teacher = dspy.OpenAI(model='gpt-3.5-turbo', api_key = openai.api_key, api_provider = "openai", model_type = "chat")
-teleprompter = BootstrapFewShot(teacher_settings=dict({'lm': teacher}))
+
 
 # Compile!
 compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset)
@@ -141,8 +142,8 @@
 ```
 
 **Parameters:**
-- `reduce_fn` (_callable_, _optional_): Function used to reduce multiple outputs from different programs into a single output. A common choice is `dspy.majority`. Defaults to `None`.
-- `size` (_int_, _optional_): Number of programs to randomly select for ensembling. If not specified, all programs will be used. Defaults to `None`.
+- `reduce_fn` (_callable_, _optional_): Function used to reduce multiple outputs from different programs into a single output. A common choice is `dspy.majority`. If set to `None`, all sampled outputs will be returned as a list. Defaults to `None`.
+- `size` (_int_, _optional_): Number of programs to randomly select for ensembling if not all programs are to be used for reduction. If not specified, all programs will be used. Defaults to `None`.
 - `deterministic` (_bool_, _optional_): Specifies whether ensemble should operate deterministically. Currently, setting this to `True` will raise an error as this feature is pending implementation. Defaults to `False`.
 
 ### Method
@@ -180,7 +181,7 @@
 The constructor initializes the `BootstrapFewShotWithRandomSearch` class and sets up its attributes. It inherits from the `BootstrapFewShot` class and introduces additional attributes for the random search process.
 
 ```python
-class BootstrapFewShotWithRandomSearch(BootstrapFewShot):
+class BootstrapFewShotWithRandomSearch(LabeledFewShot):
     def __init__(self, metric, teacher_settings={}, max_bootstrapped_demos=4, max_labeled_demos=16, max_rounds=1, num_candidate_programs=16, num_threads=6):
         self.metric = metric
         self.teacher_settings = teacher_settings
@@ -272,7 +273,7 @@
 
 ```python
 #Assume defined trainset
-#Assume defined RAG class
+# Assume the RAG class is already defined as shown earlier
 ...
 
 #Define teleprompter

Running GitHub Actions for docs/teleprompters/teleprompters.md ✓ Edit

Check docs/teleprompters/teleprompters.md with contents:
Ran GitHub Actions for e344da508577d549d00d057789e8d1b41cd649eb:

Step 3: 🔁 Code Review

I have finished reviewing the code for completeness. I did not find errors for sweep/update_teleprompt_documentation.

🎉 Latest improvements to Sweep:

We just released a dashboard to track Sweep's progress on your issue in real-time, showing every stage of the process – from search to planning and coding.
Sweep uses OpenAI's latest Assistant API to plan code changes and modify code! This is 3x faster and significantly more reliable as it allows Sweep to edit code and validate the changes in tight iterations, the same way as a human would.
Try using the GitHub issues extension to create Sweep issues directly from your editor! GitHub Issues and Pull Requests.

💡 To recreate the pull request edit the issue title or description. To tweak the pull request, leave a comment on the pull request.
^{Join Our Discord}

from dspy.

Sweep: Update `teleprompt` documentation about dspy HOT 1 OPEN

Comments (1)

🚀 Here's the PR! #62

Actions (click)

Sandbox Execution ✓

Step 1: 🔎 Searching

Step 2: ⌨️ Coding

Step 3: 🔁 Code Review

🎉 Latest improvements to Sweep:

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent


	Teleprompters are powerful optimizers (included in DSPy) that can learn to bootstrap and select effective prompts for the modules of any program. (The "tele-" in the name means "at a distance", i.e., automatic prompting at a distance.)

	This documentation provides an overview of the DSPy Teleprompters.

	## Teleprompters

	\| Module \| Jump To \|
	\| --- \| --- \|
	\| LabeledFewShot \| [LabeledFewShot Section](#telepromptlabeledfewshot) \|
	\| BootstrapFewShot \| [BootstrapFewShot Section](#telepromptbootstrapfewshot) \|
	\| Ensemble \| [Ensemble Section](#telepromptensemble) \|
	\| BootstrapFewShotWithRandomSearch \| [BootstrapFewShotWithRandomSearch Section](#telepromptbootstrapfewshotwithrandomsearch) \|
	\| BootstrapFinetune \| [BootstrapFinetune Section](#telepromptbootstrapfinetune) \|

	## teleprompt.LabeledFewShot

	### Constructor

	The constructor initializes the `LabeledFewShot` class and sets up its attributes, particularly defining `k` number of samples to be used by the predictor.

	```python
	class LabeledFewShot(Teleprompter):
	def __init__(self, k=16):
	self.k = k
	```

	Parameters:
	- `k` (_int_): Number of samples to be used for each predictor. Defaults to 16.

	### Method

	#### `compile(self, student, *, trainset)`

	This method compiles the `LabeledFewShot` instance by configuring the `student` predictor. It assigns subsets of the `trainset` in each student's predictor's `demos` attribute. If the `trainset` is empty, the method returns the original `student`.

	Parameters:
	- `student` (_Teleprompter_): Student predictor to be compiled.
	- `trainset` (_list_): Training dataset for compiling with student predictor.

	Returns:
	- The compiled `student` predictor with assigned training samples for each predictor or the original `student` if the `trainset` is empty.

	### Example

	```python
	import dspy

	#Assume defined trainset
	class RAG(dspy.Module):
	def __init__(self, num_passages=3):
	super().__init__()

	#declare retrieval and predictor modules
	self.retrieve = dspy.Retrieve(k=num_passages)
	self.generate_answer = dspy.ChainOfThought(GenerateAnswer)

	#flow for answering questions using predictor and retrieval modules
	def forward(self, question):
	context = self.retrieve(question).passages
	prediction = self.generate_answer(context=context, question=question)
	return dspy.Prediction(context=context, answer=prediction.answer)

	#Define teleprompter
	teleprompter = LabeledFewShot()

	# Compile!
	compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset)
	```

	## teleprompt.BootstrapFewShot

	### Constructor

	The constructor initializes the `BootstrapFewShot` class and sets up parameters for bootstrapping.

	```python
	class BootstrapFewShot(Teleprompter):
	def __init__(self, metric=None, teacher_settings={}, max_bootstrapped_demos=4, max_labeled_demos=16, max_rounds=1):
	self.metric = metric
	self.teacher_settings = teacher_settings

	self.max_bootstrapped_demos = max_bootstrapped_demos
	self.max_labeled_demos = max_labeled_demos
	self.max_rounds = max_rounds
	```

	Parameters:
	- `metric` (_callable_, _optional_): Metric function to evaluate examples during bootstrapping. Defaults to `None`.
	- `teacher_settings` (_dict_, _optional_): Settings for teacher predictor. Defaults to empty dictionary.
	- `max_bootstrapped_demos` (_int_, _optional_): Maximum number of bootstrapped demonstrations per predictor. Defaults to 4.
	- `max_labeled_demos` (_int_, _optional_): Maximum number of labeled demonstrations per predictor. Defaults to 16.
	- `max_rounds` (_int_, _optional_): Maximum number of bootstrapping rounds. Defaults to 1.

	### Method

	#### `compile(self, student, *, teacher=None, trainset, valset=None)`

	This method compiles the BootstrapFewShot instance by performing bootstrapping to refine the student predictor.

	This process includes preparing the student and teacher predictors, which involves creating predictor copies, verifying the student predictor is uncompiled, and compiling the teacher predictor with labeled demonstrations via LabeledFewShot if the teacher predictor hasn't been compiled.

	The next stage involves preparing predictor mappings by validating that both the student and teacher predictors have the same program structure and the same signatures but are different objects.

	The final stage is performing the bootstrapping iterations.

	Parameters:
	- `student` (_Teleprompter_): Student predictor to be compiled.
	- `teacher` (_Teleprompter_, _optional_): Teacher predictor used for bootstrapping. Defaults to `None`.
	- `trainset` (_list_): Training dataset used in bootstrapping.
	- `valset` (_list_, _optional_): Validation dataset used in compilation. Defaults to `None`.

	Returns:
	- The compiled `student` predictor after bootstrapping with refined demonstrations.

	### Example

	```python
	#Assume defined trainset
	#Assume defined RAG class
	...

	#Define teleprompter and include teacher
	teacher = dspy.OpenAI(model='gpt-3.5-turbo', api_key = openai.api_key, api_provider = "openai", model_type = "chat")
	teleprompter = BootstrapFewShot(teacher_settings=dict({'lm': teacher}))

	# Compile!
	compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset)
	```

	## teleprompt.Ensemble

	### Constructor

	The constructor initializes the `Ensemble` class and sets up its attributes. This teleprompter is designed to create ensembled versions of multiple programs, reducing various outputs from different programs into a single output.

	```python
	class Ensemble(Teleprompter):
	def __init__(self, *, reduce_fn=None, size=None, deterministic=False):
	```

	Parameters:
	- `reduce_fn` (_callable_, _optional_): Function used to reduce multiple outputs from different programs into a single output. A common choice is `dspy.majority`. Defaults to `None`.
	- `size` (_int_, _optional_): Number of programs to randomly select for ensembling. If not specified, all programs will be used. Defaults to `None`.
	- `deterministic` (_bool_, _optional_): Specifies whether ensemble should operate deterministically. Currently, setting this to `True` will raise an error as this feature is pending implementation. Defaults to `False`.

	### Method

	#### `compile(self, programs)`

	This method compiles an ensemble of programs into a single program that when run, can either randomly sample a subset of the given programs to produce outputs or use all of them. The multiple outputs can then be reduced into a single output using the `reduce_fn`.

	Parameters:
	- `programs` (_list_): List of programs to be ensembled.

	Returns:
	- `EnsembledProgram` (_Module_): An ensembled version of the input programs.

	### Example

	```python
	import dspy
	from dspy.teleprompt import Ensemble

	# Assume a list of programs
	programs = [program1, program2, program3, ...]

	# Define Ensemble teleprompter
	teleprompter = Ensemble(reduce_fn=dspy.majority, size=2)

	# Compile to get the EnsembledProgram
	ensembled_program = teleprompter.compile(programs)
	```

	## teleprompt.BootstrapFewShotWithRandomSearch

	### Constructor

	The constructor initializes the `BootstrapFewShotWithRandomSearch` class and sets up its attributes. It inherits from the `BootstrapFewShot` class and introduces additional attributes for the random search process.

	```python
	class BootstrapFewShotWithRandomSearch(BootstrapFewShot):
	def __init__(self, metric, teacher_settings={}, max_bootstrapped_demos=4, max_labeled_demos=16, max_rounds=1, num_candidate_programs=16, num_threads=6):
	self.metric = metric
	self.teacher_settings = teacher_settings
	self.max_rounds = max_rounds

	self.num_threads = num_threads

	self.min_num_samples = 1
	self.max_num_samples = max_bootstrapped_demos
	self.num_candidate_sets = num_candidate_programs
	self.max_num_traces = 1 + int(max_bootstrapped_demos / 2.0 * self.num_candidate_sets)

	self.max_bootstrapped_demos = self.max_num_traces
	self.max_labeled_demos = max_labeled_demos

	print("Going to sample between", self.min_num_samples, "and", self.max_num_samples, "traces per predictor.")
	print("Going to sample", self.max_num_traces, "traces in total.")
	print("Will attempt to train", self.num_candidate_sets, "candidate sets.")
	```

	Parameters:
	- `metric` (_callable_, _optional_): Metric function to evaluate examples during bootstrapping. Defaults to `None`.
	- `teacher_settings` (_dict_, _optional_): Settings for teacher predictor. Defaults to empty dictionary.
	- `max_bootstrapped_demos` (_int_, _optional_): Maximum number of bootstrapped demonstrations per predictor. Defaults to 4.
	- `max_labeled_demos` (_int_, _optional_): Maximum number of labeled demonstrations per predictor. Defaults to 16.
	- `max_rounds` (_int_, _optional_): Maximum number of bootstrapping rounds. Defaults to 1.
	- `num_candidate_programs` (_int_): Number of candidate programs to generate during random search.
	- `num_threads` (_int_): Number of threads used for evaluation during random search.

	### Method

	Refer to [teleprompt.BootstrapFewShot](#telepromptbootstrapfewshot) documentation.

	## Example

	```python
	#Assume defined trainset
	#Assume defined RAG class
	...

	#Define teleprompter and include teacher
	teacher = dspy.OpenAI(model='gpt-3.5-turbo', api_key = openai.api_key, api_provider = "openai", model_type = "chat")
	teleprompter = BootstrapFewShotWithRandomSearch(teacher_settings=dict({'lm': teacher}))

	# Compile!
	compiled_rag = teleprompter.compile(student=RAG(), trainset=trainset)
	```

	## teleprompt.BootstrapFinetune

	### Constructor

	### `__init__(self, metric=None, teacher_settings={}, multitask=True)`

	The constructor initializes a `BootstrapFinetune` instance and sets up its attributes. It defines the teleprompter as a `BootstrapFewShot` instance for the finetuning compilation.

	```python
	class BootstrapFinetune(Teleprompter):
	def __init__(self, metric=None, teacher_settings={}, multitask=True):
	```

	Parameters:
	- `metric` (_callable_, _optional_): Metric function to evaluate examples during bootstrapping. Defaults to `None`.
	- `teacher_settings` (_dict_, _optional_): Settings for teacher predictor. Defaults to empty dictionary.
	- `multitask` (_bool_, _optional_): Enable multitask fine-tuning. Defaults to `True`.

	### Method

	#### `compile(self, student, *, teacher=None, trainset, valset=None, target='t5-large', bsize=12, accumsteps=1, lr=5e-5, epochs=1, bf16=False)`

	This method first compiles for bootstrapping with the `BootstrapFewShot` teleprompter. It then prepares fine-tuning data by generating prompt-completion pairs for training and performs finetuning. After compilation, the LMs are set to the finetuned models and the method returns a compiled and fine-tuned predictor.

	Parameters:
	- `student` (_Predict_): Student predictor to be fine-tuned.
	- `teacher` (_Predict_, _optional_): Teacher predictor to help with fine-tuning. Defaults to `None`.
	- `trainset` (_list_): Training dataset for fine-tuning.
	- `valset` (_list_, _optional_): Validation dataset for fine-tuning. Defaults to `None`.
	- `target` (_str_, _optional_): Target model for fine-tuning. Defaults to `'t5-large'`.
	- `bsize` (_int_, _optional_): Batch size for training. Defaults to `12`.
	- `accumsteps` (_int_, _optional_): Gradient accumulation steps. Defaults to `1`.
	- `lr` (_float_, _optional_): Learning rate for fine-tuning. Defaults to `5e-5`.
	- `epochs` (_int_, _optional_): Number of training epochs. Defaults to `1`.
	- `bf16` (_bool_, _optional_): Enable mixed-precision training with BF16. Defaults to `False`.

	Returns:
	- `compiled2` (_Predict_): A compiled and fine-tuned `Predict` instance.

	### Example

	```python
	#Assume defined trainset
	#Assume defined RAG class
	...

	#Define teleprompter
	teleprompter = BootstrapFinetune(teacher_settings=dict({'lm': teacher}))

	# Compile!


	class Ensemble(Teleprompter):
	def __init__(self, *, reduce_fn=None, size=None, deterministic=False):
	"""A common reduce_fn is dspy.majority."""

	assert deterministic is False, "TODO: Implement example hashing for deterministic ensemble."

	self.reduce_fn = reduce_fn
	self.size = size
	self.deterministic = deterministic

	def compile(self, programs):
	size = self.size
	reduce_fn = self.reduce_fn

	import dspy
	class EnsembledProgram(dspy.Module):
	def __init__(self):
	super().__init__()
	self.programs = programs

	def forward(self, args, *kwargs):
	programs = random.sample(self.programs, size) if size else self.programs
	outputs = [prog(args, *kwargs) for prog in programs]

	if reduce_fn:
	return reduce_fn(outputs)

	return outputs