MEND

Paper: Fast Model Editing at Scale

MendRewriteExecutor

MendRewriteExecutor is the class that applies MEND to your model. It employs a hypernetwork to learn the weight delta needed to edit the language model.
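To make the "hypernetwork learns the delta" idea concrete, here is a toy sketch in plain Python. It assumes the formulation from the MEND paper, where the gradient of a linear layer is the outer product of an output gradient and an input activation, and the hypernetwork transforms both factors before the update is applied. The names toy_hypernetwork and edited_weights are hypothetical, and the fixed scaling stands in for a trained network; this is not the MendRewriteExecutor implementation.

```python
from typing import Callable, List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def outer(a: Vector, b: Vector) -> Matrix:
    """Outer product a b^T as a nested list."""
    return [[x * y for y in b] for x in a]


def toy_hypernetwork(u: Vector, delta: Vector) -> Tuple[Vector, Vector]:
    """Hypothetical stand-in for the learned hypernetwork.

    A real MEND hypernetwork is a trained network; here we only rescale
    the two gradient factors so the data flow is visible.
    """
    return [0.5 * x for x in u], [2.0 * d for d in delta]


def edited_weights(
    W: Matrix,
    u: Vector,
    delta: Vector,
    lr: float,
    hyper: Callable[[Vector, Vector], Tuple[Vector, Vector]] = toy_hypernetwork,
) -> Matrix:
    """Return W minus lr times the transformed rank-1 gradient."""
    u_t, d_t = hyper(u, delta)   # transformed activation and output gradient
    update = outer(d_t, u_t)     # rank-1 pseudo-gradient, same shape as W
    return [
        [w - lr * g for w, g in zip(w_row, g_row)]
        for w_row, g_row in zip(W, update)
    ]


W = [[1.0, 0.0], [0.0, 1.0]]   # weights of a toy 2x2 linear layer
u = [1.0, 2.0]                 # input activation for the edit example
delta = [0.5, -1.0]            # gradient of the edit loss w.r.t. the layer output
W_edited = edited_weights(W, u, delta, lr=0.1)
```

The original W is left untouched; only the returned matrix carries the edit.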

init_model() ->

MEND requires a specific model structure with pre-trained weights, so train them with Trainer before editing.

  • Parameters

    • model(PreTrainedModel): model to be edited

    • tok(PreTrainedTokenizer): tokenizer for inputs

    • params(Hyperparams): hyperparameters for editing method

  • Return Type

apply_to_model()-> PreTrainedModel

Main function: given the edit requests, it applies MEND to your model and returns the model with its edited weights.

  • Parameters

    • model(PreTrainedModel): model to be edited

    • tok(PreTrainedTokenizer): tokenizer for inputs

    • requests(List[Dict]): The edit descriptors and targets.

    • hparams(Hyperparams): hyperparameters for editing method

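    • copy(bool): whether to copy the original model

    • return_orig_weights(bool): whether to return the weights of the original model

    • keep_original_weight(bool): whether to keep the original weights after each edit, which determines whether edits are applied sequentially

      • False: edit sequentially (the original weights are not kept after each edit, so edits accumulate)

      • True: do not edit sequentially (the original weights are kept, so each edit starts from them)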

  • Return Type

    • edited_model(PreTrainedModel): model weights after editing

Example
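The following toy sketch illustrates the keep_original_weight semantics described above, using a plain dictionary in place of model weights. The function toy_apply_edits and the additive deltas are hypothetical and exist only for illustration; they are not part of the library, and a real edit would come from the MEND hypernetwork.

```python
from typing import Dict, List

Weights = Dict[str, float]


def toy_apply_edits(
    weights: Weights,
    deltas: List[Weights],
    keep_original_weight: bool,
) -> List[Weights]:
    """Apply each delta in turn and record the weights after every edit.

    keep_original_weight=False: edits accumulate (sequential editing).
    keep_original_weight=True: weights are reset after each edit, so every
    edit starts from the same original weights.
    """
    current = dict(weights)          # work on a copy, leave the input intact
    snapshots: List[Weights] = []
    for delta in deltas:
        original = dict(current)     # remember the state before this edit
        for name, change in delta.items():
            current[name] += change
        snapshots.append(dict(current))
        if keep_original_weight:
            current = original       # undo the edit before the next one
    return snapshots


base = {"w": 1.0}
deltas = [{"w": 1.0}, {"w": 2.0}]

sequential = toy_apply_edits(base, deltas, keep_original_weight=False)
independent = toy_apply_edits(base, deltas, keep_original_weight=True)
```

With these inputs, the sequential run carries the first edit into the second, while the independent run evaluates each edit against the untouched base weights.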
