mirror of https://github.com/hwchase17/langchain.git synced 2025-12-02 15:20:20 +00:00

Files

nikhilkjha d57d08fd01 Initial commit for comprehend moderator (#9665)

This PR implements a custom chain that wraps Amazon Comprehend API
calls. The chain is intended for use with LLM chains to provide
moderation that lets you detect and redact PII, toxic, and intent
content in the LLM prompt or the LLM response. The implementation
accepts a configuration object that controls which checks are performed
on an LLM prompt. Using the LangChain Expression Language, it can be
combined in a variety of setups, not only with chains but also with
other constructs such as a retriever.
The included sample notebook goes over the different configuration
options and shows how to use the moderation chain with other chains.

###  Usage sample
```python
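import boto3
from langchain import LLMChain, PromptTemplate
from langchain.llms.fake import FakeListLLM
from langchain_experimental.comprehend_moderation import (
    AmazonComprehendModerationChain,
    BaseModerationActions,
    BaseModerationFilters,
)

# Assumption: AWS credentials are already configured; the region is only an example.
comprehend_client = boto3.client("comprehend", region_name="us-east-1")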

moderation_config = { 
        "filters":[ 
                BaseModerationFilters.PII, 
                BaseModerationFilters.TOXICITY,
                BaseModerationFilters.INTENT
        ],
        "pii":{ 
                "action": BaseModerationActions.ALLOW, 
                "threshold":0.5, 
                "labels":["SSN"],
                "mask_character": "X"
        },
        "toxicity":{ 
                "action": BaseModerationActions.STOP, 
                "threshold":0.5
        },
        "intent":{ 
                "action": BaseModerationActions.STOP, 
                "threshold":0.5
        }
}

comp_moderation_with_config = AmazonComprehendModerationChain(
    moderation_config=moderation_config, #specify the configuration
    client=comprehend_client,            #optionally pass the Boto3 Client
    verbose=True
)

template = """Question: {question}

Answer:"""

prompt = PromptTemplate(template=template, input_variables=["question"])

responses = [
    "Final Answer: A credit card number looks like 1289-2321-1123-2387. A fake SSN number looks like 323-22-9980. John Doe's phone number is (999)253-9876.", 
    "Final Answer: This is a really shitty way of constructing a birdhouse. This is fucking insane to think that any birds would actually create their motherfucking nests here."
]
llm = FakeListLLM(responses=responses)

llm_chain = LLMChain(prompt=prompt, llm=llm)

chain = ( 
    prompt 
    | comp_moderation_with_config 
    | {llm_chain.input_keys[0]: lambda x: x['output'] }  
    | llm_chain 
    | { "input": lambda x: x['text'] } 
    | comp_moderation_with_config 
)

response = chain.invoke({"question": "A sample SSN number looks like this 123-456-7890. Can you give me some more samples?"})

print(response['output'])
```
### Output
```
> Entering new AmazonComprehendModerationChain chain...
Running AmazonComprehendModerationChain...
Running pii validation...
Found PII content..stopping..
The prompt contains PII entities and cannot be processed
```
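
To make the `threshold`, `labels`, and `mask_character` settings in the `pii` section more concrete, here is a minimal sketch of how such settings could drive redaction. It is not the chain's actual implementation: `mask_pii` is a hypothetical helper, and the entity list is hand-written in the shape of an Amazon Comprehend `DetectPiiEntities` response (`Type`, `Score`, `BeginOffset`, `EndOffset`) rather than returned by a real API call.

```python
from typing import Dict, List


def mask_pii(
    text: str,
    entities: List[Dict],
    labels: List[str],
    threshold: float = 0.5,
    mask_character: str = "X",
) -> str:
    """Hypothetical helper: mask entity spans that match `labels` and meet `threshold`."""
    chars = list(text)
    for entity in entities:
        # Skip low-confidence detections.
        if entity["Score"] < threshold:
            continue
        # An empty `labels` list is treated here as "mask every entity type".
        if labels and entity["Type"] not in labels:
            continue
        for i in range(entity["BeginOffset"], entity["EndOffset"]):
            chars[i] = mask_character
    return "".join(chars)


# Hand-written sample data; the scores are made up for illustration.
sample_text = "My SSN is 323-22-9980 and my phone is (999)253-9876."
ssn_start = sample_text.index("323-22-9980")
phone_start = sample_text.index("(999)253-9876")
sample_entities = [
    {"Type": "SSN", "Score": 0.99,
     "BeginOffset": ssn_start, "EndOffset": ssn_start + len("323-22-9980")},
    {"Type": "PHONE", "Score": 0.98,
     "BeginOffset": phone_start, "EndOffset": phone_start + len("(999)253-9876")},
]

# Only the SSN is masked because "PHONE" is not in `labels`.
print(mask_pii(sample_text, sample_entities, labels=["SSN"]))
```

With `labels=["SSN"]`, the phone number is left untouched even though it was detected, which mirrors the idea of restricting a check to the configured labels.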

---------

Co-authored-by: Piyush Jain <piyushjain@duck.com>
Co-authored-by: Anjan Biswas <anjanavb@amazon.com>
Co-authored-by: Jha <nikjha@amazon.com>
Co-authored-by: Bagatur <baskaryan@gmail.com>

2023-08-25 15:11:27 -07:00

| Name | Last commit | Date |
| --- | --- | --- |
| docs | Initial commit for comprehend moderator (#9665) | 2023-08-25 15:11:27 -07:00 |
| src | 📖 docs: compact api reference (#8651) | 2023-08-24 09:01:52 -07:00 |
| static | Update agent docs, move to use-case sub-directory (#9344) | 2023-08-25 11:28:55 -07:00 |
| .gitignore | Doc refactor (#6300) | 2023-06-16 11:52:56 -07:00 |
| babel.config.js | Doc refactor (#6300) | 2023-06-16 11:52:56 -07:00 |
| code-block-loader.js | Doc refactor (#6300) | 2023-06-16 11:52:56 -07:00 |
| docusaurus.config.js | Automatically set docs appearance to system default (#8924) | 2023-08-08 09:54:18 -07:00 |
| generate_api_reference_links.py | fix links generation (#8471) | 2023-07-29 18:31:33 -07:00 |
| ignore_build.sh | fix prod docs build (#6402) | 2023-06-18 20:56:12 -07:00 |
| package-lock.json | docs: (Mendable Search) Fixes stuck when tabbing out issue (#9074) | 2023-08-10 13:46:06 -07:00 |
| package.json | docs: (Mendable Search) Fixes stuck when tabbing out issue (#9074) | 2023-08-10 13:46:06 -07:00 |
| README.md | Doc refactor (#6300) | 2023-06-16 11:52:56 -07:00 |
| settings.ini | Doc refactor (#6300) | 2023-06-16 11:52:56 -07:00 |
| sidebars.js | Add docs community page (#8992) | 2023-08-10 13:41:35 -07:00 |
| vercel_build.sh | Update local script for docs build (#8377) | 2023-07-27 13:13:59 -07:00 |
| vercel.json | | 2023-08-23 11:30:44 -07:00 |

README.md

Website

This website is built using Docusaurus 2, a modern static website generator.

Installation

$ yarn

Local Development

$ yarn start

This command starts a local development server and opens up a browser window. Most changes are reflected live without having to restart the server.

Build

$ yarn build

This command generates static content into the build directory, which can then be served using any static content hosting service.

Deployment

Using SSH:

$ USE_SSH=true yarn deploy

Not using SSH:

$ GIT_USER=<Your GitHub username> yarn deploy

If you are using GitHub pages for hosting, this command is a convenient way to build the website and push to the gh-pages branch.

Continuous Integration

Some common defaults for linting/formatting have been set for you. If you integrate your project with an open source Continuous Integration system (e.g. Travis CI, CircleCI), you may check for issues using the following command.

$ yarn ci