A complete end-to-end news classification system using DistilBERT, SageMaker, Lambda, and API Gateway.
## Architecture

```
Client → API Gateway → Lambda → SageMaker Endpoint → DistilBERT Model
```
## Components

- **Training:** `script.py` - Fine-tunes DistilBERT on news data
- **Inference:** `inference.py` - Handles model loading and prediction
- **Deployment:** `Deployment.ipynb` - Deploys model to SageMaker endpoint
- **Lambda:** `aws-lambda-llm-endpoint-invoke-function.py` - API handler
- **API Gateway:** `template.yaml` - REST API infrastructure
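The Lambda layer's job is to forward the API Gateway request body to the SageMaker endpoint and return the model's JSON response. A minimal sketch of that flow is below; the `SAGEMAKER_ENDPOINT_NAME` environment variable, the payload shape, and the helper names are illustrative assumptions, not necessarily what `aws-lambda-llm-endpoint-invoke-function.py` actually does.

```python
import json
import os


def build_payload(body_json: str) -> str:
    """Extract the `query` object from the API Gateway request body and
    re-serialize it as the JSON payload sent to the SageMaker endpoint."""
    body = json.loads(body_json)
    return json.dumps({"query": body["query"]})


def lambda_handler(event, context):
    # boto3 is imported lazily here so the parsing helper above can be
    # exercised without AWS credentials; a real handler usually creates
    # the client once at module load time.
    import boto3

    runtime = boto3.client("sagemaker-runtime")
    response = runtime.invoke_endpoint(
        # Reading the endpoint name from an environment variable is an
        # assumption; the actual handler may hard-code it.
        EndpointName=os.environ["SAGEMAKER_ENDPOINT_NAME"],
        ContentType="application/json",
        Body=build_payload(event["body"]),
    )
    result = json.loads(response["Body"].read().decode("utf-8"))
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(result),
    }
```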
## Prerequisites

- AWS CLI configured (`aws configure`)
- AWS SAM CLI installed (`pip install aws-sam-cli`)
- SageMaker endpoint deployed and running
## Quick Start

1. **Deploy the SageMaker model** (if not already done): run the `Deployment.ipynb` notebook.

2. **Deploy API Gateway + Lambda:**

   ```bash
   ./deploy.sh
   ```

3. **Test the API:**

   ```bash
   # Get the API URL from the deployment output, then:
   python test_api.py <API_URL>

   # Or test a specific headline:
   python test_api.py <API_URL> "Stock market crashes due to inflation"
   ```
## API Usage

**Endpoint:**

```
POST https://{api-id}.execute-api.{region}.amazonaws.com/prod/classify
```

**Request body:**

```json
{
  "query": {
    "headline": "Scientists discover new treatment for cancer"
  }
}
```

**Response:**

```json
{
  "predicted_label": "Health",
  "probabilities": [[0.05, 0.10, 0.05, 0.80]]
}
```

## Classification Labels

- Business (index 0)
- Science (index 1)
- Entertainment (index 2)
- Health (index 3)
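The `probabilities` field lines up with the label indices above, so the predicted label is the argmax of the row. A small sketch of that mapping (the `top_label` helper is illustrative, not part of this repo):

```python
# Labels in index order, matching the model's output head (per this README).
LABELS = ["Business", "Science", "Entertainment", "Health"]


def top_label(probabilities):
    """Map the endpoint's probability matrix to a human-readable label.

    `probabilities` is a list of rows, one per input headline, as returned
    in the response JSON shown above.
    """
    row = probabilities[0]
    best = max(range(len(row)), key=lambda i: row[i])
    return LABELS[best], row[best]


# Using the example response above:
label, score = top_label([[0.05, 0.10, 0.05, 0.80]])
# → ("Health", 0.80)
```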
## Example: cURL

```bash
curl -X POST https://your-api-url/prod/classify \
  -H "Content-Type: application/json" \
  -d '{"query": {"headline": "New vaccine shows 95% effectiveness"}}'
```

## Files

- `script.py` - Training script for SageMaker
- `inference.py` - Model inference logic
- `Deployment.ipynb` - SageMaker deployment notebook
- `aws-lambda-llm-endpoint-invoke-function.py` - Lambda function
- `template.yaml` - SAM infrastructure template
- `deploy.sh` - Deployment script
- `test_api.py` - API testing script
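The same call can be made from Python with only the standard library; this is a hedged sketch (`build_request` and `classify` are illustrative helpers, not the contents of `test_api.py`):

```python
import json
import urllib.request


def build_request(api_url: str, headline: str) -> urllib.request.Request:
    """Build the POST request for the /classify endpoint."""
    payload = json.dumps({"query": {"headline": headline}}).encode("utf-8")
    return urllib.request.Request(
        api_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def classify(api_url: str, headline: str) -> dict:
    """Send the request and return the parsed JSON response."""
    with urllib.request.urlopen(build_request(api_url, headline)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```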
## Troubleshooting

- **Lambda timeout:** increase the function timeout in `template.yaml`
- **Permissions error:** check that the Lambda IAM role allows invoking the SageMaker endpoint
- **Endpoint not found:** verify the SageMaker endpoint name matches the one the Lambda function invokes
- **CORS issues:** API Gateway CORS is pre-configured
## Costs

- **SageMaker endpoint:** runs continuously (~$100-200/month for ml.m5.xlarge)
- **Lambda:** pay per request (~$0.0000002 per request)
- **API Gateway:** pay per request (~$0.0000035 per request)

Consider using SageMaker Serverless Inference for lower costs with variable traffic.
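Switching to Serverless Inference mainly means replacing the instance settings in the endpoint config with a `ServerlessConfig` block. A sketch of that production variant is below; the model name and the memory/concurrency values are illustrative assumptions, not tuned recommendations:

```python
# Hypothetical serverless production variant: the ServerlessConfig block
# replaces the InstanceType/InitialInstanceCount fields used for the
# always-on ml.m5.xlarge variant.
serverless_variant = {
    "VariantName": "AllTraffic",
    "ModelName": "news-classifier-model",  # assumed model name
    "ServerlessConfig": {
        "MemorySizeInMB": 4096,  # allowed values: 1024-6144, in 1 GB steps
        "MaxConcurrency": 5,     # concurrent invocations before throttling
    },
}

# This dict would be passed as ProductionVariants=[serverless_variant] to
# boto3.client("sagemaker").create_endpoint_config(...); you pay per
# invocation instead of per instance-hour.
```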