Skip to content

Conversation

@changjian-wang
Copy link
Member

This pull request adds support for training custom analyzers in Azure AI Content Understanding using labeled data stored in Azure Blob Storage. It introduces a comprehensive sample demonstrating the process, updates the test environment to support training data configuration, and adds necessary dependencies and test data.

New sample and documentation for labeled data training:

  • Added a new sample file Sample16_CreateAnalyzerWithLabels.md that explains how to train a custom analyzer using labeled data, including setup instructions, code snippets, training data structure, and key concepts.

SDK and test infrastructure enhancements:

  • Introduced a convenience constructor to LabeledDataKnowledgeSource for easier instantiation with just a container URL, simplifying usage in code and samples.
  • Extended ContentUnderstandingClientTestEnvironment to support environment variables for training data SAS URL, path, storage account, and container name, enabling flexible test configuration for labeled data scenarios.

Test and dependency updates:

  • Added the Azure.Storage.Blobs package to test dependencies to facilitate uploading training data to Azure Blob Storage in sample and test code.
  • Added a labeled training data file 17a84146-e910-460c-bf80-a625e6f64fea.jpg.labels.json to the test data folder, providing a concrete example for analyzer training.

…sociated test data

- Added a new image file and its corresponding labels and result JSON files for testing.
- Implemented Sample16_CreateAnalyzerWithLabels.cs to demonstrate how to create an analyzer using labeled data.
- Included logic for uploading training data to Azure Blob Storage and defining a field schema for receipt extraction.
- Added assertions to validate the properties of the created analyzer.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants