Steve Hougom

Azure Indexer error....Invalid document key

Azure Indexer Error.

I have been running the Azure Indexer on a blob storage container that contains PDFs. The indexer indexed more than 7000 blobs successfully, then started throwing these errors for any additional documents.

    {
        "key": "https://yoursiteurl/169292.pdf",
        "errorMessage": "Invalid document key: 'https://yoursiteurl/processed-documents/169292.pdf'. Keys can only contain letters, digits, underscore (_), dash (-), or equal sign (=). Please see https://docs.microsoft.com/azure/search/search-howto-indexing-azure-blob-storage#DocumentKeys"
    }


I see nothing wrong with the document itself and am wondering if this has anything to do with keys.



Here is index metadata:



Here is the blob metadata:

ste5an

Well, without testing: the slash is neither a letter, a digit, an underscore (_), a dash (-), nor an equal sign (=).
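
To make that point concrete, here is a minimal sketch that lists which characters in the reported key fall outside the set named in the error message. The helper name `invalid_key_chars` is hypothetical, and the sketch assumes "letters" and "digits" mean ASCII letters and digits; it is not how the service itself validates keys.

```python
import re

# Allowed characters as listed in the error message:
# letters, digits, underscore (_), dash (-), equal sign (=).
# Assumption: "letters" and "digits" are taken to mean ASCII only.
ALLOWED_CHAR = re.compile(r"[A-Za-z0-9_\-=]")


def invalid_key_chars(key: str) -> list:
    """Return the distinct disallowed characters in `key`, in order of first appearance."""
    seen = []
    for ch in key:
        if not ALLOWED_CHAR.fullmatch(ch) and ch not in seen:
            seen.append(ch)
    return seen


# The key from the error message above.
print(invalid_key_chars("https://yoursiteurl/processed-documents/169292.pdf"))
# prints [':', '/', '.']
```

So it is not only the slash: the colon and the dot in a blob URL are rejected as well, which is why any full storage path fails as a raw key.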
Steve Hougom (ASKER)

But where is it picking up the slash from? Which field? Sorry, I don't understand.
Judging from the key, it seems that you cannot use the metadata storage path as the key.
Also, I didn't check base64 encoding on the indexer.

Do you recommend a different type of field for the index primary key? When you create a new index, it uses metadata_storage_path as the default option.

Maybe some generic id or metadata_storage_name?

But I definitely did not have base64 encoding checked. I tried updating the indexer's base64 encoding property through the REST API with Postman, but it won't let me.
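
For reference, a rough sketch of why base64 encoding the path avoids the error: URL-safe base64 output uses only letters, digits, `-`, `_`, and `=`, all of which the error message allows. The `field_mappings` fragment follows the shape of an indexer field mapping using the `base64Encode` mapping function; the target field name `id` and the helper `encode_key` are assumptions for illustration, and the encoded string produced here may not be byte-identical to what the service generates.

```python
import base64


def encode_key(storage_path: str) -> str:
    """URL-safe base64 of the path; output contains only A-Z, a-z, 0-9, '-', '_' and '='."""
    return base64.urlsafe_b64encode(storage_path.encode("utf-8")).decode("ascii")


# Fragment of an indexer definition expressed as a Python dict.
# Assumption: the index key field is called "id"; substitute the real key field name.
field_mappings = [
    {
        "sourceFieldName": "metadata_storage_path",
        "targetFieldName": "id",  # hypothetical key field name
        "mappingFunction": {"name": "base64Encode"},
    }
]

path = "https://yoursiteurl/processed-documents/169292.pdf"
key = encode_key(path)

# Every character of the encoded key is in the allowed set.
print(all((c.isascii() and c.isalnum()) or c in "-_=" for c in key))
```

Note that switching the key encoding on an index that already holds documents changes the key values, so previously indexed blobs would likely need to be reindexed under the new keys.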
ASKER CERTIFIED SOLUTION
Steve Hougom