Dictation Documentation

Medical Dictation API

The Nuxera Medical Dictation API converts dictated audio from healthcare providers into accurate, formatted text. This API is specifically optimized for post-operative notes, procedure documentation, and other medical dictations where a single speaker provides continuous clinical information.

Format Support

Nuxera AI supports multiple response/request formats to accommodate different healthcare system requirements:

  • JSON (default): Standard API responses/requests
  • HL7: Healthcare interoperability standard
  • FHIR: Fast Healthcare Interoperability Resources standard

To request a specific format, include the x-response-format header in your request:

x-response-format: hl7
x-response-format: fhir

Note: If no format header is specified, responses default to JSON format.

To send request in a specific format, include the x-request-format header in your request:

x-request-format: hl7
x-request-format: fhir

Note: If the x-request-format header is not provided, the request will be processed as JSON by default.

Endpoint

URL: /api/dictate

Method: POST

Content-Type: multipart/form-data

Authentication

Include your API key in the x-api-key header:

x-api-key: YOUR_API_KEY

Request Body

Request Parameters

ParameterTypeRequiredDescription
audioFileYesAudio file in WAV format
doctorNamestringNoName of the doctor providing the dictation
patientNamestringNoName of the patient (improves accuracy)

Response

Success Response (200 OK)

Error Response (400 Bad Request)

{
  "error": "No audio provided"
}

Error Response (500 Internal Server Error)

{
  "error": "Failed to process dictation audio"
}

Response Structure

The API response contains the following components:

FieldDescription
dictationThe processed, cleaned dictation text with duplications and unnecessary repetitions removed
rawTextThe original, unprocessed transcription (useful for comparison or debugging)
timestampA unique timestamp associated with this dictation session

Example Usage

Advanced Features

Duplicate Removal

The dictation API automatically identifies and removes:

  • Repeated words and phrases
  • Duplicated sentences
  • Stutters and false starts
  • Unnecessary fillers

This cleaning process produces a polished final text while preserving all medically relevant information.

Format Detection

The API can detect and appropriately format common dictation elements:

  • Numeric values and measurements
  • Medication names and dosages
  • Anatomical terms
  • Procedure names and techniques
  • Medical abbreviations

Session Management

Each dictation is assigned a unique timestamp that can be used to:

  • Associate multiple dictations with the same patient encounter
  • Reference the dictation in your electronic health record
  • Track dictation history for auditing purposes

Use Cases

Post-Operative Notes

Surgeons can dictate procedure details immediately after surgery, receiving formatted documentation that can be directly incorporated into the patient record.

Example Input:

  • Audio dictation by surgeon after completing a procedure

Example Output:

Patient is a 62-year-old female with a history of right breast cancer who underwent right mastectomy with sentinel lymph node biopsy. The procedure was performed under general anesthesia. A 6 cm incision was made... [detailed procedure description]... Estimated blood loss was minimal. All counts were correct. The patient tolerated the procedure well and was transferred to recovery in stable condition.

Procedure Documentation

Healthcare providers can document procedures in real-time using dictation.

Example Input:

  • Audio dictation during or immediately after a medical procedure

Example Output:

Patient presented for colonoscopy due to positive fecal occult blood test. Under moderate sedation, the colonoscope was advanced to the cecum with good visualization. A 5mm polyp was identified in the sigmoid colon and removed with a hot snare. The specimen was sent to pathology. No other abnormalities were noted. The patient tolerated the procedure well.

Clinical Assessment Notes

Providers can rapidly document patient assessments, physical findings, and recommendations.

Example Input:

  • Audio dictation of clinical assessment

Example Output:

Physical examination reveals a well-developed, well-nourished male in no apparent distress. Vital signs are within normal limits. Cardiovascular examination shows regular rate and rhythm without murmurs. Lungs are clear to auscultation bilaterally. Abdomen is soft, non-tender, and non-distended with normal bowel sounds. Assessment: Stable hypertension, well-controlled on current medication regimen. Plan: Continue current medications and follow up in 3 months.

Best Practices

  1. Audio Quality

    • Dictate in a quiet environment with minimal background noise
    • Use a good quality microphone positioned 6-8 inches from your mouth
    • Speak clearly and at a moderate pace
  2. Dictation Technique

    • Begin with patient identification and procedure type
    • Follow a consistent structure for your dictations
    • Clearly enunciate medical terms and measurements
    • Indicate punctuation when needed ("period," "new paragraph")
  3. Post-Processing

    • Review the processed dictation for accuracy before incorporating into medical records
    • Pay special attention to medication names, dosages, and numeric values
    • Verify that the text accurately represents your intended documentation
  4. File Handling

    • Keep audio files under 100MB for optimal processing
    • For longer dictations, consider splitting into logical segments
    • Save the timestamp returned by the API for reference

Error Handling

Error CodeDescriptionResolution
400No audio providedEnsure audio file is included in the request
400Invalid audio formatUpload audio in WAV format
413Audio file too largeReduce file size or split recording
500Dictation processing failedCheck audio quality and try again

Limitations

  • Maximum audio file size: 100MB
  • Maximum dictation length: 15 minutes
  • Primary language(s) support: English And Arabic(with medical terminology)
  • Secondary language(s) support: Hindi, Urdu, (with medical terminology)
  • Supported audio formats: WAV (others will be automatically converted)

Next Steps

For more implementation guidance, see our Implementation Guides.