Content Analysis and Rewriting for Authentic, Human-Sounding Text
The task involves transforming content into authentic, natural-sounding text that reads as if written by a professional human writer. This requires identifying the content type, applying the conventions of that genre, and restructuring the material in a style that shows natural human writing patterns.
Key Elements of the Task
- Content Type Identification: Determine the content type or genre, and identify the target audience and the writing conventions that audience expects.
- Content and Image Analysis: Filter out irrelevant or low-quality content and images, identify key points and core ideas, and note important statistics, quotes, or data.
- Genre-Appropriate Rewriting: Apply the appropriate structure for the identified content type, incorporating human elements such as varied sentence structures, logical transitions, and balanced detail and pacing.
- Human Writing Characteristics: Incorporate genre-specific human writing characteristics. For news articles, for example, this means varying paragraph length, mixing attribution patterns, and using subtle variations in reporting language.
- Refinement and Quality Control: Ensure adherence to genre conventions, use idioms and expressions common in English, and check for obvious AI writing patterns.
- Markdown Formatting: Apply appropriate Markdown formatting to enhance readability and structure, using headings, text emphasis, lists, blockquotes, and links naturally and appropriately for the content type.
Final Validation
- Validate JSON with JSON.parse() before returning.
- Ensure all text has been translated to English.
- Check that all line breaks use the escape sequence \n, with no literal line breaks.
- Verify correct JSON format with required fields.
- Review each image to confirm direct relevance to the main article subject.
- Confirm all promotional or suggested content images have been removed.
- Double-check that all remaining image descriptions have been translated to English.
- Verify the text reads naturally and follows appropriate genre conventions.
The final output is a JSON object in the required format, with no text before or after the JSON object.
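The mechanical parts of the final validation (parsing with JSON.parse(), rejecting literal line breaks, checking required fields, and rejecting text around the JSON object) can be sketched as a small helper. This is only an illustration under stated assumptions: the function name `validateOutput` and the field names `title` and `content` are hypothetical, since the task description mentions "required fields" without listing them.

```javascript
// Hypothetical field names: the task description does not list the required fields.
const REQUIRED_FIELDS = ["title", "content"];

// Returns { ok, errors } for a raw output string.
function validateOutput(raw, requiredFields = REQUIRED_FIELDS) {
  const errors = [];

  // The output must be the JSON object alone, with no text before or after it.
  const trimmed = raw.trim();
  if (!trimmed.startsWith("{") || !trimmed.endsWith("}")) {
    errors.push("output must be a single JSON object with no surrounding text");
  }

  // Line breaks must appear as the two-character escape \n, never as literal breaks.
  if (/[\r\n]/.test(raw)) {
    errors.push("output contains literal line breaks");
  }

  // JSON.parse() throws a SyntaxError on malformed input.
  let parsed;
  try {
    parsed = JSON.parse(raw);
  } catch (err) {
    errors.push(`invalid JSON: ${err.message}`);
    return { ok: false, errors };
  }

  // The top-level value must be a plain object, not an array, null, or a primitive.
  if (parsed === null || typeof parsed !== "object" || Array.isArray(parsed)) {
    errors.push("top-level value must be an object");
    return { ok: false, errors };
  }

  // Every required field must be present.
  for (const field of requiredFields) {
    if (!(field in parsed)) {
      errors.push(`missing required field: ${field}`);
    }
  }

  return { ok: errors.length === 0, errors };
}

// JSON.stringify() without an indent argument escapes line breaks inside
// strings as \n and emits no literal line breaks of its own.
const sample = JSON.stringify({ title: "Example", content: "First line\nSecond line" });
console.log(validateOutput(sample));
```

The remaining checklist items (image relevance, translation quality, natural-sounding prose) are editorial judgments and are not covered by a check like this.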