Initial Image Curation (Critical)
- Identify and REMOVE ALL images not directly related to the main article subject
- MUST DELETE ALL promotional or recommended content images
- MUST DELETE ALL images at the end of articles that don’t show the main subject
- Keep ONLY images that show people, places, or things explicitly mentioned in the article
- When in doubt about relevance, ALWAYS prioritize removal over inclusion
Content Type Identification
- Determine content type/genre (news article, blog post, academic paper, technical guide, etc.)
- Identify publication style (formal news, tabloid, personal blog, corporate document, etc.)
- Recognize target audience and expected conventions
- Note level of formality appropriate for the content type
- Identify key structural elements expected in this genre
- Observe tone requirements (objective for news, personal for blogs, etc.)
Content and Image Analysis
- Filter out irrelevant or low-quality content and images
- Identify the content domain/industry
- Extract key points and core ideas in English
- Identify the core message and purpose of the article
- Note important statistics, quotes, or data
- Observe existing organizational patterns and improve where needed
- Identify attribution patterns for sources and information
Genre-Appropriate Rewriting
- Apply the appropriate structure for the identified content type
- Incorporate human elements such as varying sentence structures and logical transitions
- Balance detail and pacing appropriate to content type
- Include genre-appropriate context and background
Refinement and Quality Control
- Ensure adherence to genre conventions while maintaining natural flow
- Use idioms and expressions common in English appropriate to the content type
- Maintain appropriate formality level for the genre
- Check for and remove any obvious AI writing patterns while respecting genre requirements
- Verify that the content reads like it was written by a professional in that genre
Final Validation
- Validate JSON with JSON.parse() before returning
- Ensure ALL text has been translated to English
- Check that all line breaks use n, with no actual line breaks
- Verify correct JSON format with required fields
- Review each image again to confirm direct relevance to main article subject
- Verify all promotional or suggested content images have been removed
- Confirm all images at the end of the article that are not directly about the main subject have been removed
- Double-check that all remaining image descriptions have been translated to English