Brand Strategist
Brand building and reputation — positioning, messaging, visual identity, GEO (Generative Engine Optimization), crisis management, brand experience (2026)
Brand building and reputation — positioning, messaging, visual identity, GEO (Generative Engine Optimization), crisis management, brand experience (2026)
Multi-model audio and music generation prompt engineer — Suno v3.5, Udio v1.5, ElevenLabs, Stable Audio 3; genre taxonomy, instrumentation layering, BPM/key anchoring, mixing terminology, spatial audio, voice-design parameters, model-specific syntax (2026)
Vision-text-data integration — image analysis, document processing, chart interpretation, scene understanding, cross-modal reasoning (2026)
Cinematic AI video generation — shot vocabulary, camera movement, lighting design, color grading, lens optics, narrative continuity, model-specific syntax (2026)
AI-driven 3D content creation — NeRF, Gaussian Splatting, diffusion-based 3D generation, mesh optimization, PBR texturing, real-time rendering pipeline (2026)
Sustained visual-textual search across 100-turn horizons — file-based visual context management, progressive on-demand image loading, multi-hop visual reasoning, horizon drift prevention; based on LMM-Searcher (arXiv 2604.12890, April 2026)
Cross-modal agent architecture — active perception, visual/audio grounding, token-efficient context management, modality-aware tool design, GUI automation (2026)
Multi-model video prompting — Sora 2, Runway Gen 4.5, Kling 2.6, Veo 3; shot vocab, camera moves, model-specific patterns (2026)
Multi-model image generation prompt engineer — GPT-Image-2, Midjourney V7, Flux 1.2+, Stable Diffusion 3.5, Ideogram 3, DALL-E 3; composition grammar, photography optics, art-direction taxonomy, lighting design, material language, character-consistency workflows, text-in-image...
Full guide + template for Flux prompting — camera/lens/lighting/style system (2025)
AI-first product design — agentic workflows, generative UI, human-in-the-loop at the right level, self-improving loops, trust & transparency architecture (2026)
Local-first social-clip producer — Whisper transcript scanning for punchlines/reversals, 16:9→9:16 face-pan or split-screen reframe, opus-style word-by-word caption burn; ffmpeg + NumPy pipeline, no cloud APIs; based on louisedesadeleer/clipify (May 2026, 399 stars)
K8s operations — cluster architecture, RBAC, network policies, GitOps (ArgoCD/Flux), service mesh (Istio/Linkerd), multi-tenancy, CIS Benchmark, cost optimization (2026)
Social-media image-card architect for Xiaohongshu carousels and WeChat cover pairs — Editorial Magazine × Swiss Internationalism dual systems, 28 registered layouts, 10 locked theme presets, image-source hygiene, anti-slop guardrails; single-file HTML → Playwright PNG; based o...
Content experiment strategist — turns every post into a calibrated 5-phase loop (score → blind-predict → ship → retro → evolve); rubric-driven scoring, immutable prediction discipline, and compounding judgment over time; format-agnostic (video, essay, thread, podcast); based o...
AI video editing engineer — audio-first cut craft, ffmpeg EDL pipelines, parallel animation sub-agents, color grade, subtitle burn; strategy confirmation before execution, self-evaluation before delivery; based on browser-use/video-use (Apr 2026, 6.9k+ stars)
NotebookLM-powered multimodal research orchestrator — ingest URLs, PDFs, YouTube, audio, video, and images; chat with indexed sources; generate podcasts, videos, slide decks, reports, quizzes, flashcards, and mind maps; deep web research with subagent patterns; batch downloads...
Programmatic video architect — design video as HTML compositions with data-timed tracks, GSAP/CSS seekable animations, and deterministic FFmpeg rendering; production loop (plan → layout → animate → lint → inspect → preview → render), sub-composition reuse, parameterized variab...
Midjourney prompt generator — token vectors, weighting, interactive optimization
Text adventure romance simulator with DALL-E image generation (Chinese)