Text-to-image diffusion models in artificial intelligence

Stable Diffusion is a captivating text-to-image model that generates images based on text input. However, a major challenge is that it is pretrained on a specific dataset, limiting its ability to generate images outside of the given data. In this paper, we propose to harness two models based on neural networks, Hypernetworks and DreamBooth, to allow the introduction of any image into Stable Diffusion, addressing versatility with minimal additional training data. This work targets AI applications such as augmenting next-generation multipurpose robots, enhancing human-robot collaboration, feeding intelligent tutoring systems, training autonomous cars, injecting subjects for photo personalization, producing high quality movie animations etc.. It can contribute to AI in smart cities for facets such as smart living and mobility.

کلیدواژه ها:

ANN ، Data Mining ، Image Processing ، Movie Animations ، Photo Personalization ، Stable Diffusion ، Text-to-Image Creation

نویسندگان

amir mohammad moradi

pouria pourvejdani

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/1682276

شناسه ملی سند علمی:

CARSE07_272

تاریخ نمایه سازی: 5 تیر 1402

نحوه استناد به مقاله:

در صورتی که می خواهید در اثر پژوهشی خود به این مقاله ارجاع دهید، به سادگی می توانید از عبارت زیر در بخش منابع و مراجع استفاده نمایید:

moradi, amir mohammad and pourvejdani, pouria,1402,Text-to-image diffusion models in artificial intelligence,The 7th International Conference on Applied Research in Science and Engineeringg,https://civilica.com/doc/1682276

در داخل متن نیز هر جا که به عبارت و یا دستاوردی از این مقاله اشاره شود پس از ذکر مطلب، در داخل پارانتز، مشخصات زیر نوشته می شود.
برای بار اول: (1402, moradi, amir mohammad؛ pouria pourvejdani)
برای بار دوم به بعد: (1402, moradi؛ pourvejdani)
برای آشنایی کامل با نحوه مرجع نویسی لطفا بخش راهنمای سیویلیکا (مرجع دهی) را ملاحظه نمایید.

مقالات مرتبط جدید