B4M : Breaking Low-Rank Adapter for Making Content-Style Customization

dc.contributor.authorXu, Yu
dc.contributor.authorTang, Fan
dc.contributor.authorCao, Juan
dc.contributor.authorZhang, Yuxin
dc.contributor.authorDeussen, Oliver
dc.contributor.authorDong, Weiming
dc.contributor.authorLi, Jintao
dc.contributor.authorLee, Tong-Yee
dc.date.accessioned2025-04-10T09:14:54Z
dc.date.available2025-04-10T09:14:54Z
dc.date.issued2025-04-30
dc.description.abstractPersonalized generation paradigms empower designers to customize visual intellectual properties with the help of textual descriptions by adapting pre-trained text-to-image models on a few images. Recent studies focus on simultaneously customizing content and detailed visual style in images but often struggle with entangling the two. In this study, we reconsider the customization of content and style concepts from the perspective of parameter space construction. Unlike existing methods that utilize a shared parameter space for content and style learning, we propose a novel framework that separates the parameter space to facilitate individual learning of content and style by introducing “partly learnable projection” (PLP) matrices to separate the original adapters into divided sub-parameter spaces. A “break-for-make” customization learning pipeline based on PLP is proposed: we first break the original adapters into “up projection” and “down projection” for content and style concept under orthogonal prior and then make the entity parameter space by reconstructing the content and style PLPs matrices by using Riemannian precondition to adaptively balance content and style learning. Experiments on various styles, including textures, materials, and artistic style, show that our method outperforms state-of-the-art single/multiple concept learning pipelines regarding content-style-prompt alignment. Code is available at: https://github.com/ICTMCG/Break-for-make.
dc.description.versionpublisheddeu
dc.identifier.doi10.1145/3728461
dc.identifier.ppn1924818477
dc.identifier.urihttps://kops.uni-konstanz.de/handle/123456789/72996
dc.language.isoeng
dc.rightsterms-of-use
dc.rights.urihttps://rightsstatements.org/page/InC/1.0/
dc.subjectCustomize generation
dc.subjectcontent-style fusion
dc.subjecttext-to-image generation
dc.subject.ddc004
dc.titleB4M : Breaking Low-Rank Adapter for Making Content-Style Customizationeng
dc.typeJOURNAL_ARTICLE
dspace.entity.typePublication
kops.citation.bibtex
@article{Xu2025-04-30Break-72996,
  title={B4M : Breaking Low-Rank Adapter for Making Content-Style Customization},
  year={2025},
  doi={10.1145/3728461},
  number={2},
  volume={44},
  issn={0730-0301},
  journal={ACM Transactions on Graphics},
  author={Xu, Yu and Tang, Fan and Cao, Juan and Zhang, Yuxin and Deussen, Oliver and Dong, Weiming and Li, Jintao and Lee, Tong-Yee},
  note={Article Number: 21}
}
kops.citation.iso690XU, Yu, Fan TANG, Juan CAO, Yuxin ZHANG, Oliver DEUSSEN, Weiming DONG, Jintao LI, Tong-Yee LEE, 2025. B4M : Breaking Low-Rank Adapter for Making Content-Style Customization. In: ACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Verfügbar unter: doi: 10.1145/3728461deu
kops.citation.iso690XU, Yu, Fan TANG, Juan CAO, Yuxin ZHANG, Oliver DEUSSEN, Weiming DONG, Jintao LI, Tong-Yee LEE, 2025. B4M : Breaking Low-Rank Adapter for Making Content-Style Customization. In: ACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Available under: doi: 10.1145/3728461eng
kops.citation.rdf
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/72996">
    <dc:contributor>Dong, Weiming</dc:contributor>
    <dc:creator>Xu, Yu</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:issued>2025-04-30</dcterms:issued>
    <dc:creator>Li, Jintao</dc:creator>
    <dc:contributor>Deussen, Oliver</dc:contributor>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/72996"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43615"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-04-10T09:14:54Z</dc:date>
    <dc:language>eng</dc:language>
    <dc:creator>Tang, Fan</dc:creator>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/72996/1/Xu_2-1r3ers966rekm9.pdf"/>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Cao, Juan</dc:contributor>
    <dc:creator>Zhang, Yuxin</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-04-10T09:14:54Z</dcterms:available>
    <dc:creator>Deussen, Oliver</dc:creator>
    <dc:creator>Dong, Weiming</dc:creator>
    <dc:creator>Cao, Juan</dc:creator>
    <dc:contributor>Lee, Tong-Yee</dc:contributor>
    <dcterms:abstract>Personalized generation paradigms empower designers to customize visual intellectual properties with the help of textual descriptions by adapting pre-trained text-to-image models on a few images. Recent studies focus on simultaneously customizing content and detailed visual style in images but often struggle with entangling the two. In this study, we reconsider the customization of content and style concepts from the perspective of parameter space construction. Unlike existing methods that utilize a shared parameter space for content and style learning, we propose a novel framework that separates the parameter space to facilitate individual learning of content and style by introducing “partly learnable projection” (PLP) matrices to separate the original adapters into divided sub-parameter spaces. A “break-for-make” customization learning pipeline based on PLP is proposed: we first break the original adapters into “up projection” and “down projection” for content and style concept under orthogonal prior and then make the entity parameter space by reconstructing the content and style PLPs matrices by using Riemannian precondition to adaptively balance content and style learning. Experiments on various styles, including textures, materials, and artistic style, show that our method outperforms state-of-the-art single/multiple concept learning pipelines regarding content-style-prompt alignment. Code is available at: https://github.com/ICTMCG/Break-for-make.</dcterms:abstract>
    <dc:contributor>Zhang, Yuxin</dc:contributor>
    <dc:contributor>Tang, Fan</dc:contributor>
    <dc:creator>Lee, Tong-Yee</dc:creator>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/72996/1/Xu_2-1r3ers966rekm9.pdf"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Li, Jintao</dc:contributor>
    <dcterms:title>B4M : Breaking Low-Rank Adapter for Making Content-Style Customization</dcterms:title>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:rights>terms-of-use</dc:rights>
    <dc:contributor>Xu, Yu</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43615"/>
  </rdf:Description>
</rdf:RDF>
kops.description.funding{"first":"dfg","second":"EXC 2117-422037984"}
kops.description.openAccessopenaccesshybrid
kops.flag.isPeerReviewedtrue
kops.flag.knbibliographytrue
kops.identifier.nbnurn:nbn:de:bsz:352-2-1r3ers966rekm9
kops.sourcefieldACM Transactions on Graphics. ACM. 2025, <b>44</b>(2), 21. ISSN 0730-0301. eISSN 1557-7368. Verfügbar unter: doi: 10.1145/3728461deu
kops.sourcefield.plainACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Verfügbar unter: doi: 10.1145/3728461deu
kops.sourcefield.plainACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Available under: doi: 10.1145/3728461eng
relation.isAuthorOfPublicationb73b5935-736c-45ce-b7c0-bdeaecbca1f0
relation.isAuthorOfPublication4e85f041-bb89-4e27-b7d6-acd814feacb8
relation.isAuthorOfPublication.latestForDiscoveryb73b5935-736c-45ce-b7c0-bdeaecbca1f0
source.bibliographicInfo.articleNumber21
source.bibliographicInfo.issue2
source.bibliographicInfo.volume44
source.identifier.eissn1557-7368
source.identifier.issn0730-0301
source.periodicalTitleACM Transactions on Graphics
source.publisherACM
temp.description.funding{"second":"Z231100005923033","first":"Beijing Science and Technology Plan Project"}

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
Xu_2-1r3ers966rekm9.pdf
Größe:
1.97 MB
Format:
Adobe Portable Document Format
Xu_2-1r3ers966rekm9.pdf
Xu_2-1r3ers966rekm9.pdfGröße: 1.97 MBDownloads: 156