B4M : Breaking Low-Rank Adapter for Making Content-Style Customization

Xu, Yu; Tang, Fan; Cao, Juan; Zhang, Yuxin; Deussen, Oliver; Dong, Weiming; Li, Jintao; Lee, Tong-Yee

doi:10.1145/3728461

dc.contributor.author	Xu, Yu
dc.contributor.author	Tang, Fan
dc.contributor.author	Cao, Juan
dc.contributor.author	Zhang, Yuxin
dc.contributor.author	Deussen, Oliver
dc.contributor.author	Dong, Weiming
dc.contributor.author	Li, Jintao
dc.contributor.author	Lee, Tong-Yee
dc.date.accessioned	2025-04-10T09:14:54Z
dc.date.available	2025-04-10T09:14:54Z
dc.date.issued	2025-04-30
dc.description.abstract	Personalized generation paradigms empower designers to customize visual intellectual properties with the help of textual descriptions by adapting pre-trained text-to-image models on a few images. Recent studies focus on simultaneously customizing content and detailed visual style in images but often struggle to disentangle the two. In this study, we reconsider the customization of content and style concepts from the perspective of parameter space construction. Unlike existing methods that utilize a shared parameter space for content and style learning, we propose a novel framework that separates the parameter space to facilitate individual learning of content and style by introducing “partly learnable projection” (PLP) matrices to separate the original adapters into divided sub-parameter spaces. A “break-for-make” customization learning pipeline based on PLP is proposed: we first break the original adapters into “up projection” and “down projection” for the content and style concepts under an orthogonal prior and then make the entity parameter space by reconstructing the content and style PLP matrices, using Riemannian preconditioning to adaptively balance content and style learning. Experiments on various styles, including textures, materials, and artistic styles, show that our method outperforms state-of-the-art single/multiple concept learning pipelines regarding content-style-prompt alignment. Code is available at: https://github.com/ICTMCG/Break-for-make.
dc.description.version	published	deu
dc.identifier.doi	10.1145/3728461
dc.identifier.ppn	1924818477
dc.identifier.uri	https://kops.uni-konstanz.de/handle/123456789/72996
dc.language.iso	eng
dc.rights	terms-of-use
dc.rights.uri	https://rightsstatements.org/page/InC/1.0/
dc.subject	Customize generation
dc.subject	content-style fusion
dc.subject	text-to-image generation
dc.subject.ddc	004
dc.title	B4M : Breaking Low-Rank Adapter for Making Content-Style Customization	eng
dc.type	JOURNAL_ARTICLE
dspace.entity.type	Publication
kops.citation.bibtex	@article{Xu2025-04-30Break-72996, title={B4M : Breaking Low-Rank Adapter for Making Content-Style Customization}, year={2025}, doi={10.1145/3728461}, number={2}, volume={44}, issn={0730-0301}, journal={ACM Transactions on Graphics}, author={Xu, Yu and Tang, Fan and Cao, Juan and Zhang, Yuxin and Deussen, Oliver and Dong, Weiming and Li, Jintao and Lee, Tong-Yee}, note={Article Number: 21} }
kops.citation.iso690	XU, Yu, Fan TANG, Juan CAO, Yuxin ZHANG, Oliver DEUSSEN, Weiming DONG, Jintao LI, Tong-Yee LEE, 2025. B4M : Breaking Low-Rank Adapter for Making Content-Style Customization. In: ACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Available under: doi: 10.1145/3728461	eng
kops.citation.rdf	<rdf:RDF xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:void="http://rdfs.org/ns/void#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/72996"> <dc:contributor>Dong, Weiming</dc:contributor> <dc:creator>Xu, Yu</dc:creator> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dcterms:issued>2025-04-30</dcterms:issued> <dc:creator>Li, Jintao</dc:creator> <dc:contributor>Deussen, Oliver</dc:contributor> <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/72996"/> <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43615"/> <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-04-10T09:14:54Z</dc:date> <dc:language>eng</dc:language> <dc:creator>Tang, Fan</dc:creator> <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/72996/1/Xu_2-1r3ers966rekm9.pdf"/> <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/> <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/> <dc:contributor>Cao, Juan</dc:contributor> <dc:creator>Zhang, Yuxin</dc:creator> <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2025-04-10T09:14:54Z</dcterms:available> <dc:creator>Deussen, Oliver</dc:creator> <dc:creator>Dong, Weiming</dc:creator> <dc:creator>Cao, Juan</dc:creator> <dc:contributor>Lee, Tong-Yee</dc:contributor> <dcterms:abstract>Personalized generation paradigms empower designers to customize visual intellectual properties with the help of textual descriptions by adapting pre-trained text-to-image models 
on a few images. Recent studies focus on simultaneously customizing content and detailed visual style in images but often struggle with entangling the two. In this study, we reconsider the customization of content and style concepts from the perspective of parameter space construction. Unlike existing methods that utilize a shared parameter space for content and style learning, we propose a novel framework that separates the parameter space to facilitate individual learning of content and style by introducing “partly learnable projection” (PLP) matrices to separate the original adapters into divided sub-parameter spaces. A “break-for-make” customization learning pipeline based on PLP is proposed: we first break the original adapters into “up projection” and “down projection” for content and style concept under orthogonal prior and then make the entity parameter space by reconstructing the content and style PLPs matrices by using Riemannian precondition to adaptively balance content and style learning. Experiments on various styles, including textures, materials, and artistic style, show that our method outperforms state-of-the-art single/multiple concept learning pipelines regarding content-style-prompt alignment. 
Code is available at: https://github.com/ICTMCG/Break-for-make.</dcterms:abstract> <dc:contributor>Zhang, Yuxin</dc:contributor> <dc:contributor>Tang, Fan</dc:contributor> <dc:creator>Lee, Tong-Yee</dc:creator> <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/72996/1/Xu_2-1r3ers966rekm9.pdf"/> <foaf:homepage rdf:resource="http://localhost:8080/"/> <dc:contributor>Li, Jintao</dc:contributor> <dcterms:title>B4M : Breaking Low-Rank Adapter for Making Content-Style Customization</dcterms:title> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/> <dc:rights>terms-of-use</dc:rights> <dc:contributor>Xu, Yu</dc:contributor> <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/43615"/> </rdf:Description> </rdf:RDF>
kops.description.funding	{"first":"dfg","second":"EXC 2117-422037984"}
kops.description.openAccess	openaccesshybrid
kops.flag.isPeerReviewed	true
kops.flag.knbibliography	true
kops.identifier.nbn	urn:nbn:de:bsz:352-2-1r3ers966rekm9
kops.sourcefield	ACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Available under: doi: 10.1145/3728461	eng
kops.sourcefield.plain	ACM Transactions on Graphics. ACM. 2025, 44(2), 21. ISSN 0730-0301. eISSN 1557-7368. Available under: doi: 10.1145/3728461	eng
relation.isAuthorOfPublication	b73b5935-736c-45ce-b7c0-bdeaecbca1f0
relation.isAuthorOfPublication	4e85f041-bb89-4e27-b7d6-acd814feacb8
relation.isAuthorOfPublication.latestForDiscovery	b73b5935-736c-45ce-b7c0-bdeaecbca1f0
source.bibliographicInfo.articleNumber	21
source.bibliographicInfo.issue	2
source.bibliographicInfo.volume	44
source.identifier.eissn	1557-7368
source.identifier.issn	0730-0301
source.periodicalTitle	ACM Transactions on Graphics
source.publisher	ACM
temp.description.funding	{"second":"Z231100005923033","first":"Beijing Science and Technology Plan Project"}

Files

Original bundle

Name: Xu_2-1r3ers966rekm9.pdf
Size: 1.97 MB
Format: Adobe Portable Document Format
Downloads: 156

Collections

Computer and Information Science: Publications
Cluster of Excellence "Centre for the Advanced Study of Collective Behaviour": Publications