Add recombinase assembly algorithm for attB/attP -- Generalized integrase issue #435 by areebamomin · Pull Request #496 · pydna-group/pydna

areebamomin · 2025-12-10T21:46:30Z

This PR implements a recombinase-based assembly algorithm for pydna by adding a new function, make_recombinase_algorithm, to src/pydna/assembly2.py. The function identifies homologous recombination regions by extracting the lowercase core shared between the attB and attP recognition sites, and it returns match tuples in the format Assembly expects, so that it behaves consistently with the other supported assembly strategies. A corresponding test suite (tests/test_recombinase_overlap.py) verifies homology detection, edge cases, multiple matches, and full integration with the Assembly class. All tests pass with both python run_test.py and pytest, and all doctests in assembly2.py run without errors.
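The core-extraction idea described above can be sketched in plain Python. This is an illustration only, not the code in this PR: the names split_site and core_matches are hypothetical, the sequences are plain strings rather than Dseqrecord objects, and the (start_in_x, start_in_y, length) tuple layout is assumed to be what Assembly algorithms return.

```python
import re


def split_site(site: str) -> tuple[int, str]:
    """Return (offset, core) for the single lowercase run in a site string."""
    runs = list(re.finditer(r"[a-z]+", site))
    if len(runs) != 1:
        raise ValueError(f"expected exactly one lowercase core in {site!r}")
    return runs[0].start(), runs[0].group()


def core_matches(seq_x: str, seq_y: str, att_b: str, att_p: str) -> list[tuple[int, int, int]]:
    """Find attB in seq_x and attP in seq_y, and report where their cores sit.

    Assumed output layout: (start_in_x, start_in_y, length).
    Only non-overlapping, top-strand occurrences are considered in this sketch.
    """
    off_b, core_b = split_site(att_b)
    off_p, core_p = split_site(att_p)
    if core_b != core_p:
        raise ValueError("attB and attP must share the same lowercase core")

    # Case is only used to mark the core, so the search itself ignores it.
    x_hits = [m.start() for m in re.finditer(re.escape(att_b), seq_x, re.IGNORECASE)]
    y_hits = [m.start() for m in re.finditer(re.escape(att_p), seq_y, re.IGNORECASE)]

    return [(x + off_b, y + off_p, len(core_b)) for x in x_hits for y in y_hits]


# Toy sites, made up for the example (not real attB/attP sequences)
matches = core_matches("GGGGAAGTTTCCGGGG", "TTCCATTTGGTT", "AAGtttCC", "CCAtttGG")
print(matches)
```

Each tuple points at the shared core rather than at the full recognition site, which is what would let a downstream assembly step treat the core like any other overlap.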

Hopefully this closes, or at least makes some progress on, #435!

Thank you for letting me have a go at learning more about the program. I hope I can build on this towards a successful contribution!

manulera · 2025-12-11T15:43:58Z

areebamomin · 2025-12-13T20:28:04Z

Hi @manulera

Thank you for looking it over and the feedback! I will work on this in the following weeks.

manulera · 2025-12-15T08:22:43Z

Degenerate nucleotide codes represent a position in a consensus sequence that can be occupied by more than one nucleotide. They are listed here: https://people.bath.ac.uk/jm2219/biology/degenerate.htm
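As a rough illustration of what a degenerate code means in practice, the standard IUPAC table can be expanded into a regular expression by hand. The helper name degenerate_to_regex is hypothetical and this is not how pydna implements it; it only shows the idea.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def degenerate_to_regex(site: str) -> str:
    """Expand a degenerate site such as 'ACNT' into a regex string."""
    return "".join(IUPAC[base] for base in site.upper())


pattern = degenerate_to_regex("ACNT")
print(pattern)

# Case-insensitive search on a plain string (linear, top strand only)
spans = [m.span() for m in re.finditer(pattern, "CTaaaACGTaaaAC", re.IGNORECASE)]
print(spans)
```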

You don't have to handle how to find these degenerate sequences in your new code. You can use the function dseqrecord_finditer included in the library. Below is a minimal usage example:

```python
from pydna.sequence_regex import dseqrecord_finditer, compute_regex_site
from pydna.dseqrecord import Dseqrecord

seq = Dseqrecord('CTaaaACGTaaaAC')

# Turn the degenerate sequence into a regex pattern (case insensitive)
regex_pattern = compute_regex_site('ACNT')
print('regex pattern', regex_pattern)

# Find it in the linear sequence
result = dseqrecord_finditer(regex_pattern, seq)
print(list(result))

# Circular sequences are handled too: note that the motif
# spanning the origin is returned with the span 12,16
seq2 = Dseqrecord('CTaaaACGTaaaAC', circular=True)
result2 = dseqrecord_finditer(regex_pattern, seq2)
print(list(result2))
```
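To see why a span such as 12,16 can come out of a 14-base circular sequence, here is a plain-string sketch of one common way to search across the origin: append the first few bases to the end and keep only matches that start inside the original sequence. The function circular_spans is hypothetical and is not a description of how dseqrecord_finditer works internally.

```python
import re


def circular_spans(pattern: str, seq: str, motif_len: int) -> list[tuple[int, int]]:
    """Find fixed-length motif matches in a circular sequence given as a string.

    The end coordinate may exceed len(seq) when the match wraps the origin.
    """
    # Append just enough bases for a motif starting at the last position
    extended = seq + seq[: motif_len - 1]
    regex = re.compile(pattern, re.IGNORECASE)
    return [m.span() for m in regex.finditer(extended) if m.start() < len(seq)]


# 'ACNT' written out as a regex; the motif is 4 bases long
print(circular_spans("AC[ACGT]T", "CTaaaACGTaaaAC", 4))
```

The second match starts at position 12 ("AC") and finishes with the first two bases of the sequence ("CT"), which is where the end coordinate of 16 comes from.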

BjornFJohansson · 2026-01-20T08:00:54Z

@areebamomin @manulera

Any updates on this?

areebamomin · 2026-01-21T15:33:49Z

@BjornFJohansson @manulera

I was traveling over the holidays for the last month, but I am hoping to work more on this in the next couple of weeks!

BjornFJohansson · 2026-01-21T15:41:06Z

@areebamomin Good to hear. Keep in mind that pydna has undergone some fundamental internal changes: the Dseq class now relies on a single string instead of two. See the last release, v5.5.5.

manulera · 2026-02-16T08:45:27Z

Hi @areebamomin, pinging you here. Do you think you will have time to finish this?

areebamomin · 2026-02-17T16:13:00Z

Hi @manulera, I just emailed you!

Add recombinase assembly algorithm for attB/attP

74af4ce

manulera mentioned this pull request Feb 25, 2026

Generalised recombinase functionality #564

Merged

manulera closed this Mar 16, 2026

areebamomin wants to merge 1 commit into pydna-group:master from areebamomin:issue_435
