1.2.9 • Published 3 days ago

@datagrok/sequence-translator v1.2.9

Sequence Translator

SequenceTranslator is a package for the Datagrok platform that translates oligonucleotide sequences between different representations.

Presentation: https://www.youtube.com/watch?v=2xuxJjpjXi4&t=3783s

Oligonucleotides are characterized by the sequence of nucleotide modifications that make up the entire molecule. An oligonucleotide is usually 13-25 nucleotides long; the length is denoted by the suffix "-mer".

Oligonucleotides are chemically synthesized. Chain assembly proceeds in the 3' to 5' direction by following a routine procedure referred to as a "synthetic cycle". Completion of a single synthetic cycle results in the addition of one nucleotide residue to the growing chain.

Use cases

| Name | Steps |
| --- | --- |
| Translate one sequence of a supported type (see Sequence representations) | Paste the sequence into the text field in the MAIN tab of the app |
| Translate many sequences of a supported type (see Sequence representations) | 1. Open SequenceTranslator<br>2. Drag & drop an Excel or CSV file with sequences into Datagrok. The platform detects columns with sequences if they match the corresponding RegExp rules<br>3. Right-click the column header and open the Convert menu<br>4. Choose a translation function<br>The result column is added to the right of the table |
| Translate sequences using a new Axolabs pattern | 1. Go to the AXOLABS tab<br>2. Drag & drop an Excel or CSV file with sequences into Datagrok<br>3. Choose your table in the Table field<br>4. Choose SS Column and, if needed, AS Column and ID Column (needed to add a column with the concatenated ID and pattern name)<br>5. Press the Convert Sequences button<br>The result column(s) are added to the right of the table. Save the pattern if needed |
| Translate sequences using an existing Axolabs pattern | 1. Go to the AXOLABS tab<br>2. Select your pattern in the Load Pattern field<br>3. Do steps 2-5 from the previous use case |

Sequence representations

In the modification section on the right side of the screen, you can select a modification for each base in your input sequence and check whether a PTO linkage is required after the base.

Representations are split into categories by the synthesizer's sequence format (BioSpring / Axolabs / MerMade) and the type of molecule (Gapmers / siRNA).

| Name | Example | Regular expression |
| --- | --- | --- |
| DNA nucleotides | `AGGTCTTCATGACTTCGGCC` | `^[ATGC]{10,}$` |
| RNA nucleotides | `UUCAACUGCUUACGUCUUU` | `^[AUGC]{10,}$` |
| BioSpring / Gapmers | `6*8*8*5*7*T*T*9*A*T*G*A*9*T*T*7*8*8*7*7` | `^[*56789ATGC]{30,}$` |
| GCRS / Gapmers | `moeAnpsmoeGnpsmoeGnpsmoeUnpsmoe5mCnpsTpsTpsCpsApsTpsGpsApsCpsTpsTpsmoe5mCnpsmoeGnpsmoeGnpsmoe5mCnpsmoe5mC` | `^(?=.*moe)(?=.*5mC)(?=.*ps){30,}` |
| BioSpring / siRNA | `5*1*766354715274575*5*5` | `^[*1-8]{30,}$` |
| Axolabs / siRNA | `usUfscaaCfuGfcUfuAfcGfucususu` | `^[fsACGUacgu]{20,}$` |
| GCRS | `mUpsfUpsmCmAmAfCmUfGmCfUmUfAmCfGmUmCmUpsmUpsmU` | `^[fmpsACGU]{30,}$` |
| OP100 | `ug*aa*uu*ag*ag*ga*ga*cg*ga*cac` | `^[acgu*]{10,}$` |
| MerMade (MM) | `kgKHKGLHIELGJFHKiehK` | `^[IiJjKkLlEeFfGgHhQq]{10,}$` |
| Applied Biosystems Inc. (ABI) | `58877TTTACCACGT56788` | `^[5678ATGC]{10,}$` |
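
The regular expressions in the table above can be tried outside the platform. The following is a minimal sketch, not the package's actual detection code: it copies a subset of the expressions verbatim from the table and reports every representation whose expression matches. The names `FORMAT_PATTERNS` and `matchingFormats` are hypothetical and exist only for this illustration.

```typescript
// Subset of the regular expressions from the table above, keyed by representation name.
// FORMAT_PATTERNS and matchingFormats are hypothetical names, not part of the package API.
const FORMAT_PATTERNS: Record<string, RegExp> = {
  'DNA nucleotides': /^[ATGC]{10,}$/,
  'RNA nucleotides': /^[AUGC]{10,}$/,
  'OP100': /^[acgu*]{10,}$/,
  'MerMade (MM)': /^[IiJjKkLlEeFfGgHhQq]{10,}$/,
  'Applied Biosystems Inc. (ABI)': /^[5678ATGC]{10,}$/,
};

// Returns the names of all representations whose expression matches the sequence,
// in the order in which they are declared above.
function matchingFormats(sequence: string): string[] {
  return Object.keys(FORMAT_PATTERNS)
    .filter((name) => FORMAT_PATTERNS[name].test(sequence));
}

console.log(matchingFormats('UUCAACUGCUUACGUCUUU'));
// prints [ 'RNA nucleotides' ]
console.log(matchingFormats('AGGTCTTCATGACTTCGGCC'));
// prints [ 'DNA nucleotides', 'Applied Biosystems Inc. (ABI)' ]
```

The second call shows that the expressions are not mutually exclusive: a plain DNA sequence also satisfies the ABI expression, because the ABI character class is a superset of the DNA one. Any code that needs a single answer therefore has to apply its own tie-breaking rule; how the package resolves such cases is not described here.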

Axolabs nucleotide modifications

| Name | Description | Symbols |
| --- | --- | --- |
| RNA | RNA nucleotides | A, C, G, U |
| DNA | DNA nucleotides | dA, dC, dG, dT |
| 2'-Fluoro | 2'-Fluoro nucleotides | Af, Cf, Gf, Uf |
| 2'-O-Methyl | 2'-O-Methyl nucleotides | a, c, g, u |
| 2'-O-MOE | 2'-O-MOE nucleotides (including 5-Methyl C) | Am, Cm, Gm, Tm |
| GNA | Glycol nucleic acid | (GNA-A), (GNA-C), (GNA-G), (GNA-T) |
| LNA | Locked nucleic acid (including 5-Methyl C) | Ab, Cb, Gb, Tb |
| UNA | Unlocked nucleotides | Ao, Co, Go, Uo |
| A | Adenine | a |
| C | Cytosine | c |
| G | Guanine | g |
| U | Uracil | u |
| X-New | | X |
| Y-New | | Y |
| Z-New | | Z |
| InvAbasic | Inverted abasic capped | (invabasic) |
| 5'-vinylps | 5'-vinylphosphonate-2'-OMe-uridine | (vinu) |
| InvAbasic(o) | Inverted abasic capped (overhang) | (invabasic) |
| 2'-OMe-U(o) | Nucleotide Uridine with 2'-O-Methyl protection (overhang) | mU |

App glossary

| Term | Definition |
| --- | --- |
| Pattern | Defines translation rules. Contains the pattern name, the modifications and PTO linkages for both strands, and a comment displayed on the image |
| PTO linkage | Indicates whether the oligonucleotide has a phosphorothioate bond (ps linkage) after the base |
| Sense strand (SS) | Has the same nucleotide sequence as the mRNA that encodes a functional protein. It carries the information that is readable on the RNA, which is why it is called the coding side |
| Antisense strand (AS) | Non-coding DNA strand of a gene. A cell uses the antisense DNA strand as a template for producing messenger RNA (mRNA), which directs the synthesis of a protein |
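
To make the Pattern and PTO linkage terms concrete, here is a hedged sketch of how a per-base pattern could be applied to one strand. It is not the package's implementation. The `StrandPattern` interface and the `applyPattern` function are hypothetical, only three of the modifications from the Axolabs table are covered, and the `s` suffix for a PTO linkage is an assumption inferred from the Axolabs / siRNA example `usUfscaaCfuGfcUfuAfcGfucususu`.

```typescript
// Three of the modifications listed in the Axolabs table; the rest are omitted for brevity.
type Modification = 'RNA' | "2'-Fluoro" | "2'-O-Methyl";

// Hypothetical shape of a pattern for one strand: one modification per base,
// plus a flag telling whether a PTO linkage follows that base.
interface StrandPattern {
  modifications: Modification[];
  ptoAfter: boolean[];
}

// Symbol rules taken from the Axolabs table: RNA is "A", 2'-Fluoro is "Af", 2'-O-Methyl is "a".
const RENDERERS: Record<Modification, (base: string) => string> = {
  'RNA': (base) => base.toUpperCase(),
  "2'-Fluoro": (base) => base.toUpperCase() + 'f',
  "2'-O-Methyl": (base) => base.toLowerCase(),
};

// Applies the pattern base by base. Assumption: a PTO linkage is written as "s" after the base.
function applyPattern(sequence: string, pattern: StrandPattern): string {
  if (sequence.length !== pattern.modifications.length ||
      sequence.length !== pattern.ptoAfter.length)
    throw new Error('Pattern length must equal sequence length');
  return sequence
    .split('')
    .map((base, i) => RENDERERS[pattern.modifications[i]](base) + (pattern.ptoAfter[i] ? 's' : ''))
    .join('');
}

const firstFourBases: StrandPattern = {
  modifications: ["2'-O-Methyl", "2'-Fluoro", "2'-O-Methyl", "2'-O-Methyl"],
  ptoAfter: [true, true, false, false],
};
console.log(applyPattern('UUCA', firstFourBases)); // prints usUfsca
```

The output `usUfsca` is the beginning of the Axolabs / siRNA example, whose bases spell out the RNA nucleotides example `UUCAACUGCUUACGUCUUU`.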