Amino acid dipeptide frequency for Dabieshan Tick Virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
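A minimal sketch of how such frequencies could be computed is given below. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter residue codes, the toy sequences, and the choice to count overlapping pairs within each protein only are all assumptions made for illustration.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each protein only (no pair spans
    two proteins) and sequences use one-letter residue codes.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the real proteome: "MKKLA" yields MK, KK, KL, LA and "AKK" yields AK, KK.
freqs = dipeptide_per_mille(["MKKLA", "AKK"])
```

Here `freqs["KK"]` is two occurrences out of six pairs, expressed in per mille.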

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
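The uniform expectation and the red/blue marking can be sketched as follows. The helper names are hypothetical, and using the uniform value of 2.5‰ as the exact threshold is an assumption based on the description above; the page does not state how the cutoff is applied.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"

expected = expected_per_mille()            # 1000 / 400 = 2.5 per mille, i.e. 0.25%
serleu = classify(10.216, expected)        # SerLeu value from the table
alatrp = classify(0.393, expected)         # AlaTrp value from the table
```

With Xaa included, `expected_per_mille(21)` gives 1000/441, roughly 2.27‰.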

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.894 ± 2.603
AlaCys: 1.572 ± 0.692
AlaAsp: 2.358 ± 0.48
AlaGlu: 5.894 ± 0.739
AlaPhe: 3.143 ± 1.244
AlaGly: 4.322 ± 2.543
AlaHis: 1.965 ± 0.528
AlaIle: 3.536 ± 1.133
AlaLys: 2.358 ± 0.48
AlaLeu: 7.073 ± 2.282
AlaMet: 3.929 ± 1.408
AlaAsn: 1.572 ± 0.622
AlaPro: 2.358 ± 1.061
AlaGln: 1.965 ± 1.487
AlaArg: 5.501 ± 2.724
AlaSer: 6.68 ± 0.912
AlaThr: 3.536 ± 1.133
AlaVal: 3.929 ± 0.796
AlaTrp: 0.393 ± 0.173
AlaTyr: 0.393 ± 0.173
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.393 ± 1.434
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.393 ± 0.173
CysPhe: 0.786 ± 0.885
CysGly: 1.179 ± 1.226
CysHis: 0.0 ± 0.0
CysIle: 0.393 ± 0.173
CysLys: 1.572 ± 0.692
CysLeu: 1.965 ± 0.865
CysMet: 0.393 ± 0.173
CysAsn: 0.393 ± 0.173
CysPro: 0.393 ± 0.173
CysGln: 0.786 ± 0.346
CysArg: 1.572 ± 0.622
CysSer: 1.572 ± 0.692
CysThr: 0.393 ± 0.173
CysVal: 0.393 ± 0.173
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.536 ± 2.235
AspCys: 0.0 ± 0.0
AspAsp: 3.536 ± 0.666
AspGlu: 3.536 ± 0.666
AspPhe: 1.179 ± 0.519
AspGly: 3.536 ± 2.233
AspHis: 0.786 ± 0.346
AspIle: 2.75 ± 0.492
AspLys: 2.75 ± 1.21
AspLeu: 4.715 ± 1.093
AspMet: 1.965 ± 0.528
AspAsn: 0.393 ± 0.173
AspPro: 3.536 ± 1.141
AspGln: 2.75 ± 1.362
AspArg: 3.929 ± 1.207
AspSer: 4.322 ± 0.941
AspThr: 1.965 ± 0.528
AspVal: 3.929 ± 1.056
AspTrp: 1.179 ± 0.519
AspTyr: 1.965 ± 0.865
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.894 ± 1.039
GluCys: 1.179 ± 0.519
GluAsp: 5.108 ± 1.251
GluGlu: 9.037 ± 3.024
GluPhe: 3.929 ± 1.729
GluGly: 3.536 ± 1.556
GluHis: 1.965 ± 0.528
GluIle: 3.929 ± 0.796
GluLys: 6.287 ± 2.767
GluLeu: 5.501 ± 2.724
GluMet: 1.965 ± 0.865
GluAsn: 0.786 ± 0.346
GluPro: 1.572 ± 0.622
GluGln: 1.965 ± 1.627
GluArg: 6.287 ± 3.235
GluSer: 3.143 ± 1.383
GluThr: 3.929 ± 1.207
GluVal: 5.501 ± 1.413
GluTrp: 1.965 ± 0.528
GluTyr: 1.572 ± 0.692
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.536 ± 0.892
PheCys: 0.786 ± 0.346
PheAsp: 1.965 ± 0.865
PheGlu: 2.358 ± 0.48
PhePhe: 4.322 ± 1.299
PheGly: 1.572 ± 0.692
PheHis: 2.358 ± 1.037
PheIle: 1.179 ± 0.519
PheLys: 3.929 ± 1.729
PheLeu: 4.322 ± 0.667
PheMet: 0.786 ± 0.346
PheAsn: 1.965 ± 1.092
PhePro: 2.75 ± 0.492
PheGln: 1.179 ± 1.918
PheArg: 2.358 ± 1.49
PheSer: 3.143 ± 1.082
PheThr: 2.75 ± 0.492
PheVal: 1.965 ± 0.865
PheTrp: 0.0 ± 0.0
PheTyr: 2.358 ± 1.037
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.75 ± 2.369
GlyCys: 0.393 ± 1.434
GlyAsp: 5.108 ± 1.083
GlyGlu: 1.572 ± 1.77
GlyPhe: 1.965 ± 0.865
GlyGly: 3.929 ± 3.877
GlyHis: 0.786 ± 1.323
GlyIle: 3.536 ± 0.666
GlyLys: 1.965 ± 0.865
GlyLeu: 3.536 ± 0.666
GlyMet: 3.536 ± 1.797
GlyAsn: 1.572 ± 0.692
GlyPro: 2.358 ± 1.037
GlyGln: 0.786 ± 0.346
GlyArg: 3.143 ± 0.559
GlySer: 4.715 ± 1.093
GlyThr: 2.358 ± 1.49
GlyVal: 5.894 ± 1.812
GlyTrp: 0.393 ± 1.434
GlyTyr: 2.75 ± 1.362
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.358 ± 0.48
HisCys: 0.786 ± 0.346
HisAsp: 0.393 ± 0.173
HisGlu: 1.572 ± 0.692
HisPhe: 0.786 ± 0.885
HisGly: 2.358 ± 1.037
HisHis: 2.358 ± 3.622
HisIle: 3.143 ± 1.244
HisLys: 1.572 ± 0.692
HisLeu: 1.572 ± 1.148
HisMet: 1.179 ± 0.745
HisAsn: 1.179 ± 0.519
HisPro: 2.358 ± 1.33
HisGln: 0.393 ± 0.173
HisArg: 1.572 ± 0.622
HisSer: 2.358 ± 1.037
HisThr: 0.786 ± 0.885
HisVal: 1.179 ± 0.519
HisTrp: 0.0 ± 0.0
HisTyr: 2.358 ± 1.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.143 ± 1.029
IleCys: 0.786 ± 0.885
IleAsp: 3.143 ± 1.383
IleGlu: 4.715 ± 2.075
IlePhe: 1.179 ± 0.519
IleGly: 3.143 ± 2.579
IleHis: 2.75 ± 1.21
IleIle: 3.143 ± 0.559
IleLys: 3.536 ± 1.556
IleLeu: 4.715 ± 1.093
IleMet: 1.179 ± 0.519
IleAsn: 1.179 ± 0.745
IlePro: 2.358 ± 0.48
IleGln: 1.572 ± 2.636
IleArg: 4.715 ± 0.96
IleSer: 3.536 ± 2.235
IleThr: 3.536 ± 1.773
IleVal: 1.572 ± 0.692
IleTrp: 0.393 ± 1.035
IleTyr: 1.179 ± 0.519
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.287 ± 1.936
LysCys: 0.393 ± 0.173
LysAsp: 3.536 ± 0.666
LysGlu: 5.108 ± 1.527
LysPhe: 2.75 ± 1.21
LysGly: 1.179 ± 0.519
LysHis: 1.179 ± 0.519
LysIle: 3.536 ± 0.666
LysLys: 4.322 ± 0.941
LysLeu: 5.894 ± 1.576
LysMet: 1.179 ± 0.519
LysAsn: 1.965 ± 1.092
LysPro: 3.536 ± 1.141
LysGln: 3.536 ± 0.666
LysArg: 3.929 ± 1.729
LysSer: 3.929 ± 0.796
LysThr: 3.929 ± 1.207
LysVal: 3.536 ± 1.141
LysTrp: 0.786 ± 0.346
LysTyr: 1.965 ± 0.865
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.287 ± 3.594
LeuCys: 1.179 ± 0.745
LeuAsp: 1.572 ± 1.648
LeuGlu: 5.501 ± 1.413
LeuPhe: 5.501 ± 0.608
LeuGly: 5.894 ± 1.584
LeuHis: 3.143 ± 0.559
LeuIle: 3.143 ± 2.371
LeuLys: 8.644 ± 1.881
LeuLeu: 6.68 ± 1.218
LeuMet: 3.929 ± 1.056
LeuAsn: 3.929 ± 2.183
LeuPro: 4.322 ± 2.145
LeuGln: 3.929 ± 0.796
LeuArg: 5.894 ± 1.039
LeuSer: 6.287 ± 1.742
LeuThr: 9.43 ± 0.83
LeuVal: 5.108 ± 1.083
LeuTrp: 0.786 ± 0.346
LeuTyr: 2.75 ± 0.492
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.75 ± 1.362
MetCys: 0.393 ± 0.173
MetAsp: 3.536 ± 0.666
MetGlu: 1.965 ± 0.865
MetPhe: 1.572 ± 1.148
MetGly: 1.965 ± 1.627
MetHis: 1.965 ± 0.528
MetIle: 1.572 ± 0.692
MetLys: 0.786 ± 0.346
MetLeu: 1.572 ± 0.622
MetMet: 1.179 ± 0.519
MetAsn: 0.786 ± 0.885
MetPro: 1.572 ± 0.622
MetGln: 1.179 ± 0.745
MetArg: 1.572 ± 0.692
MetSer: 3.536 ± 1.133
MetThr: 3.536 ± 0.666
MetVal: 1.179 ± 0.519
MetTrp: 0.786 ± 0.346
MetTyr: 0.786 ± 0.346
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.179 ± 1.918
AsnCys: 0.786 ± 0.346
AsnAsp: 1.179 ± 0.519
AsnGlu: 1.965 ± 1.487
AsnPhe: 3.536 ± 2.233
AsnGly: 0.786 ± 0.346
AsnHis: 1.572 ± 0.692
AsnIle: 1.179 ± 0.519
AsnLys: 0.786 ± 0.346
AsnLeu: 4.715 ± 1.093
AsnMet: 1.965 ± 0.865
AsnAsn: 1.965 ± 1.092
AsnPro: 3.143 ± 1.029
AsnGln: 1.179 ± 1.226
AsnArg: 1.179 ± 0.745
AsnSer: 2.75 ± 1.057
AsnThr: 0.786 ± 0.346
AsnVal: 1.965 ± 0.865
AsnTrp: 0.786 ± 1.323
AsnTyr: 0.786 ± 0.346
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.929 ± 3.254
ProCys: 0.0 ± 0.0
ProAsp: 2.75 ± 1.057
ProGlu: 3.143 ± 1.383
ProPhe: 2.358 ± 1.037
ProGly: 2.75 ± 1.362
ProHis: 0.786 ± 0.885
ProIle: 2.75 ± 1.176
ProLys: 2.75 ± 1.21
ProLeu: 4.715 ± 1.866
ProMet: 0.393 ± 0.173
ProAsn: 1.179 ± 0.745
ProPro: 2.75 ± 6.94
ProGln: 1.572 ± 0.692
ProArg: 1.965 ± 2.998
ProSer: 3.929 ± 2.321
ProThr: 2.75 ± 1.057
ProVal: 2.75 ± 1.176
ProTrp: 0.786 ± 1.323
ProTyr: 0.786 ± 0.346
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.965 ± 0.528
GlnCys: 0.393 ± 0.173
GlnAsp: 3.536 ± 4.571
GlnGlu: 2.358 ± 0.48
GlnPhe: 0.393 ± 0.173
GlnGly: 1.572 ± 0.622
GlnHis: 1.179 ± 0.519
GlnIle: 2.75 ± 1.176
GlnLys: 0.786 ± 0.346
GlnLeu: 3.536 ± 1.141
GlnMet: 1.179 ± 0.745
GlnAsn: 1.179 ± 0.519
GlnPro: 0.786 ± 1.976
GlnGln: 1.179 ± 0.745
GlnArg: 3.143 ± 2.371
GlnSer: 3.143 ± 0.559
GlnThr: 3.143 ± 1.383
GlnVal: 2.358 ± 2.453
GlnTrp: 1.179 ± 1.226
GlnTyr: 0.786 ± 0.885
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.322 ± 1.299
ArgCys: 0.786 ± 1.323
ArgAsp: 1.965 ± 0.865
ArgGlu: 6.68 ± 1.218
ArgPhe: 2.358 ± 0.48
ArgGly: 1.965 ± 1.627
ArgHis: 1.965 ± 0.528
ArgIle: 4.715 ± 1.866
ArgLys: 4.715 ± 2.09
ArgLeu: 5.108 ± 2.202
ArgMet: 2.75 ± 1.21
ArgAsn: 3.536 ± 0.892
ArgPro: 2.358 ± 1.037
ArgGln: 4.322 ± 2.543
ArgArg: 3.143 ± 1.383
ArgSer: 5.108 ± 1.527
ArgThr: 3.143 ± 0.559
ArgVal: 5.108 ± 0.956
ArgTrp: 1.179 ± 0.519
ArgTyr: 0.786 ± 0.346
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.322 ± 1.299
SerCys: 0.786 ± 0.346
SerAsp: 4.322 ± 0.941
SerGlu: 3.536 ± 1.556
SerPhe: 3.143 ± 0.559
SerGly: 4.322 ± 1.902
SerHis: 1.572 ± 1.648
SerIle: 3.143 ± 1.082
SerLys: 3.536 ± 1.556
SerLeu: 10.216 ± 0.383
SerMet: 1.572 ± 0.692
SerAsn: 2.75 ± 2.715
SerPro: 5.108 ± 1.083
SerGln: 2.75 ± 0.492
SerArg: 5.894 ± 2.594
SerSer: 5.501 ± 1.657
SerThr: 3.143 ± 1.029
SerVal: 5.108 ± 1.251
SerTrp: 0.786 ± 0.346
SerTyr: 1.572 ± 1.148
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.929 ± 1.729
ThrCys: 0.393 ± 0.173
ThrAsp: 3.143 ± 1.244
ThrGlu: 5.501 ± 1.413
ThrPhe: 2.358 ± 0.48
ThrGly: 3.143 ± 1.082
ThrHis: 0.786 ± 0.346
ThrIle: 1.965 ± 0.865
ThrLys: 7.073 ± 2.543
ThrLeu: 6.68 ± 1.918
ThrMet: 1.179 ± 0.745
ThrAsn: 3.929 ± 1.729
ThrPro: 2.358 ± 2.855
ThrGln: 3.536 ± 0.666
ThrArg: 3.143 ± 1.383
ThrSer: 4.715 ± 1.407
ThrThr: 4.715 ± 2.075
ThrVal: 3.536 ± 0.892
ThrTrp: 0.393 ± 0.173
ThrTyr: 1.179 ± 0.519
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.322 ± 0.941
ValCys: 1.179 ± 0.519
ValAsp: 3.143 ± 1.383
ValGlu: 7.073 ± 0.237
ValPhe: 3.143 ± 0.559
ValGly: 4.322 ± 1.299
ValHis: 1.179 ± 0.745
ValIle: 1.965 ± 2.464
ValLys: 3.929 ± 0.768
ValLeu: 5.108 ± 1.527
ValMet: 2.75 ± 1.057
ValAsn: 2.75 ± 1.21
ValPro: 0.786 ± 0.346
ValGln: 1.179 ± 0.519
ValArg: 4.322 ± 0.994
ValSer: 2.358 ± 2.291
ValThr: 3.929 ± 0.796
ValVal: 4.322 ± 0.994
ValTrp: 1.572 ± 1.148
ValTyr: 1.965 ± 0.528
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.572 ± 0.622
TrpCys: 0.393 ± 0.173
TrpAsp: 0.786 ± 0.346
TrpGlu: 0.393 ± 0.173
TrpPhe: 1.179 ± 0.519
TrpGly: 0.393 ± 0.173
TrpHis: 0.0 ± 0.0
TrpIle: 0.393 ± 0.173
TrpLys: 0.393 ± 0.173
TrpLeu: 3.536 ± 1.133
TrpMet: 0.0 ± 0.0
TrpAsn: 0.393 ± 0.173
TrpPro: 0.0 ± 0.0
TrpGln: 0.393 ± 1.434
TrpArg: 0.786 ± 2.868
TrpSer: 0.393 ± 0.173
TrpThr: 1.965 ± 1.487
TrpVal: 0.786 ± 0.346
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.393 ± 0.173
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.393 ± 0.173
TyrAsp: 1.179 ± 0.519
TyrGlu: 2.75 ± 1.21
TyrPhe: 0.0 ± 0.0
TyrGly: 1.572 ± 1.77
TyrHis: 1.965 ± 0.865
TyrIle: 2.75 ± 1.21
TyrLys: 0.786 ± 0.346
TyrLeu: 2.75 ± 0.492
TyrMet: 0.786 ± 0.346
TyrAsn: 0.786 ± 0.346
TyrPro: 0.393 ± 1.035
TyrGln: 0.393 ± 1.035
TyrArg: 1.965 ± 0.865
TyrSer: 1.965 ± 0.865
TyrThr: 3.929 ± 1.729
TyrVal: 1.179 ± 1.226
TyrTrp: 0.786 ± 0.346
TyrTyr: 0.786 ± 0.346
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2546 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
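One way such a protein-level bootstrap could look is sketched below. The page states only that bootstrapping was repeated 100 times at the protein level; reporting the standard deviation of the resampled frequencies as the error, the function names, the fixed seed, and the toy sequences are all assumptions made for illustration.

```python
import random
from collections import Counter

def pair_freq(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Sample standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_freq(sample, pair))
    mean = sum(estimates) / n_rounds
    var = sum((x - mean) ** 2 for x in estimates) / (n_rounds - 1)
    return var ** 0.5

proteins = ["MKKLAEEL", "AKKSLSL", "MSLKK"]  # toy sequences, not the real proteome
err = bootstrap_error(proteins, "KK")
```

With only a handful of proteins, as here, individual resamples can differ a lot from one another, which may explain why some errors in the table exceed the corresponding frequency.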

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski