Amino acid dipeptide frequency for Kigluaik phantom orthophasmavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 1/400 = 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
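The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names `dipeptide_frequencies` and `classify` are hypothetical, and it assumes one-letter sequence codes, overlapping pairs counted within each protein (never across protein boundaries), and any non-standard letter mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

# 20 standard residues plus X (Xaa) for non-standard or unknown ones.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 pairs over the given proteins.

    Assumption: pairs overlap (positions i, i+1) and are counted per protein.
    """
    allowed = set(ALPHABET)
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the alphabet to X.
        seq = "".join(c if c in allowed else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(fraction, n_dipeptides=400):
    """Compare a fraction with the uniform expectation 1 / n_dipeptides."""
    expected = 1 / n_dipeptides
    if fraction > expected:
        return "over"   # would be marked red in the table
    if fraction < expected:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKILSA", "MKIL"])
    print(freqs["KI"], classify(freqs["KI"]))
```

The default of 400 in `classify` follows the 0.25% expectation stated in the text; passing `n_dipeptides=441` would use the expectation that includes Xaa instead.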

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
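As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and compares it with the uniform expectation. The helper names are hypothetical; the value 8.915 is the IleSer entry from the table.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
UNIFORM_400_PER_MILLE = 1000 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_over_uniform(value_per_mille):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value_per_mille / UNIFORM_400_PER_MILLE


if __name__ == "__main__":
    ile_ser = 8.915  # IleSer frequency from the table, in per mille
    print(per_mille_to_fraction(ile_ser))
    print(fold_over_uniform(ile_ser))
```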

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.381 ± 2.33
AlaCys: 0.615 ± 0.32
AlaAsp: 3.074 ± 1.125
AlaGlu: 3.074 ± 1.773
AlaPhe: 2.459 ± 0.408
AlaGly: 2.152 ± 1.973
AlaHis: 0.307 ± 0.16
AlaIle: 4.611 ± 1.778
AlaLys: 3.996 ± 3.973
AlaLeu: 3.996 ± 1.518
AlaMet: 0.615 ± 0.32
AlaAsn: 2.767 ± 1.008
AlaPro: 1.23 ± 0.95
AlaGln: 0.615 ± 0.475
AlaArg: 3.381 ± 0.423
AlaSer: 4.919 ± 0.657
AlaThr: 2.767 ± 0.586
AlaVal: 3.074 ± 1.722
AlaTrp: 0.615 ± 0.32
AlaTyr: 1.844 ± 0.208
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.922 ± 0.493
CysCys: 0.307 ± 0.16
CysAsp: 2.459 ± 2.102
CysGlu: 1.537 ± 0.243
CysPhe: 0.615 ± 0.285
CysGly: 0.615 ± 0.475
CysHis: 0.922 ± 0.48
CysIle: 0.922 ± 0.48
CysLys: 0.922 ± 0.247
CysLeu: 0.922 ± 0.247
CysMet: 0.307 ± 0.39
CysAsn: 0.922 ± 0.664
CysPro: 0.0 ± 0.0
CysGln: 0.615 ± 0.78
CysArg: 1.537 ± 0.243
CysSer: 2.152 ± 1.31
CysThr: 0.922 ± 0.664
CysVal: 1.23 ± 0.752
CysTrp: 0.307 ± 0.16
CysTyr: 1.537 ± 0.418
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.304 ± 0.563
AspCys: 0.0 ± 0.0
AspAsp: 3.381 ± 0.371
AspGlu: 3.996 ± 1.012
AspPhe: 3.074 ± 0.703
AspGly: 2.767 ± 0.063
AspHis: 0.307 ± 0.16
AspIle: 7.378 ± 0.354
AspLys: 3.689 ± 1.708
AspLeu: 5.841 ± 2.122
AspMet: 2.152 ± 0.795
AspAsn: 3.381 ± 0.827
AspPro: 3.074 ± 0.703
AspGln: 1.23 ± 0.303
AspArg: 3.381 ± 0.857
AspSer: 3.689 ± 0.987
AspThr: 1.844 ± 0.433
AspVal: 3.996 ± 0.583
AspTrp: 0.0 ± 0.0
AspTyr: 3.996 ± 0.583
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.304 ± 1.472
GluCys: 1.844 ± 0.854
GluAsp: 3.996 ± 1.012
GluGlu: 2.767 ± 1.008
GluPhe: 2.152 ± 0.274
GluGly: 2.767 ± 1.345
GluHis: 2.459 ± 0.408
GluIle: 3.996 ± 0.951
GluLys: 4.919 ± 1.339
GluLeu: 4.611 ± 1.725
GluMet: 1.23 ± 0.468
GluAsn: 4.611 ± 1.234
GluPro: 1.844 ± 1.37
GluGln: 1.537 ± 0.8
GluArg: 3.074 ± 0.213
GluSer: 3.996 ± 0.365
GluThr: 2.152 ± 0.281
GluVal: 6.456 ± 0.388
GluTrp: 0.307 ± 0.546
GluTyr: 4.611 ± 0.539
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.537 ± 0.508
PheCys: 0.307 ± 0.39
PheAsp: 3.074 ± 0.689
PheGlu: 1.23 ± 0.354
PhePhe: 1.23 ± 0.569
PheGly: 0.615 ± 0.285
PheHis: 0.615 ± 0.475
PheIle: 4.304 ± 1.127
PheLys: 1.23 ± 0.484
PheLeu: 3.074 ± 1.126
PheMet: 2.152 ± 0.713
PheAsn: 2.767 ± 1.008
PhePro: 0.615 ± 0.285
PheGln: 1.23 ± 0.64
PheArg: 1.844 ± 0.67
PheSer: 3.381 ± 0.371
PheThr: 2.152 ± 0.274
PheVal: 3.381 ± 1.031
PheTrp: 0.0 ± 0.0
PheTyr: 2.767 ± 1.509
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.381 ± 0.584
GlyCys: 1.537 ± 1.44
GlyAsp: 2.459 ± 0.607
GlyGlu: 2.767 ± 0.552
GlyPhe: 2.152 ± 0.786
GlyGly: 2.459 ± 0.969
GlyHis: 0.922 ± 0.493
GlyIle: 4.304 ± 1.571
GlyLys: 5.533 ± 1.087
GlyLeu: 4.611 ± 1.551
GlyMet: 1.537 ± 0.243
GlyAsn: 3.996 ± 1.012
GlyPro: 1.844 ± 0.788
GlyGln: 1.537 ± 0.243
GlyArg: 2.152 ± 0.281
GlySer: 4.919 ± 2.109
GlyThr: 3.074 ± 1.126
GlyVal: 2.459 ± 0.668
GlyTrp: 0.922 ± 0.452
GlyTyr: 2.152 ± 0.702
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.23 ± 0.95
HisCys: 0.307 ± 0.16
HisAsp: 0.615 ± 0.32
HisGlu: 2.152 ± 0.529
HisPhe: 0.615 ± 0.285
HisGly: 0.922 ± 0.247
HisHis: 0.615 ± 0.78
HisIle: 0.922 ± 0.48
HisLys: 1.537 ± 0.8
HisLeu: 2.767 ± 0.712
HisMet: 0.615 ± 0.32
HisAsn: 1.537 ± 0.944
HisPro: 0.615 ± 0.475
HisGln: 0.615 ± 0.475
HisArg: 1.23 ± 0.64
HisSer: 1.23 ± 0.484
HisThr: 1.23 ± 0.303
HisVal: 0.307 ± 0.16
HisTrp: 0.0 ± 0.0
HisTyr: 0.307 ± 0.16
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.074 ± 1.126
IleCys: 0.307 ± 0.39
IleAsp: 6.148 ± 0.885
IleGlu: 7.07 ± 1.731
IlePhe: 1.844 ± 0.987
IleGly: 5.533 ± 3.5
IleHis: 1.23 ± 0.303
IleIle: 7.378 ± 3.317
IleLys: 5.533 ± 0.49
IleLeu: 5.226 ± 1.459
IleMet: 3.381 ± 1.148
IleAsn: 4.611 ± 2.36
IlePro: 3.689 ± 0.987
IleGln: 1.23 ± 0.64
IleArg: 5.226 ± 0.102
IleSer: 8.915 ± 3.307
IleThr: 6.763 ± 0.427
IleVal: 3.381 ± 0.857
IleTrp: 0.0 ± 0.0
IleTyr: 4.919 ± 1.213
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.844 ± 1.425
LysCys: 1.537 ± 0.508
LysAsp: 3.381 ± 0.857
LysGlu: 4.919 ± 0.292
LysPhe: 3.074 ± 0.703
LysGly: 4.611 ± 0.866
LysHis: 1.537 ± 0.418
LysIle: 8.3 ± 2.981
LysLys: 3.381 ± 0.753
LysLeu: 6.456 ± 2.074
LysMet: 2.767 ± 0.063
LysAsn: 5.533 ± 0.523
LysPro: 1.23 ± 0.484
LysGln: 2.152 ± 0.844
LysArg: 2.152 ± 0.786
LysSer: 4.919 ± 0.762
LysThr: 3.996 ± 2.136
LysVal: 3.689 ± 0.953
LysTrp: 0.0 ± 0.0
LysTyr: 4.304 ± 1.328
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.611 ± 1.372
LeuCys: 2.767 ± 1.037
LeuAsp: 3.689 ± 1.013
LeuGlu: 3.381 ± 1.031
LeuPhe: 2.152 ± 0.788
LeuGly: 4.919 ± 1.213
LeuHis: 1.537 ± 0.418
LeuIle: 4.611 ± 1.916
LeuLys: 4.304 ± 1.059
LeuLeu: 5.533 ± 1.963
LeuMet: 3.381 ± 0.301
LeuAsn: 5.226 ± 1.317
LeuPro: 2.459 ± 0.932
LeuGln: 2.459 ± 0.854
LeuArg: 2.459 ± 0.932
LeuSer: 7.378 ± 1.614
LeuThr: 2.767 ± 0.659
LeuVal: 3.996 ± 0.937
LeuTrp: 1.23 ± 0.64
LeuTyr: 5.841 ± 2.122
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.537 ± 0.913
MetCys: 0.307 ± 0.39
MetAsp: 3.074 ± 0.485
MetGlu: 2.459 ± 0.408
MetPhe: 1.537 ± 0.593
MetGly: 1.23 ± 0.95
MetHis: 0.922 ± 0.452
MetIle: 2.767 ± 1.509
MetLys: 2.767 ± 1.067
MetLeu: 2.767 ± 0.586
MetMet: 1.23 ± 0.303
MetAsn: 0.615 ± 0.285
MetPro: 2.459 ± 0.607
MetGln: 1.537 ± 0.562
MetArg: 1.537 ± 0.418
MetSer: 3.381 ± 0.371
MetThr: 1.844 ± 0.555
MetVal: 1.537 ± 0.886
MetTrp: 0.0 ± 0.0
MetTyr: 1.23 ± 0.303
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.381 ± 0.423
AsnCys: 1.844 ± 0.555
AsnAsp: 3.996 ± 1.012
AsnGlu: 3.074 ± 1.164
AsnPhe: 2.459 ± 0.668
AsnGly: 5.226 ± 1.951
AsnHis: 0.307 ± 0.16
AsnIle: 7.993 ± 2.505
AsnLys: 4.611 ± 1.127
AsnLeu: 4.304 ± 0.563
AsnMet: 2.767 ± 0.644
AsnAsn: 4.304 ± 1.571
AsnPro: 1.844 ± 0.788
AsnGln: 1.844 ± 0.555
AsnArg: 3.689 ± 1.11
AsnSer: 5.533 ± 2.051
AsnThr: 4.611 ± 0.391
AsnVal: 4.611 ± 0.539
AsnTrp: 0.922 ± 0.247
AsnTyr: 2.767 ± 1.008
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.537 ± 0.562
ProCys: 0.307 ± 0.546
ProAsp: 2.767 ± 0.552
ProGlu: 3.381 ± 0.584
ProPhe: 1.23 ± 0.64
ProGly: 1.537 ± 0.243
ProHis: 0.0 ± 0.0
ProIle: 2.767 ± 1.008
ProLys: 2.152 ± 0.702
ProLeu: 1.844 ± 0.939
ProMet: 1.537 ± 0.562
ProAsn: 2.459 ± 1.354
ProPro: 1.23 ± 0.354
ProGln: 1.23 ± 0.484
ProArg: 1.844 ± 0.555
ProSer: 2.152 ± 0.713
ProThr: 2.152 ± 0.795
ProVal: 1.537 ± 0.593
ProTrp: 0.307 ± 0.16
ProTyr: 0.615 ± 0.32
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.152 ± 1.334
GlnCys: 0.307 ± 0.16
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.537 ± 0.886
GlnPhe: 1.844 ± 0.904
GlnGly: 1.23 ± 0.64
GlnHis: 0.615 ± 0.32
GlnIle: 1.23 ± 0.95
GlnLys: 1.537 ± 0.418
GlnLeu: 2.767 ± 1.008
GlnMet: 0.922 ± 0.48
GlnAsn: 3.996 ± 0.583
GlnPro: 0.615 ± 0.32
GlnGln: 0.922 ± 0.48
GlnArg: 0.615 ± 0.32
GlnSer: 2.459 ± 0.408
GlnThr: 0.922 ± 0.247
GlnVal: 0.922 ± 0.247
GlnTrp: 0.307 ± 0.16
GlnTyr: 1.23 ± 0.484
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.844 ± 0.67
ArgCys: 0.922 ± 0.664
ArgAsp: 4.611 ± 0.728
ArgGlu: 3.996 ± 1.462
ArgPhe: 2.767 ± 0.586
ArgGly: 2.152 ± 0.529
ArgHis: 1.23 ± 0.303
ArgIle: 2.767 ± 0.74
ArgLys: 3.074 ± 0.703
ArgLeu: 2.767 ± 0.74
ArgMet: 1.537 ± 1.134
ArgAsn: 3.996 ± 1.255
ArgPro: 2.152 ± 0.923
ArgGln: 0.922 ± 1.01
ArgArg: 2.459 ± 0.607
ArgSer: 4.304 ± 0.481
ArgThr: 2.767 ± 1.074
ArgVal: 3.074 ± 0.703
ArgTrp: 0.615 ± 0.32
ArgTyr: 1.537 ± 0.508
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.074 ± 1.371
SerCys: 1.844 ± 1.83
SerAsp: 3.074 ± 0.835
SerGlu: 3.996 ± 0.897
SerPhe: 2.767 ± 0.063
SerGly: 4.611 ± 0.391
SerHis: 1.23 ± 0.354
SerIle: 6.456 ± 2.768
SerLys: 7.993 ± 0.601
SerLeu: 6.456 ± 1.802
SerMet: 2.767 ± 1.026
SerAsn: 6.148 ± 2.845
SerPro: 1.844 ± 0.555
SerGln: 1.537 ± 0.418
SerArg: 4.919 ± 0.782
SerSer: 5.841 ± 2.484
SerThr: 6.763 ± 1.316
SerVal: 5.226 ± 1.459
SerTrp: 0.922 ± 0.247
SerTyr: 2.459 ± 0.709
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.152 ± 1.384
ThrCys: 1.844 ± 0.939
ThrAsp: 3.996 ± 0.937
ThrGlu: 2.767 ± 0.659
ThrPhe: 3.689 ± 0.294
ThrGly: 3.689 ± 0.91
ThrHis: 0.922 ± 0.664
ThrIle: 5.841 ± 1.275
ThrLys: 3.074 ± 1.6
ThrLeu: 2.459 ± 0.854
ThrMet: 2.152 ± 0.788
ThrAsn: 3.996 ± 1.012
ThrPro: 1.844 ± 0.67
ThrGln: 0.307 ± 0.16
ThrArg: 2.459 ± 0.408
ThrSer: 5.533 ± 0.775
ThrThr: 3.689 ± 0.433
ThrVal: 3.074 ± 1.371
ThrTrp: 0.307 ± 0.16
ThrTyr: 2.767 ± 0.586
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.767 ± 1.357
ValCys: 1.537 ± 1.134
ValAsp: 3.381 ± 0.77
ValGlu: 4.611 ± 0.728
ValPhe: 1.23 ± 0.64
ValGly: 4.304 ± 2.038
ValHis: 2.459 ± 0.854
ValIle: 4.304 ± 1.074
ValLys: 4.304 ± 1.472
ValLeu: 4.304 ± 0.563
ValMet: 2.152 ± 0.702
ValAsn: 5.226 ± 0.434
ValPro: 1.844 ± 0.208
ValGln: 1.537 ± 0.886
ValArg: 3.074 ± 0.689
ValSer: 2.459 ± 1.505
ValThr: 2.459 ± 0.118
ValVal: 3.689 ± 0.842
ValTrp: 0.307 ± 0.16
ValTyr: 1.537 ± 0.508
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.615 ± 0.32
TrpCys: 0.0 ± 0.0
TrpAsp: 0.307 ± 0.16
TrpGlu: 0.615 ± 0.642
TrpPhe: 0.0 ± 0.0
TrpGly: 0.615 ± 0.32
TrpHis: 0.307 ± 0.16
TrpIle: 0.307 ± 0.16
TrpLys: 0.307 ± 0.16
TrpLeu: 0.307 ± 0.16
TrpMet: 0.0 ± 0.0
TrpAsn: 0.615 ± 0.32
TrpPro: 0.922 ± 0.48
TrpGln: 0.615 ± 0.475
TrpArg: 0.307 ± 0.16
TrpSer: 0.0 ± 0.0
TrpThr: 0.615 ± 0.32
TrpVal: 0.307 ± 0.16
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.307 ± 0.39
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.844 ± 0.433
TyrCys: 1.23 ± 0.303
TyrAsp: 3.996 ± 1.17
TyrGlu: 4.304 ± 1.404
TyrPhe: 0.615 ± 0.32
TyrGly: 2.767 ± 1.44
TyrHis: 0.922 ± 0.48
TyrIle: 4.304 ± 1.059
TyrLys: 4.919 ± 0.657
TyrLeu: 3.689 ± 0.987
TyrMet: 1.23 ± 0.64
TyrAsn: 3.689 ± 0.433
TyrPro: 1.23 ± 0.303
TyrGln: 2.459 ± 0.854
TyrArg: 2.152 ± 1.211
TyrSer: 2.767 ± 1.067
TyrThr: 3.074 ± 0.768
TyrVal: 1.537 ± 0.243
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.381 ± 0.827
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (3254 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
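A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual procedure: it assumes that "x100" means 100 resampling rounds, that whole proteins are resampled with replacement, and that the reported error is the standard deviation of the resampled frequencies. The function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each observed dipeptide (overlapping pairs)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of a dipeptide's frequency across bootstrap rounds.

    Each round draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(values)


if __name__ == "__main__":
    toy_proteome = ["MKILSAKI", "MSLLKIE", "MKKIAS"]  # made-up sequences
    print(bootstrap_error(toy_proteome, "KI"))
```

With only a few proteins, as in this proteome, the resampled sets repeat often, so error estimates obtained this way are coarse.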

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski