Amino acid dipepetide frequency for Xiburema virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.812AlaAla: 1.812 ± 0.615
1.035AlaCys: 1.035 ± 0.559
2.847AlaAsp: 2.847 ± 0.995
2.329AlaGlu: 2.329 ± 1.583
1.294AlaPhe: 1.294 ± 0.713
2.329AlaGly: 2.329 ± 0.454
1.553AlaHis: 1.553 ± 0.532
2.329AlaIle: 2.329 ± 0.72
2.07AlaLys: 2.07 ± 0.855
5.176AlaLeu: 5.176 ± 0.871
1.553AlaMet: 1.553 ± 0.659
2.847AlaAsn: 2.847 ± 0.772
2.07AlaPro: 2.07 ± 1.798
1.812AlaGln: 1.812 ± 0.615
2.329AlaArg: 2.329 ± 1.905
1.294AlaSer: 1.294 ± 0.392
2.07AlaThr: 2.07 ± 0.637
2.07AlaVal: 2.07 ± 1.351
0.518AlaTrp: 0.518 ± 0.317
1.553AlaTyr: 1.553 ± 0.708
0.0AlaXaa: 0.0 ± 0.0
Cys
0.776CysAla: 0.776 ± 0.266
0.259CysCys: 0.259 ± 0.484
1.035CysAsp: 1.035 ± 0.634
1.035CysGlu: 1.035 ± 0.37
0.518CysPhe: 0.518 ± 0.264
0.776CysGly: 0.776 ± 0.475
0.776CysHis: 0.776 ± 0.356
1.035CysIle: 1.035 ± 0.343
1.035CysLys: 1.035 ± 0.474
2.588CysLeu: 2.588 ± 0.837
0.0CysMet: 0.0 ± 0.0
1.294CysAsn: 1.294 ± 0.85
1.553CysPro: 1.553 ± 0.876
0.259CysGln: 0.259 ± 0.344
0.776CysArg: 0.776 ± 0.802
1.812CysSer: 1.812 ± 0.564
1.294CysThr: 1.294 ± 1.03
0.259CysVal: 0.259 ± 0.158
0.259CysTrp: 0.259 ± 0.158
0.518CysTyr: 0.518 ± 0.531
0.0CysXaa: 0.0 ± 0.0
Asp
2.329AspAla: 2.329 ± 0.655
1.035AspCys: 1.035 ± 0.639
2.847AspAsp: 2.847 ± 0.976
2.847AspGlu: 2.847 ± 0.782
1.812AspPhe: 1.812 ± 0.512
3.882AspGly: 3.882 ± 1.053
1.035AspHis: 1.035 ± 0.37
5.435AspIle: 5.435 ± 1.325
2.847AspLys: 2.847 ± 1.081
3.882AspLeu: 3.882 ± 1.113
2.588AspMet: 2.588 ± 1.111
2.07AspAsn: 2.07 ± 0.612
4.141AspPro: 4.141 ± 0.805
2.588AspGln: 2.588 ± 0.59
1.812AspArg: 1.812 ± 0.518
2.847AspSer: 2.847 ± 0.893
1.294AspThr: 1.294 ± 0.473
2.588AspVal: 2.588 ± 1.318
2.329AspTrp: 2.329 ± 1.053
1.812AspTyr: 1.812 ± 0.935
0.0AspXaa: 0.0 ± 0.0
Glu
2.847GluAla: 2.847 ± 0.589
0.0GluCys: 0.0 ± 0.0
3.364GluAsp: 3.364 ± 1.269
6.729GluGlu: 6.729 ± 1.959
2.329GluPhe: 2.329 ± 1.246
3.364GluGly: 3.364 ± 1.301
1.812GluHis: 1.812 ± 0.802
4.917GluIle: 4.917 ± 1.181
6.211GluLys: 6.211 ± 1.33
6.211GluLeu: 6.211 ± 1.003
2.588GluMet: 2.588 ± 0.627
2.329GluAsn: 2.329 ± 1.297
2.07GluPro: 2.07 ± 0.698
2.07GluGln: 2.07 ± 0.82
3.106GluArg: 3.106 ± 0.614
4.658GluSer: 4.658 ± 1.169
4.917GluThr: 4.917 ± 0.62
2.847GluVal: 2.847 ± 1.145
1.035GluTrp: 1.035 ± 1.112
1.035GluTyr: 1.035 ± 0.349
0.0GluXaa: 0.0 ± 0.0
Phe
1.294PheAla: 1.294 ± 0.678
1.035PheCys: 1.035 ± 0.712
1.294PheAsp: 1.294 ± 0.333
2.847PheGlu: 2.847 ± 0.592
2.588PhePhe: 2.588 ± 0.618
1.294PheGly: 1.294 ± 0.749
1.294PheHis: 1.294 ± 0.571
2.847PheIle: 2.847 ± 1.374
2.329PheLys: 2.329 ± 1.014
3.364PheLeu: 3.364 ± 0.728
0.518PheMet: 0.518 ± 0.464
1.553PheAsn: 1.553 ± 1.033
1.294PhePro: 1.294 ± 0.683
2.847PheGln: 2.847 ± 0.585
2.07PheArg: 2.07 ± 0.963
2.329PheSer: 2.329 ± 0.859
0.776PheThr: 0.776 ± 0.418
4.141PheVal: 4.141 ± 1.064
0.776PheTrp: 0.776 ± 0.475
1.035PheTyr: 1.035 ± 0.333
0.0PheXaa: 0.0 ± 0.0
Gly
2.07GlyAla: 2.07 ± 0.509
1.294GlyCys: 1.294 ± 0.609
3.106GlyAsp: 3.106 ± 0.762
3.882GlyGlu: 3.882 ± 2.274
2.847GlyPhe: 2.847 ± 1.333
2.588GlyGly: 2.588 ± 0.707
1.812GlyHis: 1.812 ± 0.477
7.764GlyIle: 7.764 ± 1.525
3.623GlyLys: 3.623 ± 1.324
7.505GlyLeu: 7.505 ± 2.291
2.07GlyMet: 2.07 ± 0.74
3.882GlyAsn: 3.882 ± 1.11
2.07GlyPro: 2.07 ± 1.21
2.07GlyGln: 2.07 ± 0.338
1.812GlyArg: 1.812 ± 0.495
5.952GlySer: 5.952 ± 0.74
2.847GlyThr: 2.847 ± 0.976
2.847GlyVal: 2.847 ± 0.881
0.518GlyTrp: 0.518 ± 0.264
2.07GlyTyr: 2.07 ± 0.482
0.0GlyXaa: 0.0 ± 0.0
His
1.294HisAla: 1.294 ± 0.638
0.259HisCys: 0.259 ± 0.158
0.776HisAsp: 0.776 ± 0.593
1.553HisGlu: 1.553 ± 1.326
1.035HisPhe: 1.035 ± 0.634
1.035HisGly: 1.035 ± 0.343
0.259HisHis: 0.259 ± 0.344
2.329HisIle: 2.329 ± 0.332
2.329HisLys: 2.329 ± 0.623
3.364HisLeu: 3.364 ± 0.924
0.518HisMet: 0.518 ± 0.319
0.776HisAsn: 0.776 ± 0.593
2.588HisPro: 2.588 ± 1.37
0.518HisGln: 0.518 ± 0.464
0.776HisArg: 0.776 ± 0.401
0.776HisSer: 0.776 ± 0.356
1.035HisThr: 1.035 ± 0.527
0.518HisVal: 0.518 ± 0.317
0.259HisTrp: 0.259 ± 0.158
0.518HisTyr: 0.518 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
3.106IleAla: 3.106 ± 0.751
2.847IleCys: 2.847 ± 0.841
3.364IleAsp: 3.364 ± 0.661
5.176IleGlu: 5.176 ± 1.115
2.847IlePhe: 2.847 ± 0.577
5.952IleGly: 5.952 ± 1.13
1.553IleHis: 1.553 ± 0.532
6.988IleIle: 6.988 ± 2.39
5.952IleLys: 5.952 ± 1.303
6.729IleLeu: 6.729 ± 0.938
1.553IleMet: 1.553 ± 0.736
4.4IleAsn: 4.4 ± 1.603
5.435IlePro: 5.435 ± 0.47
3.364IleGln: 3.364 ± 0.897
5.435IleArg: 5.435 ± 1.443
5.694IleSer: 5.694 ± 1.316
4.141IleThr: 4.141 ± 0.85
2.329IleVal: 2.329 ± 1.249
1.035IleTrp: 1.035 ± 0.761
1.812IleTyr: 1.812 ± 0.935
0.0IleXaa: 0.0 ± 0.0
Lys
2.07LysAla: 2.07 ± 0.665
1.553LysCys: 1.553 ± 0.746
3.882LysAsp: 3.882 ± 0.718
4.141LysGlu: 4.141 ± 0.649
2.329LysPhe: 2.329 ± 0.979
6.211LysGly: 6.211 ± 0.968
1.553LysHis: 1.553 ± 0.285
4.4LysIle: 4.4 ± 0.981
5.694LysLys: 5.694 ± 1.024
7.246LysLeu: 7.246 ± 1.381
2.07LysMet: 2.07 ± 1.001
2.329LysAsn: 2.329 ± 0.762
3.882LysPro: 3.882 ± 2.274
1.553LysGln: 1.553 ± 0.56
3.623LysArg: 3.623 ± 1.121
5.176LysSer: 5.176 ± 0.886
2.588LysThr: 2.588 ± 0.924
4.141LysVal: 4.141 ± 0.805
1.812LysTrp: 1.812 ± 0.837
1.553LysTyr: 1.553 ± 0.879
0.0LysXaa: 0.0 ± 0.0
Leu
4.658LeuAla: 4.658 ± 1.117
1.553LeuCys: 1.553 ± 0.946
5.176LeuAsp: 5.176 ± 0.686
5.952LeuGlu: 5.952 ± 1.008
2.588LeuPhe: 2.588 ± 0.689
5.694LeuGly: 5.694 ± 0.788
1.294LeuHis: 1.294 ± 0.669
9.834LeuIle: 9.834 ± 1.937
6.729LeuLys: 6.729 ± 0.891
6.47LeuLeu: 6.47 ± 1.7
3.882LeuMet: 3.882 ± 1.054
8.799LeuAsn: 8.799 ± 1.88
3.882LeuPro: 3.882 ± 0.65
2.588LeuGln: 2.588 ± 0.666
5.952LeuArg: 5.952 ± 1.238
6.988LeuSer: 6.988 ± 1.02
6.211LeuThr: 6.211 ± 1.343
4.4LeuVal: 4.4 ± 1.014
1.294LeuTrp: 1.294 ± 0.663
4.4LeuTyr: 4.4 ± 1.383
0.0LeuXaa: 0.0 ± 0.0
Met
1.553MetAla: 1.553 ± 1.158
1.035MetCys: 1.035 ± 0.634
2.07MetAsp: 2.07 ± 0.697
2.588MetGlu: 2.588 ± 0.579
2.329MetPhe: 2.329 ± 1.286
2.588MetGly: 2.588 ± 0.868
0.518MetHis: 0.518 ± 0.464
2.329MetIle: 2.329 ± 0.563
1.812MetLys: 1.812 ± 0.27
3.623MetLeu: 3.623 ± 0.919
1.035MetMet: 1.035 ± 0.527
1.553MetAsn: 1.553 ± 0.988
0.259MetPro: 0.259 ± 0.358
0.518MetGln: 0.518 ± 0.521
2.329MetArg: 2.329 ± 1.132
1.553MetSer: 1.553 ± 0.711
1.553MetThr: 1.553 ± 0.693
1.812MetVal: 1.812 ± 0.853
1.035MetTrp: 1.035 ± 0.675
1.812MetTyr: 1.812 ± 0.817
0.0MetXaa: 0.0 ± 0.0
Asn
2.07AsnAla: 2.07 ± 1.016
0.0AsnCys: 0.0 ± 0.0
2.847AsnAsp: 2.847 ± 1.135
1.294AsnGlu: 1.294 ± 0.636
2.588AsnPhe: 2.588 ± 0.494
2.329AsnGly: 2.329 ± 1.021
1.035AsnHis: 1.035 ± 0.349
2.847AsnIle: 2.847 ± 1.119
4.658AsnLys: 4.658 ± 1.127
6.988AsnLeu: 6.988 ± 1.533
3.106AsnMet: 3.106 ± 0.868
2.588AsnAsn: 2.588 ± 0.825
3.623AsnPro: 3.623 ± 0.872
3.106AsnGln: 3.106 ± 0.811
2.588AsnArg: 2.588 ± 0.671
1.812AsnSer: 1.812 ± 0.417
3.623AsnThr: 3.623 ± 0.4
2.07AsnVal: 2.07 ± 0.523
1.553AsnTrp: 1.553 ± 0.508
3.364AsnTyr: 3.364 ± 0.746
0.0AsnXaa: 0.0 ± 0.0
Pro
1.812ProAla: 1.812 ± 0.776
0.518ProCys: 0.518 ± 0.317
2.847ProAsp: 2.847 ± 0.828
3.364ProGlu: 3.364 ± 1.761
0.259ProPhe: 0.259 ± 0.485
1.812ProGly: 1.812 ± 1.338
1.294ProHis: 1.294 ± 0.532
4.658ProIle: 4.658 ± 1.389
2.847ProLys: 2.847 ± 0.926
4.917ProLeu: 4.917 ± 0.909
1.553ProMet: 1.553 ± 0.345
3.623ProAsn: 3.623 ± 1.013
2.588ProPro: 2.588 ± 1.083
2.847ProGln: 2.847 ± 1.204
1.812ProArg: 1.812 ± 1.023
5.694ProSer: 5.694 ± 0.621
2.847ProThr: 2.847 ± 1.337
3.106ProVal: 3.106 ± 1.022
0.518ProTrp: 0.518 ± 0.317
2.329ProTyr: 2.329 ± 0.549
0.0ProXaa: 0.0 ± 0.0
Gln
2.07GlnAla: 2.07 ± 0.509
1.035GlnCys: 1.035 ± 0.665
1.294GlnAsp: 1.294 ± 0.529
2.588GlnGlu: 2.588 ± 0.519
1.294GlnPhe: 1.294 ± 0.473
2.847GlnGly: 2.847 ± 1.209
0.776GlnHis: 0.776 ± 0.401
3.623GlnIle: 3.623 ± 0.617
2.07GlnLys: 2.07 ± 0.854
4.658GlnLeu: 4.658 ± 1.217
1.035GlnMet: 1.035 ± 0.736
1.553GlnAsn: 1.553 ± 0.39
0.518GlnPro: 0.518 ± 0.339
0.776GlnGln: 0.776 ± 0.593
1.812GlnArg: 1.812 ± 0.558
3.106GlnSer: 3.106 ± 0.857
1.812GlnThr: 1.812 ± 0.485
1.553GlnVal: 1.553 ± 0.669
0.259GlnTrp: 0.259 ± 0.158
1.294GlnTyr: 1.294 ± 0.874
0.0GlnXaa: 0.0 ± 0.0
Arg
2.588ArgAla: 2.588 ± 0.622
1.553ArgCys: 1.553 ± 0.819
2.588ArgAsp: 2.588 ± 0.437
2.588ArgGlu: 2.588 ± 0.378
3.106ArgPhe: 3.106 ± 0.78
4.658ArgGly: 4.658 ± 1.66
1.553ArgHis: 1.553 ± 0.459
3.106ArgIle: 3.106 ± 1.092
1.812ArgLys: 1.812 ± 0.564
3.623ArgLeu: 3.623 ± 1.267
1.553ArgMet: 1.553 ± 0.765
2.329ArgAsn: 2.329 ± 0.582
1.294ArgPro: 1.294 ± 0.505
1.812ArgGln: 1.812 ± 0.65
3.882ArgArg: 3.882 ± 0.969
3.623ArgSer: 3.623 ± 0.737
3.364ArgThr: 3.364 ± 1.045
2.329ArgVal: 2.329 ± 0.762
0.776ArgTrp: 0.776 ± 0.486
2.329ArgTyr: 2.329 ± 0.75
0.0ArgXaa: 0.0 ± 0.0
Ser
4.141SerAla: 4.141 ± 1.052
0.776SerCys: 0.776 ± 0.497
4.658SerAsp: 4.658 ± 1.901
4.917SerGlu: 4.917 ± 1.397
2.329SerPhe: 2.329 ± 0.798
5.435SerGly: 5.435 ± 1.033
0.518SerHis: 0.518 ± 0.264
5.435SerIle: 5.435 ± 1.279
4.658SerLys: 4.658 ± 2.284
5.952SerLeu: 5.952 ± 1.464
2.847SerMet: 2.847 ± 0.475
2.847SerAsn: 2.847 ± 0.811
4.917SerPro: 4.917 ± 0.862
2.588SerGln: 2.588 ± 0.685
4.141SerArg: 4.141 ± 1.014
7.505SerSer: 7.505 ± 1.791
3.364SerThr: 3.364 ± 1.104
4.141SerVal: 4.141 ± 2.05
1.553SerTrp: 1.553 ± 0.801
1.294SerTyr: 1.294 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
1.035ThrAla: 1.035 ± 0.639
0.259ThrCys: 0.259 ± 0.358
2.847ThrAsp: 2.847 ± 1.081
3.623ThrGlu: 3.623 ± 1.019
1.553ThrPhe: 1.553 ± 0.879
2.847ThrGly: 2.847 ± 0.724
1.553ThrHis: 1.553 ± 0.56
2.588ThrIle: 2.588 ± 1.316
3.882ThrLys: 3.882 ± 0.718
4.4ThrLeu: 4.4 ± 1.487
2.07ThrMet: 2.07 ± 0.637
2.588ThrAsn: 2.588 ± 0.671
2.847ThrPro: 2.847 ± 0.589
2.329ThrGln: 2.329 ± 1.255
2.07ThrArg: 2.07 ± 0.897
4.4ThrSer: 4.4 ± 0.789
3.106ThrThr: 3.106 ± 1.193
4.4ThrVal: 4.4 ± 1.333
1.294ThrTrp: 1.294 ± 0.473
2.847ThrTyr: 2.847 ± 1.106
0.0ThrXaa: 0.0 ± 0.0
Val
1.553ValAla: 1.553 ± 0.982
1.553ValCys: 1.553 ± 0.946
3.106ValAsp: 3.106 ± 1.498
3.364ValGlu: 3.364 ± 1.326
2.07ValPhe: 2.07 ± 0.804
3.623ValGly: 3.623 ± 0.935
1.294ValHis: 1.294 ± 0.995
3.364ValIle: 3.364 ± 0.927
2.588ValLys: 2.588 ± 1.185
5.176ValLeu: 5.176 ± 1.625
1.553ValMet: 1.553 ± 0.597
3.106ValAsn: 3.106 ± 1.225
2.847ValPro: 2.847 ± 0.814
1.035ValGln: 1.035 ± 0.444
1.812ValArg: 1.812 ± 0.495
3.623ValSer: 3.623 ± 0.702
3.106ValThr: 3.106 ± 0.954
2.588ValVal: 2.588 ± 0.818
0.259ValTrp: 0.259 ± 0.158
2.588ValTyr: 2.588 ± 0.81
0.0ValXaa: 0.0 ± 0.0
Trp
0.776TrpAla: 0.776 ± 0.593
0.0TrpCys: 0.0 ± 0.0
0.259TrpAsp: 0.259 ± 0.386
1.812TrpGlu: 1.812 ± 0.852
0.518TrpPhe: 0.518 ± 0.317
1.294TrpGly: 1.294 ± 0.631
0.518TrpHis: 0.518 ± 0.317
2.847TrpIle: 2.847 ± 0.61
1.035TrpLys: 1.035 ± 0.343
1.553TrpLeu: 1.553 ± 0.415
0.259TrpMet: 0.259 ± 0.358
0.776TrpAsn: 0.776 ± 0.266
0.776TrpPro: 0.776 ± 0.419
0.0TrpGln: 0.0 ± 0.0
0.259TrpArg: 0.259 ± 0.484
1.812TrpSer: 1.812 ± 0.6
1.812TrpThr: 1.812 ± 1.126
0.776TrpVal: 0.776 ± 0.435
0.259TrpTrp: 0.259 ± 0.344
0.518TrpTyr: 0.518 ± 0.51
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.035TyrAla: 1.035 ± 0.349
0.259TyrCys: 0.259 ± 0.158
1.812TyrAsp: 1.812 ± 0.817
1.812TyrGlu: 1.812 ± 0.58
1.294TyrPhe: 1.294 ± 0.473
2.07TyrGly: 2.07 ± 0.855
0.518TyrHis: 0.518 ± 0.264
1.294TyrIle: 1.294 ± 0.473
3.623TyrLys: 3.623 ± 0.624
4.917TyrLeu: 4.917 ± 1.161
1.294TyrMet: 1.294 ± 0.643
2.847TyrAsn: 2.847 ± 1.067
2.588TyrPro: 2.588 ± 0.853
1.294TyrGln: 1.294 ± 0.473
2.07TyrArg: 2.07 ± 0.976
3.364TyrSer: 3.364 ± 1.06
0.776TyrThr: 0.776 ± 0.401
1.294TyrVal: 1.294 ± 0.657
0.518TyrTrp: 0.518 ± 0.604
1.294TyrTyr: 1.294 ± 0.635
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3865 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski