Amino acid dipepetide frequency for Rukutama virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.956AlaAla: 4.956 ± 1.229
1.377AlaCys: 1.377 ± 0.613
2.478AlaAsp: 2.478 ± 0.37
3.579AlaGlu: 3.579 ± 0.657
1.377AlaPhe: 1.377 ± 0.576
3.579AlaGly: 3.579 ± 1.19
0.551AlaHis: 0.551 ± 0.487
3.304AlaIle: 3.304 ± 1.187
4.13AlaLys: 4.13 ± 0.706
4.405AlaLeu: 4.405 ± 0.294
1.101AlaMet: 1.101 ± 0.804
2.203AlaAsn: 2.203 ± 1.051
0.826AlaPro: 0.826 ± 0.496
0.826AlaGln: 0.826 ± 0.462
1.927AlaArg: 1.927 ± 0.84
5.782AlaSer: 5.782 ± 1.993
4.13AlaThr: 4.13 ± 1.266
2.478AlaVal: 2.478 ± 0.719
0.826AlaTrp: 0.826 ± 0.42
1.377AlaTyr: 1.377 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
0.551CysAla: 0.551 ± 0.554
0.551CysCys: 0.551 ± 0.183
1.377CysAsp: 1.377 ± 0.576
1.927CysGlu: 1.927 ± 1.537
2.203CysPhe: 2.203 ± 1.422
1.652CysGly: 1.652 ± 0.424
0.551CysHis: 0.551 ± 0.183
1.652CysIle: 1.652 ± 0.549
1.927CysLys: 1.927 ± 1.149
3.579CysLeu: 3.579 ± 0.668
0.826CysMet: 0.826 ± 0.437
1.101CysAsn: 1.101 ± 0.366
0.826CysPro: 0.826 ± 0.83
1.377CysGln: 1.377 ± 0.986
0.826CysArg: 0.826 ± 0.437
3.029CysSer: 3.029 ± 1.137
0.826CysThr: 0.826 ± 0.83
0.826CysVal: 0.826 ± 0.439
0.275CysTrp: 0.275 ± 0.277
0.826CysTyr: 0.826 ± 0.83
0.0CysXaa: 0.0 ± 0.0
Asp
3.304AspAla: 3.304 ± 0.686
2.203AspCys: 2.203 ± 2.215
2.753AspAsp: 2.753 ± 0.374
2.203AspGlu: 2.203 ± 1.029
3.304AspPhe: 3.304 ± 0.546
2.478AspGly: 2.478 ± 0.812
1.377AspHis: 1.377 ± 0.358
3.579AspIle: 3.579 ± 0.233
4.956AspLys: 4.956 ± 1.326
6.608AspLeu: 6.608 ± 1.517
1.101AspMet: 1.101 ± 0.85
2.203AspAsn: 2.203 ± 1.029
3.029AspPro: 3.029 ± 0.713
1.927AspGln: 1.927 ± 0.548
3.029AspArg: 3.029 ± 0.817
4.405AspSer: 4.405 ± 1.146
1.377AspThr: 1.377 ± 0.482
4.13AspVal: 4.13 ± 0.989
1.927AspTrp: 1.927 ± 1.79
2.478AspTyr: 2.478 ± 1.318
0.0AspXaa: 0.0 ± 0.0
Glu
3.304GluAla: 3.304 ± 0.63
2.203GluCys: 2.203 ± 1.051
4.681GluAsp: 4.681 ± 0.851
4.681GluGlu: 4.681 ± 1.114
4.681GluPhe: 4.681 ± 1.422
3.855GluGly: 3.855 ± 0.938
1.377GluHis: 1.377 ± 0.31
4.956GluIle: 4.956 ± 0.927
3.855GluLys: 3.855 ± 0.588
4.956GluLeu: 4.956 ± 1.381
0.826GluMet: 0.826 ± 0.764
1.101GluAsn: 1.101 ± 0.476
0.275GluPro: 0.275 ± 0.165
2.203GluGln: 2.203 ± 0.732
3.579GluArg: 3.579 ± 1.775
4.956GluSer: 4.956 ± 1.273
4.405GluThr: 4.405 ± 0.842
4.956GluVal: 4.956 ± 1.242
1.927GluTrp: 1.927 ± 0.548
1.377GluTyr: 1.377 ± 0.482
0.0GluXaa: 0.0 ± 0.0
Phe
2.478PheAla: 2.478 ± 1.167
2.203PheCys: 2.203 ± 0.732
3.029PheAsp: 3.029 ± 0.787
1.927PheGlu: 1.927 ± 0.469
2.753PhePhe: 2.753 ± 1.287
3.029PheGly: 3.029 ± 0.817
2.478PheHis: 2.478 ± 0.419
2.203PheIle: 2.203 ± 0.332
4.681PheLys: 4.681 ± 1.305
5.507PheLeu: 5.507 ± 2.142
0.551PheMet: 0.551 ± 0.331
3.304PheAsn: 3.304 ± 0.888
1.377PhePro: 1.377 ± 0.827
1.927PheGln: 1.927 ± 0.294
3.029PheArg: 3.029 ± 0.218
4.13PheSer: 4.13 ± 0.987
3.579PheThr: 3.579 ± 0.965
1.927PheVal: 1.927 ± 0.828
0.826PheTrp: 0.826 ± 0.496
1.652PheTyr: 1.652 ± 0.549
0.0PheXaa: 0.0 ± 0.0
Gly
2.478GlyAla: 2.478 ± 0.334
1.101GlyCys: 1.101 ± 0.341
2.753GlyAsp: 2.753 ± 0.72
3.579GlyGlu: 3.579 ± 1.334
3.304GlyPhe: 3.304 ± 1.245
1.927GlyGly: 1.927 ± 0.792
0.826GlyHis: 0.826 ± 0.212
3.855GlyIle: 3.855 ± 0.617
3.855GlyLys: 3.855 ± 0.586
4.956GlyLeu: 4.956 ± 1.419
1.377GlyMet: 1.377 ± 0.402
1.652GlyAsn: 1.652 ± 0.343
2.478GlyPro: 2.478 ± 0.871
1.927GlyGln: 1.927 ± 1.148
1.377GlyArg: 1.377 ± 0.36
4.405GlySer: 4.405 ± 0.842
2.753GlyThr: 2.753 ± 0.915
4.13GlyVal: 4.13 ± 0.731
1.652GlyTrp: 1.652 ± 1.148
1.377GlyTyr: 1.377 ± 0.634
0.0GlyXaa: 0.0 ± 0.0
His
0.826HisAla: 0.826 ± 0.439
1.377HisCys: 1.377 ± 0.613
1.101HisAsp: 1.101 ± 0.711
0.826HisGlu: 0.826 ± 0.42
1.377HisPhe: 1.377 ± 0.482
2.203HisGly: 2.203 ± 1.029
1.377HisHis: 1.377 ± 0.576
1.652HisIle: 1.652 ± 0.424
1.101HisLys: 1.101 ± 0.333
3.579HisLeu: 3.579 ± 1.404
0.826HisMet: 0.826 ± 1.094
0.551HisAsn: 0.551 ± 0.183
2.203HisPro: 2.203 ± 0.96
0.551HisGln: 0.551 ± 0.582
1.377HisArg: 1.377 ± 0.827
2.203HisSer: 2.203 ± 0.428
0.551HisThr: 0.551 ± 0.331
1.927HisVal: 1.927 ± 0.534
0.826HisTrp: 0.826 ± 0.496
0.826HisTyr: 0.826 ± 0.42
0.0HisXaa: 0.0 ± 0.0
Ile
2.478IleAla: 2.478 ± 0.783
2.753IleCys: 2.753 ± 1.227
4.405IleAsp: 4.405 ± 0.852
3.304IleGlu: 3.304 ± 0.686
0.551IlePhe: 0.551 ± 0.331
3.855IleGly: 3.855 ± 1.167
2.478IleHis: 2.478 ± 0.637
6.608IleIle: 6.608 ± 2.035
5.507IleLys: 5.507 ± 0.688
4.405IleLeu: 4.405 ± 0.674
1.652IleMet: 1.652 ± 0.676
3.029IleAsn: 3.029 ± 0.218
3.579IlePro: 3.579 ± 0.69
2.203IleGln: 2.203 ± 0.667
3.855IleArg: 3.855 ± 0.646
7.434IleSer: 7.434 ± 0.659
2.203IleThr: 2.203 ± 0.603
4.13IleVal: 4.13 ± 1.97
1.377IleTrp: 1.377 ± 0.639
2.203IleTyr: 2.203 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
5.507LysAla: 5.507 ± 2.362
1.377LysCys: 1.377 ± 0.986
3.579LysAsp: 3.579 ± 1.151
2.753LysGlu: 2.753 ± 1.453
3.855LysPhe: 3.855 ± 1.597
2.203LysGly: 2.203 ± 0.732
1.377LysHis: 1.377 ± 0.827
4.956LysIle: 4.956 ± 0.754
6.333LysLys: 6.333 ± 0.935
7.709LysLeu: 7.709 ± 0.866
2.753LysMet: 2.753 ± 0.699
1.927LysAsn: 1.927 ± 0.534
2.203LysPro: 2.203 ± 0.715
2.478LysGln: 2.478 ± 0.719
5.782LysArg: 5.782 ± 2.494
4.956LysSer: 4.956 ± 1.239
4.681LysThr: 4.681 ± 0.391
4.681LysVal: 4.681 ± 1.091
1.377LysTrp: 1.377 ± 0.634
2.203LysTyr: 2.203 ± 0.603
0.0LysXaa: 0.0 ± 0.0
Leu
5.231LeuAla: 5.231 ± 1.046
2.478LeuCys: 2.478 ± 0.419
7.434LeuAsp: 7.434 ± 1.493
6.608LeuGlu: 6.608 ± 1.616
5.507LeuPhe: 5.507 ± 1.624
6.333LeuGly: 6.333 ± 1.47
2.753LeuHis: 2.753 ± 0.345
5.782LeuIle: 5.782 ± 0.554
7.434LeuLys: 7.434 ± 1.385
9.637LeuLeu: 9.637 ± 1.083
3.029LeuMet: 3.029 ± 1.211
4.405LeuAsn: 4.405 ± 0.724
3.304LeuPro: 3.304 ± 0.51
3.579LeuGln: 3.579 ± 1.352
4.681LeuArg: 4.681 ± 1.009
7.434LeuSer: 7.434 ± 1.729
8.26LeuThr: 8.26 ± 2.699
4.13LeuVal: 4.13 ± 1.15
1.101LeuTrp: 1.101 ± 0.91
1.377LeuTyr: 1.377 ± 0.639
0.0LeuXaa: 0.0 ± 0.0
Met
1.101MetAla: 1.101 ± 1.163
0.0MetCys: 0.0 ± 0.0
1.377MetAsp: 1.377 ± 0.82
1.377MetGlu: 1.377 ± 0.451
1.927MetPhe: 1.927 ± 0.798
1.101MetGly: 1.101 ± 0.341
1.101MetHis: 1.101 ± 0.705
1.927MetIle: 1.927 ± 0.829
1.927MetLys: 1.927 ± 1.105
1.652MetLeu: 1.652 ± 0.839
0.826MetMet: 0.826 ± 0.487
0.275MetAsn: 0.275 ± 0.277
0.826MetPro: 0.826 ± 0.745
1.101MetGln: 1.101 ± 0.476
1.377MetArg: 1.377 ± 0.613
2.478MetSer: 2.478 ± 1.087
1.652MetThr: 1.652 ± 0.549
1.101MetVal: 1.101 ± 0.333
0.0MetTrp: 0.0 ± 0.0
0.275MetTyr: 0.275 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
1.101AsnAla: 1.101 ± 0.988
1.652AsnCys: 1.652 ± 0.549
1.101AsnAsp: 1.101 ± 0.661
2.203AsnGlu: 2.203 ± 0.682
2.753AsnPhe: 2.753 ± 0.72
0.826AsnGly: 0.826 ± 0.437
1.101AsnHis: 1.101 ± 0.333
1.377AsnIle: 1.377 ± 0.36
3.029AsnLys: 3.029 ± 1.351
3.855AsnLeu: 3.855 ± 1.293
1.927AsnMet: 1.927 ± 0.543
1.377AsnAsn: 1.377 ± 0.613
1.927AsnPro: 1.927 ± 0.831
1.652AsnGln: 1.652 ± 0.345
1.927AsnArg: 1.927 ± 0.798
4.13AsnSer: 4.13 ± 1.516
1.652AsnThr: 1.652 ± 1.454
2.753AsnVal: 2.753 ± 0.72
1.377AsnTrp: 1.377 ± 0.482
0.275AsnTyr: 0.275 ± 0.165
0.0AsnXaa: 0.0 ± 0.0
Pro
2.753ProAla: 2.753 ± 1.726
0.275ProCys: 0.275 ± 0.165
1.927ProAsp: 1.927 ± 0.512
4.681ProGlu: 4.681 ± 1.274
3.579ProPhe: 3.579 ± 0.536
2.478ProGly: 2.478 ± 1.002
0.826ProHis: 0.826 ± 0.212
1.652ProIle: 1.652 ± 0.992
2.203ProLys: 2.203 ± 0.603
3.304ProLeu: 3.304 ± 0.153
0.275ProMet: 0.275 ± 0.165
1.101ProAsn: 1.101 ± 0.91
1.652ProPro: 1.652 ± 0.748
0.826ProGln: 0.826 ± 0.42
1.652ProArg: 1.652 ± 0.424
3.855ProSer: 3.855 ± 0.994
1.652ProThr: 1.652 ± 0.424
1.377ProVal: 1.377 ± 1.184
0.551ProTrp: 0.551 ± 0.331
0.826ProTyr: 0.826 ± 0.437
0.0ProXaa: 0.0 ± 0.0
Gln
0.826GlnAla: 0.826 ± 0.439
1.652GlnCys: 1.652 ± 0.676
1.377GlnAsp: 1.377 ± 0.576
3.579GlnGlu: 3.579 ± 0.536
1.101GlnPhe: 1.101 ± 0.476
1.927GlnGly: 1.927 ± 0.829
1.377GlnHis: 1.377 ± 0.96
2.478GlnIle: 2.478 ± 0.347
3.579GlnLys: 3.579 ± 1.092
1.927GlnLeu: 1.927 ± 0.512
0.275GlnMet: 0.275 ± 0.165
0.826GlnAsn: 0.826 ± 0.42
1.377GlnPro: 1.377 ± 0.986
0.826GlnGln: 0.826 ± 0.487
1.101GlnArg: 1.101 ± 0.711
3.304GlnSer: 3.304 ± 0.849
1.377GlnThr: 1.377 ± 0.358
3.029GlnVal: 3.029 ± 1.267
0.275GlnTrp: 0.275 ± 0.165
0.826GlnTyr: 0.826 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
1.927ArgAla: 1.927 ± 0.534
1.652ArgCys: 1.652 ± 0.676
4.405ArgAsp: 4.405 ± 0.598
2.753ArgGlu: 2.753 ± 0.661
2.753ArgPhe: 2.753 ± 0.355
3.579ArgGly: 3.579 ± 1.271
1.101ArgHis: 1.101 ± 1.107
4.405ArgIle: 4.405 ± 1.206
2.478ArgLys: 2.478 ± 1.055
4.405ArgLeu: 4.405 ± 0.956
1.377ArgMet: 1.377 ± 0.458
2.478ArgAsn: 2.478 ± 1.318
1.652ArgPro: 1.652 ± 0.701
2.203ArgGln: 2.203 ± 0.96
2.203ArgArg: 2.203 ± 0.603
4.681ArgSer: 4.681 ± 1.259
2.203ArgThr: 2.203 ± 0.319
2.753ArgVal: 2.753 ± 0.335
0.826ArgTrp: 0.826 ± 0.212
1.652ArgTyr: 1.652 ± 0.839
0.0ArgXaa: 0.0 ± 0.0
Ser
4.956SerAla: 4.956 ± 1.466
1.377SerCys: 1.377 ± 1.384
4.13SerAsp: 4.13 ± 0.13
7.709SerGlu: 7.709 ± 1.909
2.478SerPhe: 2.478 ± 0.334
3.579SerGly: 3.579 ± 0.536
2.753SerHis: 2.753 ± 0.915
5.782SerIle: 5.782 ± 0.523
7.709SerLys: 7.709 ± 1.885
11.013SerLeu: 11.013 ± 1.336
2.203SerMet: 2.203 ± 0.332
3.029SerAsn: 3.029 ± 0.534
3.855SerPro: 3.855 ± 0.723
2.203SerGln: 2.203 ± 0.319
5.231SerArg: 5.231 ± 0.733
6.608SerSer: 6.608 ± 1.281
4.681SerThr: 4.681 ± 0.778
3.855SerVal: 3.855 ± 0.582
1.377SerTrp: 1.377 ± 0.451
1.377SerTyr: 1.377 ± 0.482
0.0SerXaa: 0.0 ± 0.0
Thr
3.304ThrAla: 3.304 ± 0.57
1.377ThrCys: 1.377 ± 0.792
3.855ThrAsp: 3.855 ± 1.565
3.029ThrGlu: 3.029 ± 0.218
3.855ThrPhe: 3.855 ± 0.588
4.405ThrGly: 4.405 ± 0.956
1.652ThrHis: 1.652 ± 0.361
3.304ThrIle: 3.304 ± 1.432
2.478ThrLys: 2.478 ± 0.334
6.333ThrLeu: 6.333 ± 2.475
1.101ThrMet: 1.101 ± 0.711
2.478ThrAsn: 2.478 ± 0.637
1.652ThrPro: 1.652 ± 0.361
1.101ThrGln: 1.101 ± 0.333
2.478ThrArg: 2.478 ± 1.259
3.855ThrSer: 3.855 ± 1.565
2.753ThrThr: 2.753 ± 0.903
4.681ThrVal: 4.681 ± 3.238
0.826ThrTrp: 0.826 ± 0.212
1.101ThrTyr: 1.101 ± 0.333
0.0ThrXaa: 0.0 ± 0.0
Val
2.753ValAla: 2.753 ± 0.355
0.826ValCys: 0.826 ± 0.439
4.405ValAsp: 4.405 ± 1.042
4.405ValGlu: 4.405 ± 1.348
3.029ValPhe: 3.029 ± 0.534
1.101ValGly: 1.101 ± 0.711
1.101ValHis: 1.101 ± 0.586
3.855ValIle: 3.855 ± 0.895
3.579ValLys: 3.579 ± 0.74
6.333ValLeu: 6.333 ± 0.536
0.275ValMet: 0.275 ± 0.277
2.478ValAsn: 2.478 ± 1.033
2.478ValPro: 2.478 ± 0.37
2.753ValGln: 2.753 ± 0.849
2.753ValArg: 2.753 ± 0.567
5.782ValSer: 5.782 ± 1.304
3.304ValThr: 3.304 ± 0.686
4.13ValVal: 4.13 ± 0.608
1.377ValTrp: 1.377 ± 0.613
1.377ValTyr: 1.377 ± 0.613
0.0ValXaa: 0.0 ± 0.0
Trp
1.377TrpAla: 1.377 ± 0.613
0.0TrpCys: 0.0 ± 0.0
0.826TrpAsp: 0.826 ± 0.745
1.377TrpGlu: 1.377 ± 1.112
1.101TrpPhe: 1.101 ± 0.661
1.377TrpGly: 1.377 ± 0.639
0.0TrpHis: 0.0 ± 0.0
2.753TrpIle: 2.753 ± 0.335
0.826TrpLys: 0.826 ± 0.437
2.478TrpLeu: 2.478 ± 0.419
0.826TrpMet: 0.826 ± 0.439
0.551TrpAsn: 0.551 ± 0.425
0.551TrpPro: 0.551 ± 0.979
0.275TrpGln: 0.275 ± 0.165
1.652TrpArg: 1.652 ± 0.345
1.101TrpSer: 1.101 ± 0.366
1.101TrpThr: 1.101 ± 0.333
0.551TrpVal: 0.551 ± 0.487
0.0TrpTrp: 0.0 ± 0.0
0.551TrpTyr: 0.551 ± 0.331
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.275TyrCys: 0.275 ± 0.277
1.377TyrAsp: 1.377 ± 0.613
1.652TyrGlu: 1.652 ± 0.992
1.101TyrPhe: 1.101 ± 0.366
0.551TyrGly: 0.551 ± 0.487
1.101TyrHis: 1.101 ± 0.661
1.927TyrIle: 1.927 ± 0.798
1.101TyrLys: 1.101 ± 0.341
3.855TyrLeu: 3.855 ± 0.374
0.0TyrMet: 0.0 ± 0.0
1.927TyrAsn: 1.927 ± 0.469
1.377TyrPro: 1.377 ± 0.639
1.101TyrGln: 1.101 ± 0.945
1.652TyrArg: 1.652 ± 1.148
1.377TyrSer: 1.377 ± 0.458
2.478TyrThr: 2.478 ± 0.707
0.551TyrVal: 0.551 ± 0.331
0.551TyrTrp: 0.551 ± 0.183
0.551TyrTyr: 0.551 ± 0.331
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3633 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski