Amino acid dipepetide frequency for Tofla virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.578AlaAla: 3.578 ± 1.301
1.022AlaCys: 1.022 ± 0.464
2.044AlaAsp: 2.044 ± 0.576
3.748AlaGlu: 3.748 ± 1.789
2.044AlaPhe: 2.044 ± 0.021
2.215AlaGly: 2.215 ± 1.2
0.511AlaHis: 0.511 ± 0.431
3.066AlaIle: 3.066 ± 0.959
2.896AlaLys: 2.896 ± 2.158
5.622AlaLeu: 5.622 ± 0.721
1.193AlaMet: 1.193 ± 0.432
2.726AlaAsn: 2.726 ± 0.4
1.022AlaPro: 1.022 ± 0.924
2.385AlaGln: 2.385 ± 0.471
2.726AlaArg: 2.726 ± 0.181
3.066AlaSer: 3.066 ± 0.967
4.089AlaThr: 4.089 ± 1.86
4.429AlaVal: 4.429 ± 1.997
1.363AlaTrp: 1.363 ± 1.308
1.704AlaTyr: 1.704 ± 1.31
0.0AlaXaa: 0.0 ± 0.0
Cys
1.874CysAla: 1.874 ± 1.18
1.363CysCys: 1.363 ± 0.379
1.363CysAsp: 1.363 ± 0.421
1.704CysGlu: 1.704 ± 0.462
1.533CysPhe: 1.533 ± 0.432
0.852CysGly: 0.852 ± 0.511
0.511CysHis: 0.511 ± 0.144
2.385CysIle: 2.385 ± 0.705
1.874CysLys: 1.874 ± 0.971
3.237CysLeu: 3.237 ± 0.984
0.17CysMet: 0.17 ± 0.213
0.852CysAsn: 0.852 ± 0.511
1.874CysPro: 1.874 ± 1.23
0.681CysGln: 0.681 ± 0.383
1.533CysArg: 1.533 ± 0.82
2.385CysSer: 2.385 ± 0.865
2.215CysThr: 2.215 ± 1.649
1.874CysVal: 1.874 ± 0.116
0.511CysTrp: 0.511 ± 0.36
0.341CysTyr: 0.341 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
2.044AspAla: 2.044 ± 2.263
2.385AspCys: 2.385 ± 0.711
2.555AspAsp: 2.555 ± 0.788
3.407AspGlu: 3.407 ± 0.511
1.193AspPhe: 1.193 ± 0.276
3.918AspGly: 3.918 ± 1.306
1.193AspHis: 1.193 ± 0.438
4.6AspIle: 4.6 ± 0.151
3.918AspLys: 3.918 ± 1.034
5.792AspLeu: 5.792 ± 1.764
1.022AspMet: 1.022 ± 0.312
2.555AspAsn: 2.555 ± 0.54
2.215AspPro: 2.215 ± 0.358
1.022AspGln: 1.022 ± 0.288
3.407AspArg: 3.407 ± 0.947
4.259AspSer: 4.259 ± 0.337
2.215AspThr: 2.215 ± 0.232
3.407AspVal: 3.407 ± 0.511
0.681AspTrp: 0.681 ± 0.968
2.555AspTyr: 2.555 ± 0.72
0.0AspXaa: 0.0 ± 0.0
Glu
4.429GluAla: 4.429 ± 0.773
2.726GluCys: 2.726 ± 0.323
3.918GluAsp: 3.918 ± 1.715
6.644GluGlu: 6.644 ± 0.722
2.215GluPhe: 2.215 ± 0.716
2.896GluGly: 2.896 ± 0.291
1.193GluHis: 1.193 ± 0.849
3.237GluIle: 3.237 ± 0.799
4.77GluLys: 4.77 ± 0.848
8.348GluLeu: 8.348 ± 1.149
1.874GluMet: 1.874 ± 0.773
3.237GluAsn: 3.237 ± 0.27
1.533GluPro: 1.533 ± 0.622
1.874GluGln: 1.874 ± 0.971
3.748GluArg: 3.748 ± 0.513
4.94GluSer: 4.94 ± 0.344
5.281GluThr: 5.281 ± 1.174
6.474GluVal: 6.474 ± 0.636
0.511GluTrp: 0.511 ± 0.462
1.363GluTyr: 1.363 ± 0.553
0.0GluXaa: 0.0 ± 0.0
Phe
1.874PheAla: 1.874 ± 1.533
1.193PheCys: 1.193 ± 0.99
0.852PheAsp: 0.852 ± 0.264
3.237PheGlu: 3.237 ± 0.841
2.215PhePhe: 2.215 ± 1.125
1.874PheGly: 1.874 ± 0.116
0.681PheHis: 0.681 ± 0.548
1.704PheIle: 1.704 ± 0.527
1.704PheLys: 1.704 ± 0.657
5.281PheLeu: 5.281 ± 0.403
0.852PheMet: 0.852 ± 0.264
1.704PheAsn: 1.704 ± 0.462
0.852PhePro: 0.852 ± 0.423
1.363PheGln: 1.363 ± 0.751
1.363PheArg: 1.363 ± 0.296
3.748PheSer: 3.748 ± 1.201
3.407PheThr: 3.407 ± 0.924
1.363PheVal: 1.363 ± 0.421
0.17PheTrp: 0.17 ± 0.096
1.704PheTyr: 1.704 ± 0.958
0.0PheXaa: 0.0 ± 0.0
Gly
2.215GlyAla: 2.215 ± 1.355
1.704GlyCys: 1.704 ± 0.773
3.066GlyAsp: 3.066 ± 1.634
3.407GlyGlu: 3.407 ± 0.427
0.852GlyPhe: 0.852 ± 0.283
3.237GlyGly: 3.237 ± 0.443
1.022GlyHis: 1.022 ± 0.451
2.896GlyIle: 2.896 ± 0.825
5.451GlyLys: 5.451 ± 1.649
6.985GlyLeu: 6.985 ± 0.803
1.193GlyMet: 1.193 ± 0.276
1.874GlyAsn: 1.874 ± 0.61
2.726GlyPro: 2.726 ± 1.074
1.193GlyGln: 1.193 ± 0.278
3.066GlyArg: 3.066 ± 0.864
5.281GlySer: 5.281 ± 0.965
3.066GlyThr: 3.066 ± 0.382
3.237GlyVal: 3.237 ± 1.072
0.511GlyTrp: 0.511 ± 0.287
1.533GlyTyr: 1.533 ± 1.259
0.0GlyXaa: 0.0 ± 0.0
His
1.193HisAla: 1.193 ± 0.495
1.022HisCys: 1.022 ± 0.288
0.511HisAsp: 0.511 ± 0.36
0.681HisGlu: 0.681 ± 0.383
0.681HisPhe: 0.681 ± 0.189
1.704HisGly: 1.704 ± 0.779
0.17HisHis: 0.17 ± 0.096
1.193HisIle: 1.193 ± 0.278
1.533HisLys: 1.533 ± 1.291
2.896HisLeu: 2.896 ± 0.459
0.341HisMet: 0.341 ± 0.192
0.341HisAsn: 0.341 ± 0.192
1.193HisPro: 1.193 ± 0.276
1.022HisGln: 1.022 ± 0.288
1.193HisArg: 1.193 ± 0.323
2.555HisSer: 2.555 ± 0.697
1.022HisThr: 1.022 ± 0.845
1.704HisVal: 1.704 ± 0.779
0.341HisTrp: 0.341 ± 0.192
0.511HisTyr: 0.511 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
2.044IleAla: 2.044 ± 0.576
1.363IleCys: 1.363 ± 0.379
2.896IleAsp: 2.896 ± 0.415
3.237IleGlu: 3.237 ± 0.27
1.874IlePhe: 1.874 ± 0.61
1.533IleGly: 1.533 ± 0.343
1.193IleHis: 1.193 ± 0.323
2.726IleIle: 2.726 ± 0.4
4.77IleLys: 4.77 ± 1.041
5.451IleLeu: 5.451 ± 0.903
1.874IleMet: 1.874 ± 0.573
2.555IleAsn: 2.555 ± 0.267
1.533IlePro: 1.533 ± 0.432
2.896IleGln: 2.896 ± 0.473
2.726IleArg: 2.726 ± 0.917
4.259IleSer: 4.259 ± 0.768
4.259IleThr: 4.259 ± 0.71
4.259IleVal: 4.259 ± 1.647
0.681IleTrp: 0.681 ± 0.548
1.704IleTyr: 1.704 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
4.94LysAla: 4.94 ± 3.138
1.363LysCys: 1.363 ± 0.618
5.281LysAsp: 5.281 ± 1.174
5.792LysGlu: 5.792 ± 1.267
3.066LysPhe: 3.066 ± 0.967
4.77LysGly: 4.77 ± 1.998
1.874LysHis: 1.874 ± 0.116
3.578LysIle: 3.578 ± 0.433
5.281LysLys: 5.281 ± 0.895
8.859LysLeu: 8.859 ± 1.307
2.385LysMet: 2.385 ± 1.041
2.726LysAsn: 2.726 ± 0.843
1.874LysPro: 1.874 ± 0.738
3.748LysGln: 3.748 ± 1.271
4.089LysArg: 4.089 ± 0.333
3.578LysSer: 3.578 ± 0.596
2.896LysThr: 2.896 ± 0.896
4.77LysVal: 4.77 ± 1.089
0.681LysTrp: 0.681 ± 0.389
1.704LysTyr: 1.704 ± 0.462
0.0LysXaa: 0.0 ± 0.0
Leu
5.281LeuAla: 5.281 ± 1.932
2.896LeuCys: 2.896 ± 0.784
5.451LeuAsp: 5.451 ± 0.99
8.688LeuGlu: 8.688 ± 0.749
4.6LeuPhe: 4.6 ± 0.489
4.94LeuGly: 4.94 ± 0.438
3.578LeuHis: 3.578 ± 0.433
5.111LeuIle: 5.111 ± 0.801
9.37LeuLys: 9.37 ± 1.415
13.799LeuLeu: 13.799 ± 2.896
2.044LeuMet: 2.044 ± 0.714
6.303LeuAsn: 6.303 ± 0.914
3.918LeuPro: 3.918 ± 1.134
3.237LeuGln: 3.237 ± 1.133
5.451LeuArg: 5.451 ± 1.04
10.733LeuSer: 10.733 ± 2.275
7.155LeuThr: 7.155 ± 1.06
7.496LeuVal: 7.496 ± 0.442
0.511LeuTrp: 0.511 ± 0.287
3.237LeuTyr: 3.237 ± 0.886
0.0LeuXaa: 0.0 ± 0.0
Met
0.852MetAla: 0.852 ± 0.264
0.17MetCys: 0.17 ± 0.096
0.852MetAsp: 0.852 ± 0.329
1.193MetGlu: 1.193 ± 0.849
0.681MetPhe: 0.681 ± 0.383
1.533MetGly: 1.533 ± 0.449
1.022MetHis: 1.022 ± 0.924
1.193MetIle: 1.193 ± 0.843
1.363MetLys: 1.363 ± 0.379
3.918MetLeu: 3.918 ± 0.791
0.852MetMet: 0.852 ± 0.264
0.681MetAsn: 0.681 ± 0.189
0.17MetPro: 0.17 ± 0.096
1.193MetGln: 1.193 ± 0.278
1.363MetArg: 1.363 ± 0.618
3.066MetSer: 3.066 ± 0.477
0.681MetThr: 0.681 ± 0.309
0.681MetVal: 0.681 ± 0.389
0.17MetTrp: 0.17 ± 0.096
0.17MetTyr: 0.17 ± 0.096
0.0MetXaa: 0.0 ± 0.0
Asn
1.874AsnAla: 1.874 ± 1.237
1.874AsnCys: 1.874 ± 0.738
1.193AsnAsp: 1.193 ± 0.278
0.681AsnGlu: 0.681 ± 0.383
1.193AsnPhe: 1.193 ± 0.438
1.533AsnGly: 1.533 ± 0.449
0.511AsnHis: 0.511 ± 0.287
3.237AsnIle: 3.237 ± 1.369
3.066AsnLys: 3.066 ± 1.439
4.77AsnLeu: 4.77 ± 0.343
0.852AsnMet: 0.852 ± 0.283
1.533AsnAsn: 1.533 ± 0.622
2.896AsnPro: 2.896 ± 1.585
1.193AsnGln: 1.193 ± 0.438
2.896AsnArg: 2.896 ± 1.009
4.429AsnSer: 4.429 ± 1.208
2.555AsnThr: 2.555 ± 0.946
3.748AsnVal: 3.748 ± 1.221
1.022AsnTrp: 1.022 ± 0.288
1.193AsnTyr: 1.193 ± 0.794
0.0AsnXaa: 0.0 ± 0.0
Pro
2.385ProAla: 2.385 ± 0.645
0.852ProCys: 0.852 ± 1.002
3.237ProAsp: 3.237 ± 0.799
3.237ProGlu: 3.237 ± 0.889
1.533ProPhe: 1.533 ± 0.308
2.385ProGly: 2.385 ± 1.024
0.341ProHis: 0.341 ± 0.155
2.044ProIle: 2.044 ± 0.697
3.066ProLys: 3.066 ± 0.616
2.555ProLeu: 2.555 ± 0.66
0.341ProMet: 0.341 ± 0.192
1.022ProAsn: 1.022 ± 0.464
0.852ProPro: 0.852 ± 0.455
1.193ProGln: 1.193 ± 0.278
1.363ProArg: 1.363 ± 0.529
3.237ProSer: 3.237 ± 0.294
2.044ProThr: 2.044 ± 0.576
1.363ProVal: 1.363 ± 1.095
0.511ProTrp: 0.511 ± 0.462
0.681ProTyr: 0.681 ± 0.189
0.0ProXaa: 0.0 ± 0.0
Gln
2.726GlnAla: 2.726 ± 0.363
0.681GlnCys: 0.681 ± 0.309
0.852GlnAsp: 0.852 ± 0.329
2.215GlnGlu: 2.215 ± 0.604
1.193GlnPhe: 1.193 ± 0.849
2.215GlnGly: 2.215 ± 0.522
1.022GlnHis: 1.022 ± 0.288
1.874GlnIle: 1.874 ± 0.194
2.726GlnLys: 2.726 ± 0.937
3.578GlnLeu: 3.578 ± 1.106
1.022GlnMet: 1.022 ± 0.348
1.704GlnAsn: 1.704 ± 1.445
0.681GlnPro: 0.681 ± 0.309
2.044GlnGln: 2.044 ± 1.15
0.852GlnArg: 0.852 ± 0.264
3.578GlnSer: 3.578 ± 0.828
2.896GlnThr: 2.896 ± 1.009
2.555GlnVal: 2.555 ± 0.697
0.511GlnTrp: 0.511 ± 0.144
0.511GlnTyr: 0.511 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
1.363ArgAla: 1.363 ± 0.751
1.874ArgCys: 1.874 ± 0.509
3.918ArgAsp: 3.918 ± 1.715
2.896ArgGlu: 2.896 ± 1.45
2.215ArgPhe: 2.215 ± 0.637
2.215ArgGly: 2.215 ± 0.642
1.533ArgHis: 1.533 ± 0.449
1.704ArgIle: 1.704 ± 0.739
1.874ArgLys: 1.874 ± 0.661
6.985ArgLeu: 6.985 ± 1.638
2.044ArgMet: 2.044 ± 0.312
3.066ArgAsn: 3.066 ± 0.438
2.215ArgPro: 2.215 ± 1.125
2.726ArgGln: 2.726 ± 0.873
4.259ArgArg: 4.259 ± 0.534
4.429ArgSer: 4.429 ± 1.321
2.215ArgThr: 2.215 ± 0.703
3.407ArgVal: 3.407 ± 0.924
0.17ArgTrp: 0.17 ± 0.096
1.022ArgTyr: 1.022 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
4.259SerAla: 4.259 ± 0.625
2.726SerCys: 2.726 ± 1.74
5.622SerAsp: 5.622 ± 0.583
8.007SerGlu: 8.007 ± 0.714
3.237SerPhe: 3.237 ± 0.42
5.451SerGly: 5.451 ± 1.321
2.215SerHis: 2.215 ± 0.232
4.77SerIle: 4.77 ± 1.326
5.963SerLys: 5.963 ± 1.29
7.666SerLeu: 7.666 ± 1.34
0.852SerMet: 0.852 ± 0.455
3.066SerAsn: 3.066 ± 0.822
2.215SerPro: 2.215 ± 0.649
1.704SerGln: 1.704 ± 0.407
4.6SerArg: 4.6 ± 0.721
8.518SerSer: 8.518 ± 2.088
7.325SerThr: 7.325 ± 0.887
5.111SerVal: 5.111 ± 0.825
1.193SerTrp: 1.193 ± 0.664
2.385SerTyr: 2.385 ± 0.171
0.0SerXaa: 0.0 ± 0.0
Thr
3.407ThrAla: 3.407 ± 0.755
1.533ThrCys: 1.533 ± 1.353
4.089ThrAsp: 4.089 ± 0.559
4.94ThrGlu: 4.94 ± 0.237
2.726ThrPhe: 2.726 ± 0.566
5.111ThrGly: 5.111 ± 1.758
1.533ThrHis: 1.533 ± 0.714
2.726ThrIle: 2.726 ± 0.634
4.77ThrLys: 4.77 ± 0.343
6.474ThrLeu: 6.474 ± 1.755
0.852ThrMet: 0.852 ± 0.264
3.237ThrAsn: 3.237 ± 0.861
2.555ThrPro: 2.555 ± 1.007
1.704ThrGln: 1.704 ± 0.527
1.363ThrArg: 1.363 ± 0.529
4.259ThrSer: 4.259 ± 0.903
4.6ThrThr: 4.6 ± 1.405
4.94ThrVal: 4.94 ± 0.78
0.852ThrTrp: 0.852 ± 0.511
1.704ThrTyr: 1.704 ± 0.142
0.0ThrXaa: 0.0 ± 0.0
Val
3.578ValAla: 3.578 ± 1.879
1.193ValCys: 1.193 ± 0.323
4.77ValAsp: 4.77 ± 1.326
5.792ValGlu: 5.792 ± 1.568
2.044ValPhe: 2.044 ± 0.564
2.896ValGly: 2.896 ± 1.016
0.852ValHis: 0.852 ± 0.264
3.918ValIle: 3.918 ± 0.567
5.281ValLys: 5.281 ± 0.54
6.985ValLeu: 6.985 ± 1.67
1.022ValMet: 1.022 ± 0.288
1.533ValAsn: 1.533 ± 0.432
2.726ValPro: 2.726 ± 0.978
3.066ValGln: 3.066 ± 0.985
4.259ValArg: 4.259 ± 0.602
6.133ValSer: 6.133 ± 1.127
4.089ValThr: 4.089 ± 0.889
3.748ValVal: 3.748 ± 1.7
0.511ValTrp: 0.511 ± 0.462
1.193ValTyr: 1.193 ± 0.276
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.511TrpCys: 0.511 ± 0.462
0.681TrpAsp: 0.681 ± 0.389
0.511TrpGlu: 0.511 ± 0.36
0.852TrpPhe: 0.852 ± 0.903
1.704TrpGly: 1.704 ± 0.739
0.0TrpHis: 0.0 ± 0.0
0.511TrpIle: 0.511 ± 0.462
1.363TrpLys: 1.363 ± 0.296
1.704TrpLeu: 1.704 ± 0.407
0.341TrpMet: 0.341 ± 0.155
0.17TrpAsn: 0.17 ± 0.213
0.511TrpPro: 0.511 ± 0.639
0.17TrpGln: 0.17 ± 0.096
0.511TrpArg: 0.511 ± 0.462
1.533TrpSer: 1.533 ± 0.582
0.17TrpThr: 0.17 ± 0.096
0.17TrpVal: 0.17 ± 0.096
0.17TrpTrp: 0.17 ± 0.096
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.533TyrAla: 1.533 ± 0.62
0.681TyrCys: 0.681 ± 0.571
1.363TyrAsp: 1.363 ± 0.296
1.193TyrGlu: 1.193 ± 0.671
1.193TyrPhe: 1.193 ± 0.432
1.533TyrGly: 1.533 ± 0.622
0.852TyrHis: 0.852 ± 0.264
1.363TyrIle: 1.363 ± 0.379
2.215TyrLys: 2.215 ± 0.642
2.896TyrLeu: 2.896 ± 0.459
0.511TyrMet: 0.511 ± 0.431
1.363TyrAsn: 1.363 ± 0.296
0.681TyrPro: 0.681 ± 0.416
1.022TyrGln: 1.022 ± 0.288
1.363TyrArg: 1.363 ± 0.2
2.896TyrSer: 2.896 ± 0.634
1.193TyrThr: 1.193 ± 0.432
0.852TyrVal: 0.852 ± 0.283
0.511TyrTrp: 0.511 ± 0.462
1.022TyrTyr: 1.022 ± 0.595
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5871 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski