Amino acid dipepetide frequency for Mukawa virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.284AlaAla: 4.284 ± 1.163
2.142AlaCys: 2.142 ± 1.114
2.677AlaAsp: 2.677 ± 1.028
2.945AlaGlu: 2.945 ± 0.867
2.677AlaPhe: 2.677 ± 0.731
3.748AlaGly: 3.748 ± 2.444
2.142AlaHis: 2.142 ± 1.251
4.819AlaIle: 4.819 ± 1.046
5.087AlaLys: 5.087 ± 0.671
5.622AlaLeu: 5.622 ± 0.449
2.142AlaMet: 2.142 ± 1.208
0.535AlaAsn: 0.535 ± 0.59
1.874AlaPro: 1.874 ± 0.478
1.606AlaGln: 1.606 ± 0.47
5.355AlaArg: 5.355 ± 1.476
6.158AlaSer: 6.158 ± 0.806
2.945AlaThr: 2.945 ± 1.165
2.41AlaVal: 2.41 ± 0.756
0.535AlaTrp: 0.535 ± 0.472
1.071AlaTyr: 1.071 ± 0.681
0.0AlaXaa: 0.0 ± 0.0
Cys
2.677CysAla: 2.677 ± 1.446
0.803CysCys: 0.803 ± 0.21
0.803CysAsp: 0.803 ± 0.387
2.677CysGlu: 2.677 ± 1.367
2.142CysPhe: 2.142 ± 1.114
1.606CysGly: 1.606 ± 0.99
0.803CysHis: 0.803 ± 0.466
1.071CysIle: 1.071 ± 0.994
2.677CysLys: 2.677 ± 0.291
1.339CysLeu: 1.339 ± 1.05
1.071CysMet: 1.071 ± 0.381
0.803CysAsn: 0.803 ± 0.688
1.339CysPro: 1.339 ± 1.475
1.071CysGln: 1.071 ± 0.381
1.071CysArg: 1.071 ± 0.534
4.016CysSer: 4.016 ± 1.988
2.142CysThr: 2.142 ± 0.762
2.142CysVal: 2.142 ± 0.423
0.803CysTrp: 0.803 ± 0.339
0.268CysTyr: 0.268 ± 0.295
0.0CysXaa: 0.0 ± 0.0
Asp
3.213AspAla: 3.213 ± 1.029
1.606AspCys: 1.606 ± 0.939
4.016AspAsp: 4.016 ± 0.695
3.213AspGlu: 3.213 ± 0.628
2.677AspPhe: 2.677 ± 0.995
5.355AspGly: 5.355 ± 1.087
1.071AspHis: 1.071 ± 1.084
5.087AspIle: 5.087 ± 1.923
2.945AspLys: 2.945 ± 1.548
5.622AspLeu: 5.622 ± 2.859
1.071AspMet: 1.071 ± 0.38
2.142AspAsn: 2.142 ± 0.927
3.481AspPro: 3.481 ± 0.612
2.677AspGln: 2.677 ± 0.364
2.677AspArg: 2.677 ± 0.725
3.748AspSer: 3.748 ± 1.055
1.874AspThr: 1.874 ± 0.34
2.677AspVal: 2.677 ± 0.365
0.803AspTrp: 0.803 ± 0.339
0.268AspTyr: 0.268 ± 0.295
0.0AspXaa: 0.0 ± 0.0
Glu
4.819GluAla: 4.819 ± 0.479
1.339GluCys: 1.339 ± 1.245
4.819GluAsp: 4.819 ± 1.503
4.284GluGlu: 4.284 ± 1.232
2.945GluPhe: 2.945 ± 0.849
3.213GluGly: 3.213 ± 1.114
1.071GluHis: 1.071 ± 0.331
2.945GluIle: 2.945 ± 0.856
2.142GluLys: 2.142 ± 0.7
5.89GluLeu: 5.89 ± 2.173
0.803GluMet: 0.803 ± 0.21
2.142GluAsn: 2.142 ± 1.006
2.677GluPro: 2.677 ± 0.518
2.41GluGln: 2.41 ± 0.835
2.945GluArg: 2.945 ± 1.026
6.158GluSer: 6.158 ± 0.756
3.748GluThr: 3.748 ± 0.971
3.481GluVal: 3.481 ± 0.671
1.071GluTrp: 1.071 ± 0.381
1.874GluTyr: 1.874 ± 0.811
0.0GluXaa: 0.0 ± 0.0
Phe
2.142PheAla: 2.142 ± 0.375
1.339PheCys: 1.339 ± 0.617
3.213PheAsp: 3.213 ± 0.764
1.874PheGlu: 1.874 ± 0.811
1.606PhePhe: 1.606 ± 0.932
2.41PheGly: 2.41 ± 1.248
1.606PheHis: 1.606 ± 0.933
1.339PheIle: 1.339 ± 0.764
2.677PheLys: 2.677 ± 1.414
4.284PheLeu: 4.284 ± 0.942
1.071PheMet: 1.071 ± 0.257
1.874PheAsn: 1.874 ± 0.578
2.142PhePro: 2.142 ± 1.086
2.142PheGln: 2.142 ± 0.978
2.677PheArg: 2.677 ± 0.999
3.481PheSer: 3.481 ± 0.532
2.142PheThr: 2.142 ± 0.347
2.945PheVal: 2.945 ± 0.856
0.803PheTrp: 0.803 ± 0.21
0.535PheTyr: 0.535 ± 0.59
0.0PheXaa: 0.0 ± 0.0
Gly
3.213GlyAla: 3.213 ± 0.628
2.945GlyCys: 2.945 ± 1.034
1.874GlyAsp: 1.874 ± 0.546
4.016GlyGlu: 4.016 ± 0.754
4.016GlyPhe: 4.016 ± 0.493
4.552GlyGly: 4.552 ± 1.437
0.803GlyHis: 0.803 ± 0.21
2.677GlyIle: 2.677 ± 0.854
5.355GlyLys: 5.355 ± 3.122
5.622GlyLeu: 5.622 ± 0.78
1.874GlyMet: 1.874 ± 0.506
1.606GlyAsn: 1.606 ± 0.47
2.41GlyPro: 2.41 ± 0.724
2.41GlyGln: 2.41 ± 0.473
2.945GlyArg: 2.945 ± 0.486
8.032GlySer: 8.032 ± 3.216
3.213GlyThr: 3.213 ± 1.866
5.087GlyVal: 5.087 ± 1.107
0.535GlyTrp: 0.535 ± 0.472
0.803GlyTyr: 0.803 ± 0.466
0.0GlyXaa: 0.0 ± 0.0
His
0.803HisAla: 0.803 ± 0.21
1.071HisCys: 1.071 ± 0.381
1.339HisAsp: 1.339 ± 0.275
1.339HisGlu: 1.339 ± 0.362
1.339HisPhe: 1.339 ± 0.978
1.339HisGly: 1.339 ± 0.622
0.803HisHis: 0.803 ± 0.466
1.339HisIle: 1.339 ± 0.622
0.268HisLys: 0.268 ± 0.588
3.213HisLeu: 3.213 ± 1.483
0.803HisMet: 0.803 ± 0.466
0.268HisAsn: 0.268 ± 0.171
0.803HisPro: 0.803 ± 0.507
0.803HisGln: 0.803 ± 0.21
0.803HisArg: 0.803 ± 0.21
1.606HisSer: 1.606 ± 0.386
1.606HisThr: 1.606 ± 0.419
2.41HisVal: 2.41 ± 0.653
0.0HisTrp: 0.0 ± 0.0
1.874HisTyr: 1.874 ± 0.528
0.0HisXaa: 0.0 ± 0.0
Ile
3.748IleAla: 3.748 ± 1.246
1.339IleCys: 1.339 ± 0.362
2.945IleAsp: 2.945 ± 1.044
3.213IleGlu: 3.213 ± 0.471
2.41IlePhe: 2.41 ± 0.474
2.945IleGly: 2.945 ± 0.856
0.803IleHis: 0.803 ± 0.466
4.284IleIle: 4.284 ± 1.957
3.748IleLys: 3.748 ± 1.464
4.552IleLeu: 4.552 ± 1.258
1.071IleMet: 1.071 ± 0.612
1.874IleAsn: 1.874 ± 0.897
3.213IlePro: 3.213 ± 1.13
1.874IleGln: 1.874 ± 0.478
5.355IleArg: 5.355 ± 0.73
4.552IleSer: 4.552 ± 1.005
2.945IleThr: 2.945 ± 0.676
2.41IleVal: 2.41 ± 0.36
0.803IleTrp: 0.803 ± 0.339
1.606IleTyr: 1.606 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
5.355LysAla: 5.355 ± 1.244
2.677LysCys: 2.677 ± 1.907
3.213LysAsp: 3.213 ± 0.601
2.142LysGlu: 2.142 ± 0.864
2.41LysPhe: 2.41 ± 0.474
2.945LysGly: 2.945 ± 1.24
1.606LysHis: 1.606 ± 0.419
2.142LysIle: 2.142 ± 0.98
3.213LysLys: 3.213 ± 1.114
5.355LysLeu: 5.355 ± 1.813
2.677LysMet: 2.677 ± 0.365
2.41LysAsn: 2.41 ± 0.64
4.016LysPro: 4.016 ± 1.266
1.339LysGln: 1.339 ± 0.649
2.945LysArg: 2.945 ± 1.153
4.552LysSer: 4.552 ± 1.441
2.677LysThr: 2.677 ± 0.54
5.89LysVal: 5.89 ± 1.153
1.339LysTrp: 1.339 ± 0.275
1.606LysTyr: 1.606 ± 1.108
0.0LysXaa: 0.0 ± 0.0
Leu
5.087LeuAla: 5.087 ± 1.437
2.142LeuCys: 2.142 ± 0.423
6.158LeuAsp: 6.158 ± 1.121
4.284LeuGlu: 4.284 ± 1.399
4.284LeuPhe: 4.284 ± 1.612
6.426LeuGly: 6.426 ± 0.428
1.606LeuHis: 1.606 ± 0.419
5.355LeuIle: 5.355 ± 2.381
5.89LeuLys: 5.89 ± 1.826
7.764LeuLeu: 7.764 ± 2.357
2.142LeuMet: 2.142 ± 0.605
3.481LeuAsn: 3.481 ± 1.352
4.016LeuPro: 4.016 ± 0.634
4.016LeuGln: 4.016 ± 1.191
5.89LeuArg: 5.89 ± 0.725
10.174LeuSer: 10.174 ± 2.286
5.89LeuThr: 5.89 ± 1.498
5.087LeuVal: 5.087 ± 1.071
0.535LeuTrp: 0.535 ± 0.191
2.677LeuTyr: 2.677 ± 1.028
0.0LeuXaa: 0.0 ± 0.0
Met
2.677MetAla: 2.677 ± 0.364
0.0MetCys: 0.0 ± 0.0
3.213MetAsp: 3.213 ± 1.272
1.339MetGlu: 1.339 ± 0.418
1.339MetPhe: 1.339 ± 0.275
2.945MetGly: 2.945 ± 0.177
1.071MetHis: 1.071 ± 0.612
1.606MetIle: 1.606 ± 0.386
0.535MetLys: 0.535 ± 0.345
3.213MetLeu: 3.213 ± 0.906
1.071MetMet: 1.071 ± 0.612
1.071MetAsn: 1.071 ± 0.534
0.803MetPro: 0.803 ± 0.387
1.071MetGln: 1.071 ± 0.381
1.874MetArg: 1.874 ± 1.051
2.41MetSer: 2.41 ± 0.756
1.071MetThr: 1.071 ± 0.381
1.071MetVal: 1.071 ± 0.381
0.268MetTrp: 0.268 ± 0.295
0.803MetTyr: 0.803 ± 0.807
0.0MetXaa: 0.0 ± 0.0
Asn
1.606AsnAla: 1.606 ± 1.344
1.071AsnCys: 1.071 ± 0.534
0.535AsnAsp: 0.535 ± 0.342
2.142AsnGlu: 2.142 ± 0.794
2.677AsnPhe: 2.677 ± 0.758
1.606AsnGly: 1.606 ± 0.939
1.339AsnHis: 1.339 ± 0.836
1.606AsnIle: 1.606 ± 0.679
1.606AsnLys: 1.606 ± 0.738
4.284AsnLeu: 4.284 ± 1.033
0.803AsnMet: 0.803 ± 0.715
0.535AsnAsn: 0.535 ± 0.345
1.606AsnPro: 1.606 ± 0.419
1.606AsnGln: 1.606 ± 0.572
2.677AsnArg: 2.677 ± 0.758
2.677AsnSer: 2.677 ± 1.417
1.874AsnThr: 1.874 ± 0.272
1.071AsnVal: 1.071 ± 0.432
0.803AsnTrp: 0.803 ± 0.807
0.535AsnTyr: 0.535 ± 0.545
0.0AsnXaa: 0.0 ± 0.0
Pro
2.677ProAla: 2.677 ± 0.825
0.268ProCys: 0.268 ± 0.171
2.142ProAsp: 2.142 ± 0.605
4.284ProGlu: 4.284 ± 1.753
1.071ProPhe: 1.071 ± 0.257
3.481ProGly: 3.481 ± 0.671
0.803ProHis: 0.803 ± 0.885
3.481ProIle: 3.481 ± 0.95
2.677ProLys: 2.677 ± 0.725
3.748ProLeu: 3.748 ± 1.231
1.071ProMet: 1.071 ± 0.638
0.803ProAsn: 0.803 ± 0.387
1.874ProPro: 1.874 ± 0.834
1.606ProGln: 1.606 ± 0.646
2.677ProArg: 2.677 ± 0.654
3.748ProSer: 3.748 ± 0.578
2.945ProThr: 2.945 ± 0.676
2.41ProVal: 2.41 ± 1.057
1.339ProTrp: 1.339 ± 0.708
1.339ProTyr: 1.339 ± 0.854
0.0ProXaa: 0.0 ± 0.0
Gln
2.677GlnAla: 2.677 ± 0.731
1.874GlnCys: 1.874 ± 1.222
3.481GlnAsp: 3.481 ± 1.477
1.874GlnGlu: 1.874 ± 0.519
1.339GlnPhe: 1.339 ± 0.617
1.606GlnGly: 1.606 ± 0.828
1.071GlnHis: 1.071 ± 0.683
2.142GlnIle: 2.142 ± 0.347
1.874GlnLys: 1.874 ± 0.927
3.213GlnLeu: 3.213 ± 0.321
0.0GlnMet: 0.0 ± 0.0
1.606GlnAsn: 1.606 ± 0.419
1.339GlnPro: 1.339 ± 0.418
0.268GlnGln: 0.268 ± 0.171
2.142GlnArg: 2.142 ± 0.674
1.874GlnSer: 1.874 ± 0.528
2.677GlnThr: 2.677 ± 0.836
2.677GlnVal: 2.677 ± 0.605
0.0GlnTrp: 0.0 ± 0.0
1.339GlnTyr: 1.339 ± 0.484
0.0GlnXaa: 0.0 ± 0.0
Arg
4.016ArgAla: 4.016 ± 0.281
1.606ArgCys: 1.606 ± 0.649
4.284ArgAsp: 4.284 ± 1.246
4.016ArgGlu: 4.016 ± 1.191
1.606ArgPhe: 1.606 ± 1.025
3.481ArgGly: 3.481 ± 1.292
0.803ArgHis: 0.803 ± 0.512
2.677ArgIle: 2.677 ± 1.414
4.016ArgLys: 4.016 ± 0.754
5.355ArgLeu: 5.355 ± 1.904
2.142ArgMet: 2.142 ± 0.963
1.606ArgAsn: 1.606 ± 0.294
2.142ArgPro: 2.142 ± 0.55
1.606ArgGln: 1.606 ± 0.386
2.677ArgArg: 2.677 ± 1.014
6.693ArgSer: 6.693 ± 1.632
2.142ArgThr: 2.142 ± 0.674
5.89ArgVal: 5.89 ± 1.209
1.874ArgTrp: 1.874 ± 0.54
1.874ArgTyr: 1.874 ± 0.54
0.0ArgXaa: 0.0 ± 0.0
Ser
4.819SerAla: 4.819 ± 1.406
4.284SerCys: 4.284 ± 2.279
4.819SerAsp: 4.819 ± 1.564
6.158SerGlu: 6.158 ± 0.898
3.213SerPhe: 3.213 ± 1.043
7.229SerGly: 7.229 ± 1.797
2.41SerHis: 2.41 ± 0.327
3.748SerIle: 3.748 ± 1.228
6.961SerLys: 6.961 ± 1.079
6.693SerLeu: 6.693 ± 1.305
3.748SerMet: 3.748 ± 1.726
4.284SerAsn: 4.284 ± 1.025
4.284SerPro: 4.284 ± 1.612
2.677SerGln: 2.677 ± 0.707
4.552SerArg: 4.552 ± 0.968
8.032SerSer: 8.032 ± 1.451
4.284SerThr: 4.284 ± 0.79
4.284SerVal: 4.284 ± 0.942
3.213SerTrp: 3.213 ± 0.772
2.142SerTyr: 2.142 ± 0.663
0.0SerXaa: 0.0 ± 0.0
Thr
2.142ThrAla: 2.142 ± 0.788
2.41ThrCys: 2.41 ± 1.057
3.213ThrAsp: 3.213 ± 1.143
3.748ThrGlu: 3.748 ± 1.295
0.535ThrPhe: 0.535 ± 0.191
3.748ThrGly: 3.748 ± 1.855
0.535ThrHis: 0.535 ± 0.342
2.41ThrIle: 2.41 ± 0.474
3.748ThrLys: 3.748 ± 0.471
6.158ThrLeu: 6.158 ± 0.847
1.339ThrMet: 1.339 ± 0.649
2.945ThrAsn: 2.945 ± 1.165
1.874ThrPro: 1.874 ± 0.528
2.142ThrGln: 2.142 ± 0.663
3.748ThrArg: 3.748 ± 1.246
4.284ThrSer: 4.284 ± 1.129
2.41ThrThr: 2.41 ± 0.381
4.552ThrVal: 4.552 ± 0.666
0.803ThrTrp: 0.803 ± 0.554
1.071ThrTyr: 1.071 ± 0.626
0.0ThrXaa: 0.0 ± 0.0
Val
2.945ValAla: 2.945 ± 0.847
1.874ValCys: 1.874 ± 0.927
2.41ValAsp: 2.41 ± 1.343
4.552ValGlu: 4.552 ± 1.014
1.874ValPhe: 1.874 ± 0.54
2.41ValGly: 2.41 ± 0.64
2.41ValHis: 2.41 ± 0.381
5.355ValIle: 5.355 ± 1.122
3.481ValLys: 3.481 ± 1.099
6.961ValLeu: 6.961 ± 1.609
2.945ValMet: 2.945 ± 0.176
1.339ValAsn: 1.339 ± 0.491
2.41ValPro: 2.41 ± 1.321
2.945ValGln: 2.945 ± 1.485
4.016ValArg: 4.016 ± 1.148
6.693ValSer: 6.693 ± 1.618
2.677ValThr: 2.677 ± 0.731
2.677ValVal: 2.677 ± 0.291
1.071ValTrp: 1.071 ± 0.489
1.339ValTyr: 1.339 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
0.803TrpAla: 0.803 ± 0.21
0.535TrpCys: 0.535 ± 0.191
0.535TrpAsp: 0.535 ± 0.765
1.071TrpGlu: 1.071 ± 0.257
1.339TrpPhe: 1.339 ± 0.484
1.339TrpGly: 1.339 ± 0.649
0.0TrpHis: 0.0 ± 0.0
0.803TrpIle: 0.803 ± 0.21
0.535TrpLys: 0.535 ± 0.621
2.142TrpLeu: 2.142 ± 0.347
0.535TrpMet: 0.535 ± 0.765
0.268TrpAsn: 0.268 ± 0.171
0.803TrpPro: 0.803 ± 0.708
0.268TrpGln: 0.268 ± 0.382
1.606TrpArg: 1.606 ± 0.679
0.803TrpSer: 0.803 ± 0.512
1.874TrpThr: 1.874 ± 0.771
1.071TrpVal: 1.071 ± 0.381
0.268TrpTrp: 0.268 ± 0.171
0.803TrpTyr: 0.803 ± 0.21
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.071TyrAla: 1.071 ± 0.612
0.0TyrCys: 0.0 ± 0.0
0.535TyrAsp: 0.535 ± 0.191
1.606TyrGlu: 1.606 ± 0.379
0.803TyrPhe: 0.803 ± 0.387
1.606TyrGly: 1.606 ± 0.419
1.071TyrHis: 1.071 ± 0.612
0.803TyrIle: 0.803 ± 0.512
1.071TyrLys: 1.071 ± 0.681
1.874TyrLeu: 1.874 ± 0.34
0.803TyrMet: 0.803 ± 0.688
1.071TyrAsn: 1.071 ± 0.489
1.606TyrPro: 1.606 ± 0.939
0.535TyrGln: 0.535 ± 0.545
1.874TyrArg: 1.874 ± 0.34
2.142TyrSer: 2.142 ± 0.56
2.677TyrThr: 2.677 ± 0.968
2.142TyrVal: 2.142 ± 0.56
0.535TyrTrp: 0.535 ± 0.191
0.268TyrTyr: 0.268 ± 0.171
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3736 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski