Amino acid dipepetide frequency for Bacillus virus Finn

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.03AlaAla: 2.03 ± 0.447
0.406AlaCys: 0.406 ± 0.163
3.451AlaAsp: 3.451 ± 0.559
3.789AlaGlu: 3.789 ± 0.492
3.654AlaPhe: 3.654 ± 0.486
4.128AlaGly: 4.128 ± 0.64
1.218AlaHis: 1.218 ± 0.322
4.94AlaIle: 4.94 ± 0.772
5.684AlaLys: 5.684 ± 0.801
6.022AlaLeu: 6.022 ± 0.877
1.827AlaMet: 1.827 ± 0.535
2.571AlaAsn: 2.571 ± 0.402
2.233AlaPro: 2.233 ± 0.411
1.692AlaGln: 1.692 ± 0.362
2.977AlaArg: 2.977 ± 0.515
3.18AlaSer: 3.18 ± 0.464
3.451AlaThr: 3.451 ± 0.578
4.466AlaVal: 4.466 ± 0.751
0.947AlaTrp: 0.947 ± 0.256
2.707AlaTyr: 2.707 ± 0.4
0.0AlaXaa: 0.0 ± 0.0
Cys
0.609CysAla: 0.609 ± 0.229
0.068CysCys: 0.068 ± 0.066
0.474CysAsp: 0.474 ± 0.184
0.609CysGlu: 0.609 ± 0.183
0.744CysPhe: 0.744 ± 0.235
0.744CysGly: 0.744 ± 0.21
0.609CysHis: 0.609 ± 0.228
0.271CysIle: 0.271 ± 0.105
1.083CysLys: 1.083 ± 0.273
1.015CysLeu: 1.015 ± 0.28
0.135CysMet: 0.135 ± 0.085
0.338CysAsn: 0.338 ± 0.158
0.203CysPro: 0.203 ± 0.11
0.271CysGln: 0.271 ± 0.123
0.203CysArg: 0.203 ± 0.113
0.474CysSer: 0.474 ± 0.171
0.677CysThr: 0.677 ± 0.2
0.271CysVal: 0.271 ± 0.12
0.068CysTrp: 0.068 ± 0.064
0.338CysTyr: 0.338 ± 0.144
0.0CysXaa: 0.0 ± 0.0
Asp
3.789AspAla: 3.789 ± 0.526
0.474AspCys: 0.474 ± 0.18
4.128AspAsp: 4.128 ± 0.528
3.586AspGlu: 3.586 ± 0.659
3.857AspPhe: 3.857 ± 0.629
4.06AspGly: 4.06 ± 0.458
1.759AspHis: 1.759 ± 0.436
4.804AspIle: 4.804 ± 0.576
4.94AspLys: 4.94 ± 0.856
5.143AspLeu: 5.143 ± 0.527
2.03AspMet: 2.03 ± 0.387
2.639AspAsn: 2.639 ± 0.436
2.977AspPro: 2.977 ± 0.379
2.098AspGln: 2.098 ± 0.342
3.925AspArg: 3.925 ± 0.499
2.91AspSer: 2.91 ± 0.441
2.842AspThr: 2.842 ± 0.409
3.316AspVal: 3.316 ± 0.423
0.677AspTrp: 0.677 ± 0.211
2.233AspTyr: 2.233 ± 0.466
0.0AspXaa: 0.0 ± 0.0
Glu
5.887GluAla: 5.887 ± 0.697
0.812GluCys: 0.812 ± 0.21
4.331GluAsp: 4.331 ± 0.547
8.459GluGlu: 8.459 ± 1.087
3.316GluPhe: 3.316 ± 0.466
4.804GluGly: 4.804 ± 0.692
2.098GluHis: 2.098 ± 0.458
5.346GluIle: 5.346 ± 0.658
4.669GluLys: 4.669 ± 0.55
7.782GluLeu: 7.782 ± 0.897
2.436GluMet: 2.436 ± 0.406
3.992GluAsn: 3.992 ± 0.553
1.895GluPro: 1.895 ± 0.435
4.06GluGln: 4.06 ± 0.649
2.707GluArg: 2.707 ± 0.411
3.857GluSer: 3.857 ± 0.491
4.398GluThr: 4.398 ± 0.578
4.669GluVal: 4.669 ± 0.702
1.015GluTrp: 1.015 ± 0.235
3.789GluTyr: 3.789 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
2.639PheAla: 2.639 ± 0.497
0.338PheCys: 0.338 ± 0.151
2.977PheAsp: 2.977 ± 0.517
4.263PheGlu: 4.263 ± 0.527
2.03PhePhe: 2.03 ± 0.393
3.248PheGly: 3.248 ± 0.39
1.083PheHis: 1.083 ± 0.341
2.707PheIle: 2.707 ± 0.433
3.18PheLys: 3.18 ± 0.491
3.654PheLeu: 3.654 ± 0.498
1.489PheMet: 1.489 ± 0.365
2.233PheAsn: 2.233 ± 0.335
1.624PhePro: 1.624 ± 0.368
1.286PheGln: 1.286 ± 0.258
1.556PheArg: 1.556 ± 0.356
2.774PheSer: 2.774 ± 0.483
3.654PheThr: 3.654 ± 0.52
2.504PheVal: 2.504 ± 0.441
0.406PheTrp: 0.406 ± 0.164
1.827PheTyr: 1.827 ± 0.343
0.0PheXaa: 0.0 ± 0.0
Gly
5.278GlyAla: 5.278 ± 0.825
0.609GlyCys: 0.609 ± 0.212
4.669GlyAsp: 4.669 ± 0.528
4.804GlyGlu: 4.804 ± 0.519
3.451GlyPhe: 3.451 ± 0.462
5.346GlyGly: 5.346 ± 0.932
1.15GlyHis: 1.15 ± 0.323
4.804GlyIle: 4.804 ± 0.662
5.616GlyLys: 5.616 ± 0.647
5.278GlyLeu: 5.278 ± 0.688
2.165GlyMet: 2.165 ± 0.503
3.113GlyAsn: 3.113 ± 0.66
0.068GlyPro: 0.068 ± 0.06
2.368GlyGln: 2.368 ± 0.449
2.571GlyArg: 2.571 ± 0.471
4.669GlySer: 4.669 ± 0.665
2.774GlyThr: 2.774 ± 0.473
4.601GlyVal: 4.601 ± 0.64
0.88GlyTrp: 0.88 ± 0.195
3.18GlyTyr: 3.18 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.231
0.203HisCys: 0.203 ± 0.159
0.812HisAsp: 0.812 ± 0.221
1.489HisGlu: 1.489 ± 0.357
1.421HisPhe: 1.421 ± 0.298
1.895HisGly: 1.895 ± 0.404
0.474HisHis: 0.474 ± 0.183
1.353HisIle: 1.353 ± 0.306
1.624HisLys: 1.624 ± 0.368
1.556HisLeu: 1.556 ± 0.338
0.609HisMet: 0.609 ± 0.176
1.083HisAsn: 1.083 ± 0.273
1.353HisPro: 1.353 ± 0.242
0.609HisGln: 0.609 ± 0.211
1.15HisArg: 1.15 ± 0.231
1.624HisSer: 1.624 ± 0.405
1.083HisThr: 1.083 ± 0.305
1.083HisVal: 1.083 ± 0.25
0.271HisTrp: 0.271 ± 0.123
0.947HisTyr: 0.947 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
3.992IleAla: 3.992 ± 0.482
0.541IleCys: 0.541 ± 0.197
4.94IleAsp: 4.94 ± 0.504
5.752IleGlu: 5.752 ± 0.593
2.301IlePhe: 2.301 ± 0.495
3.654IleGly: 3.654 ± 0.536
1.353IleHis: 1.353 ± 0.269
4.263IleIle: 4.263 ± 0.604
6.158IleLys: 6.158 ± 0.642
4.331IleLeu: 4.331 ± 0.552
1.421IleMet: 1.421 ± 0.322
4.06IleAsn: 4.06 ± 0.482
3.18IlePro: 3.18 ± 0.382
2.571IleGln: 2.571 ± 0.411
3.113IleArg: 3.113 ± 0.397
4.331IleSer: 4.331 ± 0.831
4.195IleThr: 4.195 ± 0.54
3.925IleVal: 3.925 ± 0.521
0.338IleTrp: 0.338 ± 0.128
2.301IleTyr: 2.301 ± 0.443
0.0IleXaa: 0.0 ± 0.0
Lys
4.398LysAla: 4.398 ± 0.585
0.609LysCys: 0.609 ± 0.268
4.737LysAsp: 4.737 ± 0.73
8.526LysGlu: 8.526 ± 1.032
3.925LysPhe: 3.925 ± 0.537
4.128LysGly: 4.128 ± 0.617
2.098LysHis: 2.098 ± 0.364
3.857LysIle: 3.857 ± 0.518
7.173LysLys: 7.173 ± 0.946
5.549LysLeu: 5.549 ± 0.793
2.774LysMet: 2.774 ± 0.485
4.331LysAsn: 4.331 ± 0.465
2.977LysPro: 2.977 ± 0.429
3.925LysGln: 3.925 ± 0.58
3.925LysArg: 3.925 ± 0.585
4.804LysSer: 4.804 ± 0.791
4.128LysThr: 4.128 ± 0.531
5.075LysVal: 5.075 ± 0.718
0.541LysTrp: 0.541 ± 0.164
2.91LysTyr: 2.91 ± 0.446
0.0LysXaa: 0.0 ± 0.0
Leu
3.925LeuAla: 3.925 ± 0.701
0.744LeuCys: 0.744 ± 0.215
5.21LeuAsp: 5.21 ± 0.6
6.293LeuGlu: 6.293 ± 0.875
2.165LeuPhe: 2.165 ± 0.415
5.684LeuGly: 5.684 ± 0.665
1.556LeuHis: 1.556 ± 0.296
4.669LeuIle: 4.669 ± 0.52
7.647LeuLys: 7.647 ± 0.85
5.752LeuLeu: 5.752 ± 0.579
2.301LeuMet: 2.301 ± 0.369
4.128LeuAsn: 4.128 ± 0.52
3.113LeuPro: 3.113 ± 0.38
2.436LeuGln: 2.436 ± 0.379
3.654LeuArg: 3.654 ± 0.49
5.549LeuSer: 5.549 ± 0.536
5.278LeuThr: 5.278 ± 0.533
5.413LeuVal: 5.413 ± 0.663
0.812LeuTrp: 0.812 ± 0.207
2.977LeuTyr: 2.977 ± 0.523
0.0LeuXaa: 0.0 ± 0.0
Met
2.368MetAla: 2.368 ± 0.44
0.338MetCys: 0.338 ± 0.152
1.759MetAsp: 1.759 ± 0.358
2.03MetGlu: 2.03 ± 0.31
1.489MetPhe: 1.489 ± 0.285
1.421MetGly: 1.421 ± 0.298
0.609MetHis: 0.609 ± 0.196
1.624MetIle: 1.624 ± 0.364
3.045MetLys: 3.045 ± 0.445
1.759MetLeu: 1.759 ± 0.349
0.677MetMet: 0.677 ± 0.201
1.353MetAsn: 1.353 ± 0.279
0.812MetPro: 0.812 ± 0.244
0.474MetGln: 0.474 ± 0.162
0.88MetArg: 0.88 ± 0.242
3.113MetSer: 3.113 ± 0.521
2.098MetThr: 2.098 ± 0.396
1.286MetVal: 1.286 ± 0.298
0.406MetTrp: 0.406 ± 0.13
1.015MetTyr: 1.015 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
2.504AsnAla: 2.504 ± 0.442
0.474AsnCys: 0.474 ± 0.194
3.316AsnAsp: 3.316 ± 0.458
3.586AsnGlu: 3.586 ± 0.434
1.759AsnPhe: 1.759 ± 0.267
4.195AsnGly: 4.195 ± 0.655
1.083AsnHis: 1.083 ± 0.231
3.248AsnIle: 3.248 ± 0.546
3.654AsnLys: 3.654 ± 0.449
3.451AsnLeu: 3.451 ± 0.374
1.624AsnMet: 1.624 ± 0.274
3.248AsnAsn: 3.248 ± 0.53
2.233AsnPro: 2.233 ± 0.406
1.962AsnGln: 1.962 ± 0.391
2.301AsnArg: 2.301 ± 0.388
3.113AsnSer: 3.113 ± 0.534
3.045AsnThr: 3.045 ± 0.765
3.045AsnVal: 3.045 ± 0.498
1.15AsnTrp: 1.15 ± 0.298
1.692AsnTyr: 1.692 ± 0.298
0.0AsnXaa: 0.0 ± 0.0
Pro
3.18ProAla: 3.18 ± 0.455
0.271ProCys: 0.271 ± 0.124
2.436ProAsp: 2.436 ± 0.435
3.18ProGlu: 3.18 ± 0.526
2.165ProPhe: 2.165 ± 0.322
0.0ProGly: 0.0 ± 0.0
1.015ProHis: 1.015 ± 0.255
1.895ProIle: 1.895 ± 0.454
2.639ProLys: 2.639 ± 0.448
3.519ProLeu: 3.519 ± 0.557
1.15ProMet: 1.15 ± 0.26
1.556ProAsn: 1.556 ± 0.314
1.083ProPro: 1.083 ± 0.304
0.677ProGln: 0.677 ± 0.223
1.15ProArg: 1.15 ± 0.287
2.368ProSer: 2.368 ± 0.487
1.827ProThr: 1.827 ± 0.4
2.03ProVal: 2.03 ± 0.294
0.338ProTrp: 0.338 ± 0.131
0.609ProTyr: 0.609 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
1.827GlnAla: 1.827 ± 0.324
0.271GlnCys: 0.271 ± 0.123
2.098GlnAsp: 2.098 ± 0.413
3.654GlnGlu: 3.654 ± 0.559
1.353GlnPhe: 1.353 ± 0.314
2.301GlnGly: 2.301 ± 0.433
0.609GlnHis: 0.609 ± 0.246
2.436GlnIle: 2.436 ± 0.402
2.707GlnLys: 2.707 ± 0.38
3.248GlnLeu: 3.248 ± 0.432
1.015GlnMet: 1.015 ± 0.235
1.15GlnAsn: 1.15 ± 0.278
1.353GlnPro: 1.353 ± 0.305
0.947GlnGln: 0.947 ± 0.229
2.03GlnArg: 2.03 ± 0.347
1.962GlnSer: 1.962 ± 0.401
2.098GlnThr: 2.098 ± 0.445
2.842GlnVal: 2.842 ± 0.44
0.474GlnTrp: 0.474 ± 0.174
1.759GlnTyr: 1.759 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
3.925ArgAla: 3.925 ± 0.482
0.338ArgCys: 0.338 ± 0.137
1.895ArgAsp: 1.895 ± 0.399
3.789ArgGlu: 3.789 ± 0.597
2.165ArgPhe: 2.165 ± 0.454
2.707ArgGly: 2.707 ± 0.461
0.812ArgHis: 0.812 ± 0.209
3.045ArgIle: 3.045 ± 0.403
3.451ArgLys: 3.451 ± 0.459
2.639ArgLeu: 2.639 ± 0.449
1.015ArgMet: 1.015 ± 0.222
1.962ArgAsn: 1.962 ± 0.416
1.286ArgPro: 1.286 ± 0.312
1.895ArgGln: 1.895 ± 0.36
2.977ArgArg: 2.977 ± 0.398
1.759ArgSer: 1.759 ± 0.389
2.977ArgThr: 2.977 ± 0.642
2.977ArgVal: 2.977 ± 0.548
0.541ArgTrp: 0.541 ± 0.197
1.556ArgTyr: 1.556 ± 0.353
0.0ArgXaa: 0.0 ± 0.0
Ser
4.06SerAla: 4.06 ± 0.793
0.947SerCys: 0.947 ± 0.284
3.992SerAsp: 3.992 ± 0.535
4.06SerGlu: 4.06 ± 0.535
2.571SerPhe: 2.571 ± 0.342
5.413SerGly: 5.413 ± 0.826
1.218SerHis: 1.218 ± 0.251
4.466SerIle: 4.466 ± 0.539
5.278SerLys: 5.278 ± 0.669
5.075SerLeu: 5.075 ± 0.457
1.624SerMet: 1.624 ± 0.352
3.925SerAsn: 3.925 ± 0.688
1.489SerPro: 1.489 ± 0.334
3.18SerGln: 3.18 ± 0.407
1.421SerArg: 1.421 ± 0.337
4.263SerSer: 4.263 ± 0.622
3.248SerThr: 3.248 ± 0.856
3.316SerVal: 3.316 ± 0.518
0.744SerTrp: 0.744 ± 0.232
2.03SerTyr: 2.03 ± 0.451
0.0SerXaa: 0.0 ± 0.0
Thr
4.06ThrAla: 4.06 ± 0.599
0.609ThrCys: 0.609 ± 0.188
3.586ThrAsp: 3.586 ± 0.496
4.669ThrGlu: 4.669 ± 0.654
3.045ThrPhe: 3.045 ± 0.508
5.413ThrGly: 5.413 ± 0.67
0.947ThrHis: 0.947 ± 0.261
4.331ThrIle: 4.331 ± 0.428
3.383ThrLys: 3.383 ± 0.586
4.534ThrLeu: 4.534 ± 0.635
0.812ThrMet: 0.812 ± 0.213
2.977ThrAsn: 2.977 ± 0.63
3.18ThrPro: 3.18 ± 0.483
1.759ThrGln: 1.759 ± 0.317
2.301ThrArg: 2.301 ± 0.502
3.992ThrSer: 3.992 ± 1.08
3.519ThrThr: 3.519 ± 0.552
4.195ThrVal: 4.195 ± 0.537
0.812ThrTrp: 0.812 ± 0.321
2.098ThrTyr: 2.098 ± 0.429
0.0ThrXaa: 0.0 ± 0.0
Val
4.06ValAla: 4.06 ± 0.633
0.88ValCys: 0.88 ± 0.242
4.128ValAsp: 4.128 ± 0.468
3.586ValGlu: 3.586 ± 0.487
2.571ValPhe: 2.571 ± 0.419
4.06ValGly: 4.06 ± 0.633
1.083ValHis: 1.083 ± 0.252
4.872ValIle: 4.872 ± 0.571
4.804ValLys: 4.804 ± 0.527
4.601ValLeu: 4.601 ± 0.681
2.03ValMet: 2.03 ± 0.29
3.383ValAsn: 3.383 ± 0.42
1.353ValPro: 1.353 ± 0.303
2.571ValGln: 2.571 ± 0.44
2.504ValArg: 2.504 ± 0.354
3.451ValSer: 3.451 ± 0.645
4.737ValThr: 4.737 ± 0.682
4.06ValVal: 4.06 ± 0.625
0.541ValTrp: 0.541 ± 0.198
2.504ValTyr: 2.504 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
0.271TrpAla: 0.271 ± 0.142
0.068TrpCys: 0.068 ± 0.062
0.609TrpAsp: 0.609 ± 0.2
0.677TrpGlu: 0.677 ± 0.249
0.406TrpPhe: 0.406 ± 0.165
0.677TrpGly: 0.677 ± 0.205
0.135TrpHis: 0.135 ± 0.094
1.015TrpIle: 1.015 ± 0.272
0.609TrpLys: 0.609 ± 0.21
1.083TrpLeu: 1.083 ± 0.275
0.338TrpMet: 0.338 ± 0.129
0.947TrpAsn: 0.947 ± 0.259
0.0TrpPro: 0.0 ± 0.0
0.338TrpGln: 0.338 ± 0.153
0.609TrpArg: 0.609 ± 0.215
1.624TrpSer: 1.624 ± 0.389
0.744TrpThr: 0.744 ± 0.264
0.541TrpVal: 0.541 ± 0.264
0.203TrpTrp: 0.203 ± 0.124
0.474TrpTyr: 0.474 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.624TyrAla: 1.624 ± 0.316
0.203TyrCys: 0.203 ± 0.118
2.707TyrAsp: 2.707 ± 0.437
3.519TyrGlu: 3.519 ± 0.472
1.083TyrPhe: 1.083 ± 0.283
3.519TyrGly: 3.519 ± 0.445
0.677TyrHis: 0.677 ± 0.195
2.774TyrIle: 2.774 ± 0.454
3.045TyrLys: 3.045 ± 0.532
3.316TyrLeu: 3.316 ± 0.468
1.015TyrMet: 1.015 ± 0.269
1.692TyrAsn: 1.692 ± 0.376
0.677TyrPro: 0.677 ± 0.28
1.015TyrGln: 1.015 ± 0.23
1.759TyrArg: 1.759 ± 0.473
2.504TyrSer: 2.504 ± 0.441
3.451TyrThr: 3.451 ± 0.558
2.165TyrVal: 2.165 ± 0.442
0.203TyrTrp: 0.203 ± 0.106
1.218TyrTyr: 1.218 ± 0.332
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (14779 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski