Amino acid dipepetide frequency for Aeromonas virus phiO18P

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.748AlaAla: 11.748 ± 1.629
0.963AlaCys: 0.963 ± 0.34
6.066AlaAsp: 6.066 ± 0.73
6.355AlaGlu: 6.355 ± 0.792
3.178AlaPhe: 3.178 ± 0.553
10.014AlaGly: 10.014 ± 0.993
1.637AlaHis: 1.637 ± 0.436
5.97AlaIle: 5.97 ± 0.807
5.681AlaLys: 5.681 ± 0.793
10.4AlaLeu: 10.4 ± 1.122
3.467AlaMet: 3.467 ± 0.518
4.044AlaAsn: 4.044 ± 0.651
3.948AlaPro: 3.948 ± 0.748
4.333AlaGln: 4.333 ± 0.586
6.259AlaArg: 6.259 ± 0.957
6.259AlaSer: 6.259 ± 0.85
6.644AlaThr: 6.644 ± 0.638
6.837AlaVal: 6.837 ± 0.748
2.985AlaTrp: 2.985 ± 0.508
3.081AlaTyr: 3.081 ± 0.605
0.0AlaXaa: 0.0 ± 0.0
Cys
1.541CysAla: 1.541 ± 0.415
0.096CysCys: 0.096 ± 0.1
0.385CysAsp: 0.385 ± 0.178
0.674CysGlu: 0.674 ± 0.226
0.289CysPhe: 0.289 ± 0.178
1.156CysGly: 1.156 ± 0.363
0.385CysHis: 0.385 ± 0.224
0.193CysIle: 0.193 ± 0.119
0.385CysLys: 0.385 ± 0.189
0.77CysLeu: 0.77 ± 0.259
0.096CysMet: 0.096 ± 0.091
0.481CysAsn: 0.481 ± 0.227
0.481CysPro: 0.481 ± 0.214
1.156CysGln: 1.156 ± 0.487
1.059CysArg: 1.059 ± 0.292
0.867CysSer: 0.867 ± 0.236
0.385CysThr: 0.385 ± 0.18
0.193CysVal: 0.193 ± 0.145
0.481CysTrp: 0.481 ± 0.29
0.289CysTyr: 0.289 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
4.911AspAla: 4.911 ± 0.724
0.578AspCys: 0.578 ± 0.203
3.467AspAsp: 3.467 ± 0.528
3.852AspGlu: 3.852 ± 0.488
1.541AspPhe: 1.541 ± 0.309
5.296AspGly: 5.296 ± 0.823
1.059AspHis: 1.059 ± 0.348
3.274AspIle: 3.274 ± 0.523
2.022AspLys: 2.022 ± 0.343
5.2AspLeu: 5.2 ± 0.562
1.733AspMet: 1.733 ± 0.399
1.83AspAsn: 1.83 ± 0.438
3.467AspPro: 3.467 ± 0.702
2.504AspGln: 2.504 ± 0.488
2.985AspArg: 2.985 ± 0.587
2.215AspSer: 2.215 ± 0.433
3.081AspThr: 3.081 ± 0.629
4.237AspVal: 4.237 ± 0.584
1.541AspTrp: 1.541 ± 0.436
1.83AspTyr: 1.83 ± 0.391
0.0AspXaa: 0.0 ± 0.0
Glu
6.355GluAla: 6.355 ± 0.841
0.481GluCys: 0.481 ± 0.256
2.215GluAsp: 2.215 ± 0.451
2.696GluGlu: 2.696 ± 0.631
2.215GluPhe: 2.215 ± 0.414
2.792GluGly: 2.792 ± 0.513
1.733GluHis: 1.733 ± 0.393
2.215GluIle: 2.215 ± 0.537
2.407GluLys: 2.407 ± 0.618
6.933GluLeu: 6.933 ± 0.69
1.733GluMet: 1.733 ± 0.319
0.963GluAsn: 0.963 ± 0.274
2.985GluPro: 2.985 ± 0.683
5.007GluGln: 5.007 ± 0.715
4.526GluArg: 4.526 ± 0.774
2.504GluSer: 2.504 ± 0.441
2.118GluThr: 2.118 ± 0.516
2.985GluVal: 2.985 ± 0.517
1.156GluTrp: 1.156 ± 0.331
1.541GluTyr: 1.541 ± 0.35
0.0GluXaa: 0.0 ± 0.0
Phe
3.467PheAla: 3.467 ± 0.474
0.385PheCys: 0.385 ± 0.148
1.83PheAsp: 1.83 ± 0.554
1.83PheGlu: 1.83 ± 0.371
0.963PhePhe: 0.963 ± 0.265
3.563PheGly: 3.563 ± 0.668
0.77PheHis: 0.77 ± 0.36
1.444PheIle: 1.444 ± 0.309
1.733PheLys: 1.733 ± 0.274
1.541PheLeu: 1.541 ± 0.499
0.77PheMet: 0.77 ± 0.239
0.674PheAsn: 0.674 ± 0.245
0.867PhePro: 0.867 ± 0.282
1.444PheGln: 1.444 ± 0.356
2.118PheArg: 2.118 ± 0.405
2.022PheSer: 2.022 ± 0.417
2.215PheThr: 2.215 ± 0.413
2.022PheVal: 2.022 ± 0.504
0.867PheTrp: 0.867 ± 0.308
0.481PheTyr: 0.481 ± 0.255
0.0PheXaa: 0.0 ± 0.0
Gly
6.452GlyAla: 6.452 ± 0.687
0.674GlyCys: 0.674 ± 0.325
5.007GlyAsp: 5.007 ± 0.705
5.2GlyGlu: 5.2 ± 0.719
2.311GlyPhe: 2.311 ± 0.511
5.585GlyGly: 5.585 ± 0.791
2.118GlyHis: 2.118 ± 0.376
3.178GlyIle: 3.178 ± 0.559
4.718GlyLys: 4.718 ± 0.547
7.029GlyLeu: 7.029 ± 0.882
2.118GlyMet: 2.118 ± 0.383
2.504GlyAsn: 2.504 ± 0.516
2.022GlyPro: 2.022 ± 0.342
5.392GlyGln: 5.392 ± 0.83
5.104GlyArg: 5.104 ± 0.649
4.429GlySer: 4.429 ± 0.739
4.526GlyThr: 4.526 ± 0.702
5.392GlyVal: 5.392 ± 0.591
1.637GlyTrp: 1.637 ± 0.382
1.926GlyTyr: 1.926 ± 0.347
0.0GlyXaa: 0.0 ± 0.0
His
2.022HisAla: 2.022 ± 0.457
0.385HisCys: 0.385 ± 0.198
1.156HisAsp: 1.156 ± 0.354
0.578HisGlu: 0.578 ± 0.283
0.77HisPhe: 0.77 ± 0.348
1.637HisGly: 1.637 ± 0.41
0.963HisHis: 0.963 ± 0.29
1.444HisIle: 1.444 ± 0.404
1.444HisLys: 1.444 ± 0.355
2.407HisLeu: 2.407 ± 0.506
0.481HisMet: 0.481 ± 0.189
0.578HisAsn: 0.578 ± 0.242
1.83HisPro: 1.83 ± 0.44
1.83HisGln: 1.83 ± 0.365
1.444HisArg: 1.444 ± 0.412
1.252HisSer: 1.252 ± 0.335
1.541HisThr: 1.541 ± 0.456
1.541HisVal: 1.541 ± 0.347
0.77HisTrp: 0.77 ± 0.252
0.385HisTyr: 0.385 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
5.392IleAla: 5.392 ± 0.82
0.289IleCys: 0.289 ± 0.219
3.659IleAsp: 3.659 ± 0.494
3.467IleGlu: 3.467 ± 0.67
1.059IlePhe: 1.059 ± 0.253
3.274IleGly: 3.274 ± 0.44
0.578IleHis: 0.578 ± 0.245
1.637IleIle: 1.637 ± 0.375
2.696IleLys: 2.696 ± 0.486
2.311IleLeu: 2.311 ± 0.359
0.867IleMet: 0.867 ± 0.288
2.696IleAsn: 2.696 ± 0.449
2.504IlePro: 2.504 ± 0.457
1.156IleGln: 1.156 ± 0.349
2.504IleArg: 2.504 ± 0.51
2.311IleSer: 2.311 ± 0.459
3.852IleThr: 3.852 ± 0.605
1.926IleVal: 1.926 ± 0.509
0.867IleTrp: 0.867 ± 0.314
1.444IleTyr: 1.444 ± 0.317
0.0IleXaa: 0.0 ± 0.0
Lys
5.778LysAla: 5.778 ± 0.953
0.481LysCys: 0.481 ± 0.227
1.83LysAsp: 1.83 ± 0.338
2.504LysGlu: 2.504 ± 0.466
1.156LysPhe: 1.156 ± 0.39
4.237LysGly: 4.237 ± 0.821
1.252LysHis: 1.252 ± 0.416
0.963LysIle: 0.963 ± 0.276
2.407LysLys: 2.407 ± 0.568
5.104LysLeu: 5.104 ± 0.741
1.059LysMet: 1.059 ± 0.344
1.83LysAsn: 1.83 ± 0.49
2.6LysPro: 2.6 ± 0.45
1.444LysGln: 1.444 ± 0.396
4.429LysArg: 4.429 ± 0.886
1.637LysSer: 1.637 ± 0.397
2.792LysThr: 2.792 ± 0.613
2.696LysVal: 2.696 ± 0.468
0.674LysTrp: 0.674 ± 0.222
0.674LysTyr: 0.674 ± 0.219
0.0LysXaa: 0.0 ± 0.0
Leu
13.192LeuAla: 13.192 ± 1.337
1.83LeuCys: 1.83 ± 0.434
6.548LeuAsp: 6.548 ± 0.817
4.237LeuGlu: 4.237 ± 0.58
2.889LeuPhe: 2.889 ± 0.435
7.703LeuGly: 7.703 ± 0.997
2.022LeuHis: 2.022 ± 0.374
4.333LeuIle: 4.333 ± 0.635
4.526LeuLys: 4.526 ± 0.801
9.052LeuLeu: 9.052 ± 1.093
1.83LeuMet: 1.83 ± 0.332
3.659LeuAsn: 3.659 ± 0.564
5.296LeuPro: 5.296 ± 0.717
4.237LeuGln: 4.237 ± 0.644
6.644LeuArg: 6.644 ± 0.726
5.874LeuSer: 5.874 ± 0.815
5.778LeuThr: 5.778 ± 0.918
5.489LeuVal: 5.489 ± 0.728
1.059LeuTrp: 1.059 ± 0.301
2.311LeuTyr: 2.311 ± 0.628
0.0LeuXaa: 0.0 ± 0.0
Met
2.696MetAla: 2.696 ± 0.551
0.289MetCys: 0.289 ± 0.164
1.156MetAsp: 1.156 ± 0.357
1.444MetGlu: 1.444 ± 0.357
0.578MetPhe: 0.578 ± 0.251
2.022MetGly: 2.022 ± 0.501
0.578MetHis: 0.578 ± 0.243
0.578MetIle: 0.578 ± 0.218
1.348MetLys: 1.348 ± 0.352
2.215MetLeu: 2.215 ± 0.485
0.867MetMet: 0.867 ± 0.283
1.059MetAsn: 1.059 ± 0.346
1.83MetPro: 1.83 ± 0.431
1.156MetGln: 1.156 ± 0.281
1.444MetArg: 1.444 ± 0.321
1.926MetSer: 1.926 ± 0.427
2.792MetThr: 2.792 ± 0.441
1.444MetVal: 1.444 ± 0.38
0.385MetTrp: 0.385 ± 0.223
0.096MetTyr: 0.096 ± 0.077
0.0MetXaa: 0.0 ± 0.0
Asn
3.659AsnAla: 3.659 ± 0.588
0.385AsnCys: 0.385 ± 0.197
1.83AsnAsp: 1.83 ± 0.465
1.83AsnGlu: 1.83 ± 0.38
1.059AsnPhe: 1.059 ± 0.382
3.467AsnGly: 3.467 ± 0.547
0.674AsnHis: 0.674 ± 0.303
1.541AsnIle: 1.541 ± 0.34
1.444AsnLys: 1.444 ± 0.512
2.792AsnLeu: 2.792 ± 0.643
0.963AsnMet: 0.963 ± 0.31
1.059AsnAsn: 1.059 ± 0.464
2.407AsnPro: 2.407 ± 0.513
2.889AsnGln: 2.889 ± 0.423
1.83AsnArg: 1.83 ± 0.466
2.118AsnSer: 2.118 ± 0.634
1.637AsnThr: 1.637 ± 0.426
1.059AsnVal: 1.059 ± 0.363
0.578AsnTrp: 0.578 ± 0.209
0.674AsnTyr: 0.674 ± 0.243
0.0AsnXaa: 0.0 ± 0.0
Pro
5.778ProAla: 5.778 ± 0.688
0.674ProCys: 0.674 ± 0.396
4.044ProAsp: 4.044 ± 0.747
3.563ProGlu: 3.563 ± 0.639
1.733ProPhe: 1.733 ± 0.328
4.526ProGly: 4.526 ± 0.464
1.348ProHis: 1.348 ± 0.463
2.118ProIle: 2.118 ± 0.536
1.637ProLys: 1.637 ± 0.413
3.755ProLeu: 3.755 ± 0.856
1.156ProMet: 1.156 ± 0.266
1.444ProAsn: 1.444 ± 0.259
1.926ProPro: 1.926 ± 0.404
1.348ProGln: 1.348 ± 0.427
2.889ProArg: 2.889 ± 0.463
3.178ProSer: 3.178 ± 0.606
2.889ProThr: 2.889 ± 0.601
3.081ProVal: 3.081 ± 0.411
0.963ProTrp: 0.963 ± 0.379
1.156ProTyr: 1.156 ± 0.315
0.0ProXaa: 0.0 ± 0.0
Gln
6.355GlnAla: 6.355 ± 0.915
0.481GlnCys: 0.481 ± 0.198
2.985GlnAsp: 2.985 ± 0.68
2.6GlnGlu: 2.6 ± 0.522
2.118GlnPhe: 2.118 ± 0.539
3.081GlnGly: 3.081 ± 0.614
1.733GlnHis: 1.733 ± 0.518
2.311GlnIle: 2.311 ± 0.512
0.77GlnLys: 0.77 ± 0.279
6.644GlnLeu: 6.644 ± 0.793
1.637GlnMet: 1.637 ± 0.391
0.578GlnAsn: 0.578 ± 0.189
2.889GlnPro: 2.889 ± 0.527
4.237GlnGln: 4.237 ± 0.756
3.274GlnArg: 3.274 ± 0.607
2.215GlnSer: 2.215 ± 0.428
2.696GlnThr: 2.696 ± 0.466
3.178GlnVal: 3.178 ± 0.451
0.867GlnTrp: 0.867 ± 0.29
1.059GlnTyr: 1.059 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
7.607ArgAla: 7.607 ± 0.812
0.963ArgCys: 0.963 ± 0.269
2.792ArgAsp: 2.792 ± 0.633
4.141ArgGlu: 4.141 ± 0.728
2.022ArgPhe: 2.022 ± 0.461
2.696ArgGly: 2.696 ± 0.509
2.022ArgHis: 2.022 ± 0.536
3.659ArgIle: 3.659 ± 0.526
3.852ArgLys: 3.852 ± 0.763
7.607ArgLeu: 7.607 ± 0.954
0.963ArgMet: 0.963 ± 0.335
2.696ArgAsn: 2.696 ± 0.578
2.6ArgPro: 2.6 ± 0.593
3.081ArgGln: 3.081 ± 0.562
5.2ArgArg: 5.2 ± 1.008
2.792ArgSer: 2.792 ± 0.471
3.659ArgThr: 3.659 ± 0.6
3.852ArgVal: 3.852 ± 0.529
1.926ArgTrp: 1.926 ± 0.364
2.311ArgTyr: 2.311 ± 0.317
0.0ArgXaa: 0.0 ± 0.0
Ser
5.778SerAla: 5.778 ± 0.729
0.193SerCys: 0.193 ± 0.143
3.081SerAsp: 3.081 ± 0.628
3.37SerGlu: 3.37 ± 0.535
1.252SerPhe: 1.252 ± 0.336
3.659SerGly: 3.659 ± 0.819
1.733SerHis: 1.733 ± 0.477
1.926SerIle: 1.926 ± 0.316
2.118SerLys: 2.118 ± 0.411
4.911SerLeu: 4.911 ± 0.873
2.022SerMet: 2.022 ± 0.457
1.926SerAsn: 1.926 ± 0.333
2.985SerPro: 2.985 ± 0.469
2.985SerGln: 2.985 ± 0.524
3.081SerArg: 3.081 ± 0.64
3.081SerSer: 3.081 ± 0.499
3.081SerThr: 3.081 ± 0.561
3.467SerVal: 3.467 ± 0.469
0.867SerTrp: 0.867 ± 0.271
1.156SerTyr: 1.156 ± 0.304
0.0SerXaa: 0.0 ± 0.0
Thr
5.874ThrAla: 5.874 ± 0.684
0.77ThrCys: 0.77 ± 0.281
4.044ThrAsp: 4.044 ± 0.681
2.889ThrGlu: 2.889 ± 0.515
1.733ThrPhe: 1.733 ± 0.409
3.948ThrGly: 3.948 ± 0.507
1.252ThrHis: 1.252 ± 0.295
3.563ThrIle: 3.563 ± 0.476
3.081ThrLys: 3.081 ± 0.61
7.607ThrLeu: 7.607 ± 0.921
1.637ThrMet: 1.637 ± 0.417
2.022ThrAsn: 2.022 ± 0.498
3.178ThrPro: 3.178 ± 0.456
1.926ThrGln: 1.926 ± 0.482
4.044ThrArg: 4.044 ± 0.706
2.215ThrSer: 2.215 ± 0.66
3.563ThrThr: 3.563 ± 0.66
2.889ThrVal: 2.889 ± 0.504
1.059ThrTrp: 1.059 ± 0.275
2.311ThrTyr: 2.311 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
6.452ValAla: 6.452 ± 0.792
0.77ValCys: 0.77 ± 0.28
2.504ValAsp: 2.504 ± 0.348
2.696ValGlu: 2.696 ± 0.65
2.118ValPhe: 2.118 ± 0.516
6.163ValGly: 6.163 ± 0.867
1.156ValHis: 1.156 ± 0.432
2.215ValIle: 2.215 ± 0.505
1.83ValLys: 1.83 ± 0.417
5.585ValLeu: 5.585 ± 0.597
1.444ValMet: 1.444 ± 0.307
2.215ValAsn: 2.215 ± 0.532
2.792ValPro: 2.792 ± 0.566
2.504ValGln: 2.504 ± 0.404
3.659ValArg: 3.659 ± 0.489
3.755ValSer: 3.755 ± 0.626
4.622ValThr: 4.622 ± 0.582
4.044ValVal: 4.044 ± 0.688
1.156ValTrp: 1.156 ± 0.384
1.733ValTyr: 1.733 ± 0.479
0.0ValXaa: 0.0 ± 0.0
Trp
2.696TrpAla: 2.696 ± 0.606
0.289TrpCys: 0.289 ± 0.168
0.867TrpAsp: 0.867 ± 0.2
1.252TrpGlu: 1.252 ± 0.425
0.77TrpPhe: 0.77 ± 0.256
0.674TrpGly: 0.674 ± 0.286
0.77TrpHis: 0.77 ± 0.271
0.867TrpIle: 0.867 ± 0.289
0.674TrpLys: 0.674 ± 0.264
3.37TrpLeu: 3.37 ± 0.559
0.674TrpMet: 0.674 ± 0.203
0.674TrpAsn: 0.674 ± 0.266
1.444TrpPro: 1.444 ± 0.389
0.77TrpGln: 0.77 ± 0.261
1.733TrpArg: 1.733 ± 0.451
0.963TrpSer: 0.963 ± 0.332
0.578TrpThr: 0.578 ± 0.238
1.156TrpVal: 1.156 ± 0.318
0.385TrpTrp: 0.385 ± 0.215
0.193TrpTyr: 0.193 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.118TyrAla: 2.118 ± 0.461
0.385TyrCys: 0.385 ± 0.187
1.156TyrAsp: 1.156 ± 0.296
0.481TyrGlu: 0.481 ± 0.226
1.252TyrPhe: 1.252 ± 0.396
1.83TyrGly: 1.83 ± 0.497
0.674TyrHis: 0.674 ± 0.287
1.059TyrIle: 1.059 ± 0.263
0.77TyrLys: 0.77 ± 0.23
3.274TyrLeu: 3.274 ± 0.575
0.289TyrMet: 0.289 ± 0.115
0.963TyrAsn: 0.963 ± 0.34
1.156TyrPro: 1.156 ± 0.369
2.311TyrGln: 2.311 ± 0.406
2.215TyrArg: 2.215 ± 0.391
1.059TyrSer: 1.059 ± 0.277
1.156TyrThr: 1.156 ± 0.326
1.926TyrVal: 1.926 ± 0.388
0.578TyrTrp: 0.578 ± 0.234
0.963TyrTyr: 0.963 ± 0.279
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (10386 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski