Amino acid dipeptide frequency for Vibrio phage fNo16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
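The comparison against the uniform expectation can be sketched in a few lines of Python. This is only a minimal illustration, not the code used to build the database: the function names, the one-letter sequence input, and the choice to normalize by the total number of overlapping residue pairs are all assumptions made for the example.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumption: frequencies are normalized by the total number of
    overlapping residue pairs; pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy = dipeptide_permille(["AAG", "GA"])
```

With this sketch, `classify(13.543)` labels the AlaAla value from the table as overrepresented, and `classify(0.308)` labels a value such as CysPhe as underrepresented.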

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
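As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and into percent. The helper names are hypothetical and exist only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent."""
    return value_permille / 10.0


# Example with the AlaAla entry from the table (13.543 per mille).
ala_ala_fraction = permille_to_fraction(13.543)  # about 0.013543
ala_ala_percent = permille_to_percent(13.543)    # about 1.3543 %
```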

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as dipeptide: frequency ± error and grouped by the first residue. Residue order within each group: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 13.543 ± 2.408
AlaCys: 0.0 ± 0.0
AlaAsp: 4.617 ± 0.868
AlaGlu: 6.464 ± 1.177
AlaPhe: 3.078 ± 0.958
AlaGly: 6.464 ± 1.388
AlaHis: 0.923 ± 0.506
AlaIle: 5.54 ± 1.606
AlaLys: 7.387 ± 1.514
AlaLeu: 7.387 ± 1.407
AlaMet: 3.078 ± 1.031
AlaAsn: 3.078 ± 0.818
AlaPro: 3.693 ± 1.065
AlaGln: 4.001 ± 1.212
AlaArg: 5.848 ± 0.995
AlaSer: 4.309 ± 1.298
AlaThr: 4.617 ± 1.255
AlaVal: 5.54 ± 1.619
AlaTrp: 0.616 ± 0.367
AlaTyr: 1.847 ± 0.594
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.616 ± 0.437
CysCys: 0.0 ± 0.0
CysAsp: 1.231 ± 0.53
CysGlu: 1.231 ± 0.637
CysPhe: 0.308 ± 0.243
CysGly: 0.616 ± 0.299
CysHis: 0.0 ± 0.0
CysIle: 0.308 ± 0.259
CysLys: 0.923 ± 0.683
CysLeu: 0.616 ± 0.376
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.231 ± 0.529
CysGln: 0.308 ± 0.335
CysArg: 0.923 ± 0.435
CysSer: 0.923 ± 0.4
CysThr: 1.231 ± 0.576
CysVal: 0.308 ± 0.301
CysTrp: 0.0 ± 0.0
CysTyr: 0.308 ± 0.341
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.925 ± 1.448
AspCys: 0.923 ± 0.671
AspAsp: 4.309 ± 1.412
AspGlu: 5.848 ± 1.145
AspPhe: 2.77 ± 1.178
AspGly: 5.232 ± 1.105
AspHis: 0.308 ± 0.293
AspIle: 3.078 ± 0.861
AspLys: 3.386 ± 1.113
AspLeu: 2.77 ± 0.802
AspMet: 1.231 ± 0.588
AspAsn: 2.462 ± 0.575
AspPro: 1.539 ± 0.816
AspGln: 0.923 ± 0.461
AspArg: 1.539 ± 0.556
AspSer: 3.078 ± 0.841
AspThr: 2.462 ± 0.982
AspVal: 4.617 ± 1.195
AspTrp: 0.923 ± 0.413
AspTyr: 1.539 ± 0.839
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.078 ± 1.388
GluCys: 1.847 ± 0.824
GluAsp: 1.847 ± 0.764
GluGlu: 3.078 ± 0.921
GluPhe: 3.693 ± 1.127
GluGly: 2.155 ± 0.593
GluHis: 1.539 ± 0.567
GluIle: 6.464 ± 1.565
GluLys: 4.617 ± 0.965
GluLeu: 8.002 ± 0.932
GluMet: 2.155 ± 0.909
GluAsn: 3.078 ± 0.844
GluPro: 2.462 ± 0.837
GluGln: 4.001 ± 1.304
GluArg: 2.77 ± 0.846
GluSer: 4.001 ± 1.521
GluThr: 1.847 ± 0.524
GluVal: 4.309 ± 0.719
GluTrp: 0.308 ± 0.206
GluTyr: 2.77 ± 0.969
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.309 ± 1.158
PheCys: 0.308 ± 0.31
PheAsp: 3.386 ± 0.84
PheGlu: 3.386 ± 0.77
PhePhe: 1.539 ± 0.576
PheGly: 2.77 ± 1.168
PheHis: 0.616 ± 0.455
PheIle: 0.616 ± 0.387
PheLys: 1.539 ± 0.521
PheLeu: 4.309 ± 0.928
PheMet: 0.616 ± 0.545
PheAsn: 1.539 ± 0.524
PhePro: 1.539 ± 0.55
PheGln: 1.539 ± 0.475
PheArg: 1.847 ± 0.808
PheSer: 1.847 ± 0.703
PheThr: 2.77 ± 0.679
PheVal: 3.386 ± 1.112
PheTrp: 0.616 ± 0.45
PheTyr: 1.539 ± 0.764
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.926 ± 1.655
GlyCys: 1.539 ± 0.579
GlyAsp: 5.54 ± 1.404
GlyGlu: 4.001 ± 1.155
GlyPhe: 4.309 ± 1.034
GlyGly: 9.234 ± 2.558
GlyHis: 1.539 ± 0.635
GlyIle: 3.693 ± 1.551
GlyLys: 1.539 ± 0.585
GlyLeu: 4.925 ± 1.22
GlyMet: 0.923 ± 0.524
GlyAsn: 3.386 ± 0.987
GlyPro: 1.231 ± 0.44
GlyGln: 3.386 ± 0.948
GlyArg: 4.617 ± 1.174
GlySer: 5.54 ± 2.036
GlyThr: 4.001 ± 0.983
GlyVal: 6.156 ± 1.215
GlyTrp: 1.847 ± 0.608
GlyTyr: 2.462 ± 0.693
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.308 ± 0.32
HisCys: 0.616 ± 0.405
HisAsp: 0.923 ± 0.457
HisGlu: 1.847 ± 0.646
HisPhe: 0.923 ± 0.381
HisGly: 1.847 ± 0.696
HisHis: 0.308 ± 0.206
HisIle: 0.616 ± 0.375
HisLys: 0.616 ± 0.299
HisLeu: 1.231 ± 0.511
HisMet: 0.308 ± 0.314
HisAsn: 0.0 ± 0.0
HisPro: 0.616 ± 0.413
HisGln: 0.616 ± 0.362
HisArg: 0.616 ± 0.363
HisSer: 0.308 ± 0.293
HisThr: 0.616 ± 0.518
HisVal: 1.539 ± 0.836
HisTrp: 0.308 ± 0.329
HisTyr: 0.616 ± 0.33
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.001 ± 1.041
IleCys: 0.308 ± 0.329
IleAsp: 6.464 ± 2.066
IleGlu: 5.232 ± 1.131
IlePhe: 2.155 ± 0.667
IleGly: 3.078 ± 1.099
IleHis: 1.231 ± 0.631
IleIle: 3.078 ± 1.374
IleLys: 5.232 ± 0.964
IleLeu: 1.847 ± 0.609
IleMet: 0.616 ± 0.401
IleAsn: 3.693 ± 1.324
IlePro: 1.847 ± 0.588
IleGln: 2.155 ± 0.56
IleArg: 4.925 ± 1.517
IleSer: 3.386 ± 1.11
IleThr: 5.54 ± 1.188
IleVal: 2.155 ± 0.733
IleTrp: 2.155 ± 0.69
IleTyr: 1.231 ± 0.62
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.156 ± 1.251
LysCys: 1.539 ± 0.544
LysAsp: 2.155 ± 0.622
LysGlu: 3.078 ± 0.972
LysPhe: 1.539 ± 0.565
LysGly: 3.386 ± 1.25
LysHis: 2.462 ± 0.761
LysIle: 2.77 ± 0.928
LysLys: 3.386 ± 1.157
LysLeu: 5.54 ± 0.981
LysMet: 3.386 ± 0.79
LysAsn: 2.462 ± 0.631
LysPro: 3.693 ± 1.054
LysGln: 3.386 ± 1.055
LysArg: 4.617 ± 0.945
LysSer: 4.001 ± 0.884
LysThr: 3.386 ± 0.955
LysVal: 3.693 ± 1.08
LysTrp: 0.923 ± 0.39
LysTyr: 1.847 ± 0.783
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.079 ± 1.069
LeuCys: 0.616 ± 0.681
LeuAsp: 2.77 ± 0.689
LeuGlu: 5.54 ± 1.054
LeuPhe: 2.77 ± 0.539
LeuGly: 7.079 ± 1.164
LeuHis: 0.923 ± 0.386
LeuIle: 6.771 ± 1.643
LeuLys: 6.771 ± 1.417
LeuLeu: 5.848 ± 1.35
LeuMet: 2.77 ± 0.85
LeuAsn: 3.693 ± 1.033
LeuPro: 1.847 ± 0.598
LeuGln: 1.847 ± 0.638
LeuArg: 2.77 ± 0.654
LeuSer: 6.464 ± 1.464
LeuThr: 6.771 ± 1.719
LeuVal: 4.309 ± 0.962
LeuTrp: 0.616 ± 0.431
LeuTyr: 1.847 ± 0.643
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.078 ± 0.778
MetCys: 0.616 ± 0.314
MetAsp: 2.155 ± 0.568
MetGlu: 0.616 ± 0.413
MetPhe: 1.539 ± 0.734
MetGly: 3.078 ± 1.063
MetHis: 0.308 ± 0.31
MetIle: 2.155 ± 1.162
MetLys: 2.155 ± 0.64
MetLeu: 1.847 ± 0.759
MetMet: 0.0 ± 0.0
MetAsn: 0.923 ± 0.486
MetPro: 1.539 ± 0.637
MetGln: 0.923 ± 0.479
MetArg: 1.539 ± 0.901
MetSer: 1.847 ± 0.808
MetThr: 1.847 ± 0.618
MetVal: 1.231 ± 0.536
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.617 ± 1.472
AsnCys: 0.308 ± 0.206
AsnAsp: 2.77 ± 1.116
AsnGlu: 2.462 ± 0.718
AsnPhe: 1.539 ± 0.574
AsnGly: 4.309 ± 1.396
AsnHis: 0.616 ± 0.413
AsnIle: 3.078 ± 0.851
AsnLys: 1.539 ± 0.501
AsnLeu: 1.847 ± 0.622
AsnMet: 3.078 ± 0.657
AsnAsn: 3.386 ± 1.227
AsnPro: 3.386 ± 1.019
AsnGln: 3.693 ± 0.946
AsnArg: 2.77 ± 0.843
AsnSer: 2.77 ± 0.865
AsnThr: 2.462 ± 1.024
AsnVal: 2.77 ± 0.914
AsnTrp: 1.231 ± 0.605
AsnTyr: 0.308 ± 0.329
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.462 ± 0.76
ProCys: 0.308 ± 0.301
ProAsp: 3.693 ± 0.907
ProGlu: 2.155 ± 0.738
ProPhe: 0.923 ± 0.529
ProGly: 0.0 ± 0.0
ProHis: 0.616 ± 0.413
ProIle: 3.386 ± 1.011
ProLys: 1.231 ± 0.699
ProLeu: 4.001 ± 0.921
ProMet: 1.539 ± 0.746
ProAsn: 4.001 ± 0.72
ProPro: 1.539 ± 0.51
ProGln: 2.77 ± 0.655
ProArg: 1.847 ± 0.666
ProSer: 2.155 ± 0.761
ProThr: 1.847 ± 0.592
ProVal: 3.386 ± 1.106
ProTrp: 0.616 ± 0.438
ProTyr: 0.616 ± 0.368
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.309 ± 0.88
GlnCys: 0.0 ± 0.0
GlnAsp: 2.155 ± 0.837
GlnGlu: 1.231 ± 0.749
GlnPhe: 1.847 ± 0.8
GlnGly: 4.001 ± 0.788
GlnHis: 0.0 ± 0.0
GlnIle: 2.155 ± 0.715
GlnLys: 2.462 ± 1.012
GlnLeu: 5.232 ± 1.186
GlnMet: 1.231 ± 0.498
GlnAsn: 2.155 ± 0.703
GlnPro: 1.231 ± 0.495
GlnGln: 1.847 ± 0.567
GlnArg: 2.462 ± 0.63
GlnSer: 2.77 ± 0.928
GlnThr: 2.462 ± 0.719
GlnVal: 3.386 ± 1.06
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.078 ± 0.82
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.001 ± 1.054
ArgCys: 1.231 ± 0.467
ArgAsp: 0.616 ± 0.417
ArgGlu: 3.386 ± 1.291
ArgPhe: 3.693 ± 0.994
ArgGly: 5.54 ± 1.263
ArgHis: 0.616 ± 0.378
ArgIle: 4.309 ± 1.039
ArgLys: 4.617 ± 1.189
ArgLeu: 5.232 ± 1.37
ArgMet: 0.616 ± 0.519
ArgAsn: 2.77 ± 1.121
ArgPro: 1.847 ± 0.548
ArgGln: 0.923 ± 0.701
ArgArg: 3.693 ± 1.067
ArgSer: 1.847 ± 0.587
ArgThr: 1.231 ± 0.454
ArgVal: 4.309 ± 1.169
ArgTrp: 0.923 ± 0.66
ArgTyr: 1.539 ± 0.602
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.695 ± 1.51
SerCys: 0.308 ± 0.243
SerAsp: 3.693 ± 1.067
SerGlu: 4.001 ± 1.07
SerPhe: 1.539 ± 0.637
SerGly: 7.079 ± 1.737
SerHis: 0.308 ± 0.206
SerIle: 3.078 ± 1.092
SerLys: 5.232 ± 0.881
SerLeu: 3.078 ± 0.918
SerMet: 1.847 ± 0.727
SerAsn: 4.617 ± 0.854
SerPro: 2.77 ± 0.762
SerGln: 3.386 ± 0.977
SerArg: 2.155 ± 0.999
SerSer: 3.693 ± 0.986
SerThr: 3.078 ± 1.201
SerVal: 3.386 ± 0.996
SerTrp: 0.308 ± 0.329
SerTyr: 1.847 ± 0.502
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.001 ± 1.217
ThrCys: 0.308 ± 0.243
ThrAsp: 1.539 ± 0.746
ThrGlu: 3.386 ± 1.184
ThrPhe: 2.462 ± 0.951
ThrGly: 4.001 ± 1.039
ThrHis: 0.308 ± 0.306
ThrIle: 3.693 ± 0.877
ThrLys: 4.309 ± 1.033
ThrLeu: 5.848 ± 1.4
ThrMet: 1.231 ± 0.593
ThrAsn: 2.77 ± 0.82
ThrPro: 3.078 ± 0.923
ThrGln: 3.386 ± 1.17
ThrArg: 3.078 ± 0.918
ThrSer: 5.54 ± 1.543
ThrThr: 3.693 ± 1.054
ThrVal: 4.001 ± 1.162
ThrTrp: 1.231 ± 0.543
ThrTyr: 1.539 ± 0.631
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.156 ± 1.028
ValCys: 0.0 ± 0.0
ValAsp: 3.078 ± 0.895
ValGlu: 4.617 ± 1.055
ValPhe: 1.539 ± 0.61
ValGly: 6.771 ± 1.638
ValHis: 0.616 ± 0.314
ValIle: 3.078 ± 0.985
ValLys: 3.693 ± 1.018
ValLeu: 6.464 ± 1.664
ValMet: 1.847 ± 0.636
ValAsn: 3.386 ± 0.927
ValPro: 3.693 ± 1.087
ValGln: 1.539 ± 0.628
ValArg: 3.078 ± 0.892
ValSer: 5.232 ± 1.561
ValThr: 6.771 ± 1.897
ValVal: 4.309 ± 1.168
ValTrp: 0.308 ± 0.293
ValTyr: 1.539 ± 0.585
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.539 ± 0.554
TrpCys: 0.0 ± 0.0
TrpAsp: 0.923 ± 0.4
TrpGlu: 1.231 ± 0.452
TrpPhe: 0.308 ± 0.293
TrpGly: 0.923 ± 0.372
TrpHis: 0.0 ± 0.0
TrpIle: 0.308 ± 0.329
TrpLys: 1.231 ± 0.533
TrpLeu: 0.616 ± 0.411
TrpMet: 0.308 ± 0.31
TrpAsn: 0.616 ± 0.486
TrpPro: 0.0 ± 0.0
TrpGln: 0.616 ± 0.355
TrpArg: 1.231 ± 0.521
TrpSer: 0.616 ± 0.413
TrpThr: 1.231 ± 0.556
TrpVal: 1.231 ± 0.566
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.231 ± 0.544
TyrCys: 0.308 ± 0.329
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.231 ± 0.476
TyrPhe: 1.539 ± 0.717
TyrGly: 1.231 ± 0.614
TyrHis: 1.539 ± 0.732
TyrIle: 2.155 ± 0.737
TyrLys: 1.539 ± 0.684
TyrLeu: 2.462 ± 0.628
TyrMet: 0.308 ± 0.341
TyrAsn: 0.923 ± 0.431
TyrPro: 0.308 ± 0.32
TyrGln: 2.77 ± 1.35
TyrArg: 0.923 ± 0.567
TyrSer: 2.77 ± 0.691
TyrThr: 1.539 ± 0.529
TyrVal: 3.693 ± 0.939
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.616 ± 0.355
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 23 proteins (3250 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
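A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the reported error is taken to be the standard deviation across resamples. The function names, the one-letter sequence input, and the fixed seed are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates; the source does not state which statistic it uses.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage sequences.
toy_proteome = ["AAGA", "GAAG", "AGGA", "GGGA"]
toy_error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal pair structure intact, so the spread reflects variation between proteins.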

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski