Amino acid dipepetide frequency for Vibrio phage Rostov 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.857AlaAla: 8.857 ± 1.403
0.562AlaCys: 0.562 ± 0.344
5.624AlaAsp: 5.624 ± 1.036
6.045AlaGlu: 6.045 ± 1.56
2.249AlaPhe: 2.249 ± 0.381
6.889AlaGly: 6.889 ± 1.29
0.984AlaHis: 0.984 ± 0.39
5.905AlaIle: 5.905 ± 0.952
5.764AlaLys: 5.764 ± 1.871
7.873AlaLeu: 7.873 ± 0.834
2.952AlaMet: 2.952 ± 0.523
3.234AlaAsn: 3.234 ± 0.47
3.093AlaPro: 3.093 ± 0.534
3.374AlaGln: 3.374 ± 0.816
3.936AlaArg: 3.936 ± 0.694
7.029AlaSer: 7.029 ± 0.986
5.905AlaThr: 5.905 ± 0.759
6.186AlaVal: 6.186 ± 0.872
1.125AlaTrp: 1.125 ± 0.342
2.671AlaTyr: 2.671 ± 0.749
0.0AlaXaa: 0.0 ± 0.0
Cys
0.422CysAla: 0.422 ± 0.186
0.422CysCys: 0.422 ± 0.203
0.281CysAsp: 0.281 ± 0.188
0.703CysGlu: 0.703 ± 0.241
0.562CysPhe: 0.562 ± 0.226
0.984CysGly: 0.984 ± 0.29
0.281CysHis: 0.281 ± 0.191
0.844CysIle: 0.844 ± 0.268
0.281CysLys: 0.281 ± 0.173
0.844CysLeu: 0.844 ± 0.304
0.0CysMet: 0.0 ± 0.0
0.281CysAsn: 0.281 ± 0.191
0.281CysPro: 0.281 ± 0.213
0.422CysGln: 0.422 ± 0.221
0.422CysArg: 0.422 ± 0.257
0.141CysSer: 0.141 ± 0.128
1.125CysThr: 1.125 ± 0.473
0.281CysVal: 0.281 ± 0.208
0.141CysTrp: 0.141 ± 0.128
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.061AspAla: 5.061 ± 0.85
0.984AspCys: 0.984 ± 0.43
3.374AspAsp: 3.374 ± 0.628
4.499AspGlu: 4.499 ± 1.066
2.39AspPhe: 2.39 ± 0.625
6.608AspGly: 6.608 ± 1.686
0.422AspHis: 0.422 ± 0.27
3.796AspIle: 3.796 ± 0.696
4.218AspLys: 4.218 ± 0.998
5.061AspLeu: 5.061 ± 0.928
1.546AspMet: 1.546 ± 0.548
1.968AspAsn: 1.968 ± 0.499
3.093AspPro: 3.093 ± 0.656
0.984AspGln: 0.984 ± 0.396
2.812AspArg: 2.812 ± 0.868
5.061AspSer: 5.061 ± 0.744
3.374AspThr: 3.374 ± 0.864
4.639AspVal: 4.639 ± 0.712
1.828AspTrp: 1.828 ± 0.519
1.406AspTyr: 1.406 ± 0.5
0.0AspXaa: 0.0 ± 0.0
Glu
4.639GluAla: 4.639 ± 0.89
0.422GluCys: 0.422 ± 0.221
3.796GluAsp: 3.796 ± 0.629
5.342GluGlu: 5.342 ± 1.045
2.109GluPhe: 2.109 ± 0.467
3.936GluGly: 3.936 ± 0.866
1.265GluHis: 1.265 ± 0.523
2.109GluIle: 2.109 ± 0.63
2.952GluLys: 2.952 ± 0.866
4.921GluLeu: 4.921 ± 0.947
1.546GluMet: 1.546 ± 0.416
1.968GluAsn: 1.968 ± 0.829
1.406GluPro: 1.406 ± 0.402
3.374GluGln: 3.374 ± 0.741
3.515GluArg: 3.515 ± 0.979
2.952GluSer: 2.952 ± 0.54
2.109GluThr: 2.109 ± 0.454
4.921GluVal: 4.921 ± 0.828
1.687GluTrp: 1.687 ± 0.523
2.109GluTyr: 2.109 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
2.39PheAla: 2.39 ± 0.715
0.281PheCys: 0.281 ± 0.197
2.671PheAsp: 2.671 ± 0.514
1.828PheGlu: 1.828 ± 0.581
0.703PhePhe: 0.703 ± 0.411
4.077PheGly: 4.077 ± 1.026
0.141PheHis: 0.141 ± 0.134
1.687PheIle: 1.687 ± 0.409
1.265PheLys: 1.265 ± 0.751
2.671PheLeu: 2.671 ± 0.653
0.703PheMet: 0.703 ± 0.438
3.234PheAsn: 3.234 ± 0.573
1.546PhePro: 1.546 ± 0.438
1.828PheGln: 1.828 ± 0.653
1.968PheArg: 1.968 ± 0.674
2.39PheSer: 2.39 ± 0.786
2.531PheThr: 2.531 ± 0.625
2.671PheVal: 2.671 ± 0.806
0.141PheTrp: 0.141 ± 0.128
0.844PheTyr: 0.844 ± 0.31
0.0PheXaa: 0.0 ± 0.0
Gly
7.311GlyAla: 7.311 ± 1.366
0.703GlyCys: 0.703 ± 0.451
4.921GlyAsp: 4.921 ± 0.934
2.812GlyGlu: 2.812 ± 0.37
3.234GlyPhe: 3.234 ± 0.621
12.512GlyGly: 12.512 ± 5.42
1.546GlyHis: 1.546 ± 0.522
4.499GlyIle: 4.499 ± 0.671
5.342GlyLys: 5.342 ± 0.611
5.342GlyLeu: 5.342 ± 0.837
2.952GlyMet: 2.952 ± 0.886
8.295GlyAsn: 8.295 ± 2.251
2.531GlyPro: 2.531 ± 0.436
3.515GlyGln: 3.515 ± 0.439
3.234GlyArg: 3.234 ± 0.533
6.889GlySer: 6.889 ± 1.518
6.326GlyThr: 6.326 ± 1.109
5.905GlyVal: 5.905 ± 1.184
1.406GlyTrp: 1.406 ± 0.313
4.077GlyTyr: 4.077 ± 0.657
0.0GlyXaa: 0.0 ± 0.0
His
0.844HisAla: 0.844 ± 0.393
0.141HisCys: 0.141 ± 0.128
1.125HisAsp: 1.125 ± 0.289
0.844HisGlu: 0.844 ± 0.341
0.281HisPhe: 0.281 ± 0.205
0.984HisGly: 0.984 ± 0.318
0.141HisHis: 0.141 ± 0.128
1.265HisIle: 1.265 ± 0.53
0.422HisLys: 0.422 ± 0.316
1.546HisLeu: 1.546 ± 0.472
0.562HisMet: 0.562 ± 0.286
0.703HisAsn: 0.703 ± 0.211
0.703HisPro: 0.703 ± 0.356
0.422HisGln: 0.422 ± 0.218
1.125HisArg: 1.125 ± 0.429
1.265HisSer: 1.265 ± 0.405
0.844HisThr: 0.844 ± 0.303
1.265HisVal: 1.265 ± 0.358
0.0HisTrp: 0.0 ± 0.0
0.562HisTyr: 0.562 ± 0.389
0.0HisXaa: 0.0 ± 0.0
Ile
6.608IleAla: 6.608 ± 0.949
0.281IleCys: 0.281 ± 0.169
4.921IleAsp: 4.921 ± 0.926
2.812IleGlu: 2.812 ± 0.571
1.125IlePhe: 1.125 ± 0.481
4.218IleGly: 4.218 ± 0.757
0.281IleHis: 0.281 ± 0.185
1.968IleIle: 1.968 ± 0.431
2.249IleLys: 2.249 ± 0.511
2.952IleLeu: 2.952 ± 0.572
1.687IleMet: 1.687 ± 0.514
2.812IleAsn: 2.812 ± 0.702
3.374IlePro: 3.374 ± 0.451
2.39IleGln: 2.39 ± 0.562
3.234IleArg: 3.234 ± 0.602
3.655IleSer: 3.655 ± 0.957
2.39IleThr: 2.39 ± 0.657
3.374IleVal: 3.374 ± 0.593
1.546IleTrp: 1.546 ± 0.464
0.844IleTyr: 0.844 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
5.905LysAla: 5.905 ± 1.402
0.141LysCys: 0.141 ± 0.134
1.968LysAsp: 1.968 ± 0.583
2.952LysGlu: 2.952 ± 1.01
1.265LysPhe: 1.265 ± 0.477
6.186LysGly: 6.186 ± 1.166
1.125LysHis: 1.125 ± 0.355
2.249LysIle: 2.249 ± 0.402
4.218LysLys: 4.218 ± 1.032
3.936LysLeu: 3.936 ± 0.596
2.39LysMet: 2.39 ± 0.754
2.39LysAsn: 2.39 ± 0.576
2.109LysPro: 2.109 ± 0.577
1.406LysGln: 1.406 ± 0.468
2.39LysArg: 2.39 ± 0.644
3.515LysSer: 3.515 ± 0.788
2.109LysThr: 2.109 ± 0.762
3.796LysVal: 3.796 ± 0.698
1.265LysTrp: 1.265 ± 0.453
2.952LysTyr: 2.952 ± 0.762
0.0LysXaa: 0.0 ± 0.0
Leu
5.342LeuAla: 5.342 ± 0.701
0.422LeuCys: 0.422 ± 0.29
4.077LeuAsp: 4.077 ± 0.739
4.077LeuGlu: 4.077 ± 0.911
1.687LeuPhe: 1.687 ± 0.637
6.748LeuGly: 6.748 ± 1.137
1.828LeuHis: 1.828 ± 0.553
3.655LeuIle: 3.655 ± 0.787
3.234LeuLys: 3.234 ± 0.745
3.093LeuLeu: 3.093 ± 0.513
3.093LeuMet: 3.093 ± 0.788
3.515LeuAsn: 3.515 ± 0.836
4.358LeuPro: 4.358 ± 0.851
2.812LeuGln: 2.812 ± 0.469
4.077LeuArg: 4.077 ± 0.76
4.639LeuSer: 4.639 ± 0.639
4.499LeuThr: 4.499 ± 0.703
4.921LeuVal: 4.921 ± 1.009
1.406LeuTrp: 1.406 ± 0.551
1.125LeuTyr: 1.125 ± 0.32
0.0LeuXaa: 0.0 ± 0.0
Met
3.234MetAla: 3.234 ± 1.282
0.281MetCys: 0.281 ± 0.179
1.687MetAsp: 1.687 ± 0.313
2.249MetGlu: 2.249 ± 0.55
0.984MetPhe: 0.984 ± 0.363
1.828MetGly: 1.828 ± 0.553
0.703MetHis: 0.703 ± 0.303
1.265MetIle: 1.265 ± 0.339
1.828MetLys: 1.828 ± 0.387
1.687MetLeu: 1.687 ± 0.538
0.984MetMet: 0.984 ± 0.495
1.406MetAsn: 1.406 ± 0.438
1.687MetPro: 1.687 ± 0.468
0.844MetGln: 0.844 ± 0.306
1.968MetArg: 1.968 ± 0.511
1.828MetSer: 1.828 ± 0.621
1.406MetThr: 1.406 ± 0.417
2.109MetVal: 2.109 ± 0.642
0.141MetTrp: 0.141 ± 0.146
1.265MetTyr: 1.265 ± 0.433
0.0MetXaa: 0.0 ± 0.0
Asn
5.905AsnAla: 5.905 ± 1.155
0.562AsnCys: 0.562 ± 0.246
2.531AsnAsp: 2.531 ± 0.749
1.687AsnGlu: 1.687 ± 0.42
1.968AsnPhe: 1.968 ± 0.309
7.029AsnGly: 7.029 ± 1.896
0.422AsnHis: 0.422 ± 0.214
2.671AsnIle: 2.671 ± 0.516
3.234AsnLys: 3.234 ± 0.442
2.249AsnLeu: 2.249 ± 0.49
0.984AsnMet: 0.984 ± 0.342
2.812AsnAsn: 2.812 ± 0.712
2.109AsnPro: 2.109 ± 0.626
1.968AsnGln: 1.968 ± 0.438
2.952AsnArg: 2.952 ± 0.59
3.093AsnSer: 3.093 ± 0.868
3.234AsnThr: 3.234 ± 0.681
2.249AsnVal: 2.249 ± 0.467
1.828AsnTrp: 1.828 ± 0.755
1.687AsnTyr: 1.687 ± 0.454
0.0AsnXaa: 0.0 ± 0.0
Pro
4.358ProAla: 4.358 ± 0.735
0.281ProCys: 0.281 ± 0.169
3.655ProAsp: 3.655 ± 0.532
2.531ProGlu: 2.531 ± 0.627
1.687ProPhe: 1.687 ± 0.335
0.281ProGly: 0.281 ± 0.187
0.703ProHis: 0.703 ± 0.329
2.531ProIle: 2.531 ± 0.476
1.687ProLys: 1.687 ± 0.394
2.249ProLeu: 2.249 ± 0.708
1.406ProMet: 1.406 ± 0.57
3.234ProAsn: 3.234 ± 0.61
1.828ProPro: 1.828 ± 0.832
1.125ProGln: 1.125 ± 0.264
2.39ProArg: 2.39 ± 0.63
3.796ProSer: 3.796 ± 0.69
3.515ProThr: 3.515 ± 0.7
2.531ProVal: 2.531 ± 0.764
0.703ProTrp: 0.703 ± 0.353
1.406ProTyr: 1.406 ± 0.745
0.0ProXaa: 0.0 ± 0.0
Gln
3.374GlnAla: 3.374 ± 0.781
0.141GlnCys: 0.141 ± 0.128
2.109GlnAsp: 2.109 ± 0.491
2.39GlnGlu: 2.39 ± 0.626
1.125GlnPhe: 1.125 ± 0.37
2.952GlnGly: 2.952 ± 0.53
0.562GlnHis: 0.562 ± 0.275
2.249GlnIle: 2.249 ± 0.506
1.546GlnLys: 1.546 ± 0.36
2.531GlnLeu: 2.531 ± 0.557
1.125GlnMet: 1.125 ± 0.521
1.265GlnAsn: 1.265 ± 0.497
1.546GlnPro: 1.546 ± 0.454
1.546GlnGln: 1.546 ± 0.502
2.39GlnArg: 2.39 ± 0.793
2.109GlnSer: 2.109 ± 0.424
3.093GlnThr: 3.093 ± 0.51
1.968GlnVal: 1.968 ± 0.555
1.406GlnTrp: 1.406 ± 0.329
1.125GlnTyr: 1.125 ± 0.452
0.0GlnXaa: 0.0 ± 0.0
Arg
4.921ArgAla: 4.921 ± 0.794
0.562ArgCys: 0.562 ± 0.266
2.812ArgAsp: 2.812 ± 0.481
2.531ArgGlu: 2.531 ± 0.442
3.234ArgPhe: 3.234 ± 0.942
4.218ArgGly: 4.218 ± 1.103
0.141ArgHis: 0.141 ± 0.123
2.671ArgIle: 2.671 ± 0.505
3.515ArgLys: 3.515 ± 0.817
4.077ArgLeu: 4.077 ± 0.918
2.531ArgMet: 2.531 ± 0.523
3.515ArgAsn: 3.515 ± 0.905
1.968ArgPro: 1.968 ± 0.515
1.265ArgGln: 1.265 ± 0.37
2.812ArgArg: 2.812 ± 0.425
3.093ArgSer: 3.093 ± 0.857
2.812ArgThr: 2.812 ± 0.677
2.671ArgVal: 2.671 ± 0.56
1.687ArgTrp: 1.687 ± 0.579
2.39ArgTyr: 2.39 ± 0.474
0.0ArgXaa: 0.0 ± 0.0
Ser
5.202SerAla: 5.202 ± 1.005
0.141SerCys: 0.141 ± 0.123
4.921SerAsp: 4.921 ± 0.863
3.796SerGlu: 3.796 ± 0.668
2.952SerPhe: 2.952 ± 0.495
7.029SerGly: 7.029 ± 1.833
0.984SerHis: 0.984 ± 0.263
3.515SerIle: 3.515 ± 0.848
3.655SerLys: 3.655 ± 1.148
4.218SerLeu: 4.218 ± 0.646
1.687SerMet: 1.687 ± 0.484
4.358SerAsn: 4.358 ± 1.163
3.655SerPro: 3.655 ± 0.67
2.109SerGln: 2.109 ± 0.336
3.515SerArg: 3.515 ± 0.536
4.218SerSer: 4.218 ± 1.074
3.093SerThr: 3.093 ± 0.842
4.499SerVal: 4.499 ± 0.872
0.984SerTrp: 0.984 ± 0.31
2.671SerTyr: 2.671 ± 0.651
0.0SerXaa: 0.0 ± 0.0
Thr
5.764ThrAla: 5.764 ± 1.04
0.422ThrCys: 0.422 ± 0.237
3.655ThrAsp: 3.655 ± 0.747
3.093ThrGlu: 3.093 ± 0.533
3.234ThrPhe: 3.234 ± 0.606
6.748ThrGly: 6.748 ± 0.932
0.562ThrHis: 0.562 ± 0.236
3.234ThrIle: 3.234 ± 0.531
2.531ThrLys: 2.531 ± 0.63
4.499ThrLeu: 4.499 ± 0.691
0.984ThrMet: 0.984 ± 0.402
1.828ThrAsn: 1.828 ± 0.527
3.093ThrPro: 3.093 ± 0.585
1.968ThrGln: 1.968 ± 0.515
2.671ThrArg: 2.671 ± 0.44
3.655ThrSer: 3.655 ± 0.689
2.812ThrThr: 2.812 ± 0.677
4.218ThrVal: 4.218 ± 1.024
1.265ThrTrp: 1.265 ± 0.83
1.546ThrTyr: 1.546 ± 0.378
0.0ThrXaa: 0.0 ± 0.0
Val
5.061ValAla: 5.061 ± 0.809
0.844ValCys: 0.844 ± 0.312
4.639ValAsp: 4.639 ± 0.901
3.936ValGlu: 3.936 ± 0.605
2.39ValPhe: 2.39 ± 0.675
7.17ValGly: 7.17 ± 1.23
2.249ValHis: 2.249 ± 0.579
2.952ValIle: 2.952 ± 0.857
3.093ValLys: 3.093 ± 0.889
5.061ValLeu: 5.061 ± 0.672
1.687ValMet: 1.687 ± 0.424
1.828ValAsn: 1.828 ± 0.765
1.968ValPro: 1.968 ± 0.435
2.531ValGln: 2.531 ± 0.803
4.358ValArg: 4.358 ± 0.905
4.077ValSer: 4.077 ± 1.251
3.936ValThr: 3.936 ± 1.114
5.905ValVal: 5.905 ± 0.909
1.406ValTrp: 1.406 ± 0.46
3.374ValTyr: 3.374 ± 0.747
0.0ValXaa: 0.0 ± 0.0
Trp
1.546TrpAla: 1.546 ± 0.294
0.422TrpCys: 0.422 ± 0.207
1.546TrpAsp: 1.546 ± 0.477
1.265TrpGlu: 1.265 ± 0.48
1.265TrpPhe: 1.265 ± 0.647
0.703TrpGly: 0.703 ± 0.249
0.281TrpHis: 0.281 ± 0.203
1.406TrpIle: 1.406 ± 0.351
0.844TrpLys: 0.844 ± 0.309
1.406TrpLeu: 1.406 ± 0.604
0.281TrpMet: 0.281 ± 0.2
0.422TrpAsn: 0.422 ± 0.221
0.281TrpPro: 0.281 ± 0.246
0.844TrpGln: 0.844 ± 0.314
1.546TrpArg: 1.546 ± 0.461
2.109TrpSer: 2.109 ± 0.561
1.406TrpThr: 1.406 ± 0.285
1.828TrpVal: 1.828 ± 0.466
0.422TrpTrp: 0.422 ± 0.208
0.984TrpTyr: 0.984 ± 0.306
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.515TyrAla: 3.515 ± 0.726
0.703TyrCys: 0.703 ± 0.349
2.812TyrAsp: 2.812 ± 0.475
1.687TyrGlu: 1.687 ± 0.587
1.125TyrPhe: 1.125 ± 0.318
2.249TyrGly: 2.249 ± 0.485
0.562TyrHis: 0.562 ± 0.22
2.249TyrIle: 2.249 ± 0.548
2.39TyrLys: 2.39 ± 0.774
2.39TyrLeu: 2.39 ± 0.641
0.141TyrMet: 0.141 ± 0.143
2.109TyrAsn: 2.109 ± 0.652
1.265TyrPro: 1.265 ± 0.354
1.828TyrGln: 1.828 ± 0.437
2.109TyrArg: 2.109 ± 0.539
1.687TyrSer: 1.687 ± 0.561
1.265TyrThr: 1.265 ± 0.389
2.531TyrVal: 2.531 ± 0.535
0.422TyrTrp: 0.422 ± 0.29
1.546TyrTyr: 1.546 ± 0.446
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (7114 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski