Amino acid dipepetide frequency for Pseudomonas phage phi13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.152AlaAla: 14.152 ± 3.2
0.801AlaCys: 0.801 ± 0.452
5.874AlaAsp: 5.874 ± 1.104
5.073AlaGlu: 5.073 ± 1.355
3.471AlaPhe: 3.471 ± 0.999
8.278AlaGly: 8.278 ± 2.666
1.068AlaHis: 1.068 ± 0.521
7.477AlaIle: 7.477 ± 1.883
3.204AlaLys: 3.204 ± 1.176
12.283AlaLeu: 12.283 ± 1.939
3.471AlaMet: 3.471 ± 0.952
2.67AlaAsn: 2.67 ± 0.836
5.874AlaPro: 5.874 ± 1.733
3.471AlaGln: 3.471 ± 1.575
7.21AlaArg: 7.21 ± 1.398
7.744AlaSer: 7.744 ± 1.161
7.744AlaThr: 7.744 ± 1.626
11.215AlaVal: 11.215 ± 2.062
1.602AlaTrp: 1.602 ± 2.021
2.937AlaTyr: 2.937 ± 0.834
0.0AlaXaa: 0.0 ± 0.0
Cys
0.801CysAla: 0.801 ± 0.565
0.0CysCys: 0.0 ± 0.0
0.534CysAsp: 0.534 ± 0.372
0.267CysGlu: 0.267 ± 0.311
0.267CysPhe: 0.267 ± 0.365
0.801CysGly: 0.801 ± 0.514
0.267CysHis: 0.267 ± 0.311
0.0CysIle: 0.0 ± 0.0
0.267CysLys: 0.267 ± 0.329
0.267CysLeu: 0.267 ± 0.222
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.801CysPro: 0.801 ± 0.685
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.267CysThr: 0.267 ± 0.228
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.534CysTyr: 0.534 ± 0.456
0.0CysXaa: 0.0 ± 0.0
Asp
7.21AspAla: 7.21 ± 1.306
0.0AspCys: 0.0 ± 0.0
3.471AspAsp: 3.471 ± 1.322
5.874AspGlu: 5.874 ± 1.054
2.67AspPhe: 2.67 ± 0.713
4.005AspGly: 4.005 ± 0.847
2.403AspHis: 2.403 ± 0.94
2.403AspIle: 2.403 ± 1.064
2.937AspLys: 2.937 ± 1.226
5.073AspLeu: 5.073 ± 1.098
1.068AspMet: 1.068 ± 0.635
1.068AspAsn: 1.068 ± 0.509
3.204AspPro: 3.204 ± 0.926
0.534AspGln: 0.534 ± 0.457
2.136AspArg: 2.136 ± 0.542
3.471AspSer: 3.471 ± 0.926
2.937AspThr: 2.937 ± 0.874
4.539AspVal: 4.539 ± 0.936
1.068AspTrp: 1.068 ± 0.913
1.068AspTyr: 1.068 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
6.142GluAla: 6.142 ± 1.191
0.534GluCys: 0.534 ± 0.4
2.403GluAsp: 2.403 ± 0.594
2.937GluGlu: 2.937 ± 0.817
1.602GluPhe: 1.602 ± 0.742
2.403GluGly: 2.403 ± 0.775
0.801GluHis: 0.801 ± 0.556
2.403GluIle: 2.403 ± 0.692
1.335GluLys: 1.335 ± 0.553
5.874GluLeu: 5.874 ± 1.199
1.335GluMet: 1.335 ± 0.661
1.869GluAsn: 1.869 ± 0.714
1.602GluPro: 1.602 ± 0.989
2.136GluGln: 2.136 ± 0.932
2.937GluArg: 2.937 ± 1.513
3.204GluSer: 3.204 ± 0.706
2.67GluThr: 2.67 ± 0.623
2.937GluVal: 2.937 ± 1.231
0.801GluTrp: 0.801 ± 0.564
1.869GluTyr: 1.869 ± 0.619
0.0GluXaa: 0.0 ± 0.0
Phe
3.471PheAla: 3.471 ± 0.762
0.267PheCys: 0.267 ± 0.228
2.136PheAsp: 2.136 ± 0.724
1.869PheGlu: 1.869 ± 0.95
2.403PhePhe: 2.403 ± 0.977
3.471PheGly: 3.471 ± 0.876
0.267PheHis: 0.267 ± 0.222
3.204PheIle: 3.204 ± 1.235
1.068PheLys: 1.068 ± 0.627
3.471PheLeu: 3.471 ± 1.236
1.068PheMet: 1.068 ± 0.595
1.068PheAsn: 1.068 ± 0.409
2.67PhePro: 2.67 ± 0.472
0.534PheGln: 0.534 ± 0.33
1.869PheArg: 1.869 ± 0.974
2.136PheSer: 2.136 ± 0.818
1.335PheThr: 1.335 ± 0.513
3.204PheVal: 3.204 ± 0.789
0.534PheTrp: 0.534 ± 0.268
1.602PheTyr: 1.602 ± 0.791
0.0PheXaa: 0.0 ± 0.0
Gly
9.346GlyAla: 9.346 ± 2.45
1.068GlyCys: 1.068 ± 0.476
3.738GlyAsp: 3.738 ± 1.345
2.67GlyGlu: 2.67 ± 0.822
3.204GlyPhe: 3.204 ± 0.921
6.943GlyGly: 6.943 ± 2.224
1.869GlyHis: 1.869 ± 0.68
2.937GlyIle: 2.937 ± 0.597
3.471GlyLys: 3.471 ± 0.896
8.011GlyLeu: 8.011 ± 1.321
1.869GlyMet: 1.869 ± 0.591
1.068GlyAsn: 1.068 ± 0.464
3.471GlyPro: 3.471 ± 1.035
1.869GlyGln: 1.869 ± 0.838
4.272GlyArg: 4.272 ± 1.081
5.874GlySer: 5.874 ± 0.803
3.204GlyThr: 3.204 ± 0.984
4.272GlyVal: 4.272 ± 1.166
1.602GlyTrp: 1.602 ± 0.675
2.67GlyTyr: 2.67 ± 0.713
0.0GlyXaa: 0.0 ± 0.0
His
1.335HisAla: 1.335 ± 0.645
0.267HisCys: 0.267 ± 0.311
0.801HisAsp: 0.801 ± 0.452
2.136HisGlu: 2.136 ± 0.585
0.267HisPhe: 0.267 ± 0.228
0.267HisGly: 0.267 ± 0.228
1.068HisHis: 1.068 ± 0.655
0.801HisIle: 0.801 ± 0.409
0.267HisLys: 0.267 ± 0.228
2.136HisLeu: 2.136 ± 0.856
0.267HisMet: 0.267 ± 0.222
0.534HisAsn: 0.534 ± 0.291
0.267HisPro: 0.267 ± 0.255
0.534HisGln: 0.534 ± 0.364
0.801HisArg: 0.801 ± 0.524
0.801HisSer: 0.801 ± 0.375
1.869HisThr: 1.869 ± 0.889
2.937HisVal: 2.937 ± 0.929
0.0HisTrp: 0.0 ± 0.0
0.534HisTyr: 0.534 ± 0.364
0.0HisXaa: 0.0 ± 0.0
Ile
6.409IleAla: 6.409 ± 1.212
0.801IleCys: 0.801 ± 0.361
2.67IleAsp: 2.67 ± 0.578
2.403IleGlu: 2.403 ± 1.004
1.602IlePhe: 1.602 ± 0.405
3.738IleGly: 3.738 ± 0.843
0.534IleHis: 0.534 ± 0.658
2.136IleIle: 2.136 ± 1.616
1.602IleLys: 1.602 ± 1.026
4.272IleLeu: 4.272 ± 1.236
1.602IleMet: 1.602 ± 0.508
1.602IleAsn: 1.602 ± 0.678
4.539IlePro: 4.539 ± 1.605
0.534IleGln: 0.534 ± 0.312
2.67IleArg: 2.67 ± 1.248
3.738IleSer: 3.738 ± 0.969
4.272IleThr: 4.272 ± 1.334
3.204IleVal: 3.204 ± 1.571
0.801IleTrp: 0.801 ± 0.676
0.267IleTyr: 0.267 ± 0.27
0.0IleXaa: 0.0 ± 0.0
Lys
3.471LysAla: 3.471 ± 1.103
0.0LysCys: 0.0 ± 0.0
2.67LysAsp: 2.67 ± 0.765
1.068LysGlu: 1.068 ± 0.583
1.068LysPhe: 1.068 ± 0.348
2.67LysGly: 2.67 ± 0.798
0.0LysHis: 0.0 ± 0.0
1.869LysIle: 1.869 ± 1.271
4.272LysLys: 4.272 ± 1.505
4.539LysLeu: 4.539 ± 1.227
2.67LysMet: 2.67 ± 0.986
0.801LysAsn: 0.801 ± 0.627
2.136LysPro: 2.136 ± 1.182
1.335LysGln: 1.335 ± 0.822
2.403LysArg: 2.403 ± 1.262
3.204LysSer: 3.204 ± 0.678
1.068LysThr: 1.068 ± 0.462
3.471LysVal: 3.471 ± 1.333
0.534LysTrp: 0.534 ± 0.392
0.267LysTyr: 0.267 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
11.482LeuAla: 11.482 ± 1.486
0.0LeuCys: 0.0 ± 0.0
4.005LeuAsp: 4.005 ± 1.055
3.738LeuGlu: 3.738 ± 0.757
3.471LeuPhe: 3.471 ± 0.77
8.278LeuGly: 8.278 ± 1.426
1.869LeuHis: 1.869 ± 0.585
4.272LeuIle: 4.272 ± 1.064
3.471LeuLys: 3.471 ± 1.241
10.414LeuLeu: 10.414 ± 1.544
4.005LeuMet: 4.005 ± 1.318
2.937LeuAsn: 2.937 ± 1.556
4.806LeuPro: 4.806 ± 0.853
3.738LeuGln: 3.738 ± 1.148
5.874LeuArg: 5.874 ± 0.681
8.278LeuSer: 8.278 ± 1.641
6.409LeuThr: 6.409 ± 1.471
7.477LeuVal: 7.477 ± 1.645
1.068LeuTrp: 1.068 ± 0.737
2.67LeuTyr: 2.67 ± 0.89
0.0LeuXaa: 0.0 ± 0.0
Met
2.136MetAla: 2.136 ± 0.864
0.0MetCys: 0.0 ± 0.0
1.869MetAsp: 1.869 ± 0.765
1.335MetGlu: 1.335 ± 0.469
1.335MetPhe: 1.335 ± 0.578
2.937MetGly: 2.937 ± 0.805
0.267MetHis: 0.267 ± 0.222
1.869MetIle: 1.869 ± 0.665
1.335MetLys: 1.335 ± 0.541
5.073MetLeu: 5.073 ± 1.651
0.534MetMet: 0.534 ± 0.426
0.801MetAsn: 0.801 ± 0.474
0.801MetPro: 0.801 ± 0.466
1.335MetGln: 1.335 ± 0.538
1.335MetArg: 1.335 ± 0.441
3.204MetSer: 3.204 ± 0.662
2.937MetThr: 2.937 ± 1.02
1.335MetVal: 1.335 ± 0.541
0.534MetTrp: 0.534 ± 0.268
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.539AsnAla: 4.539 ± 0.91
0.0AsnCys: 0.0 ± 0.0
2.67AsnAsp: 2.67 ± 0.718
1.068AsnGlu: 1.068 ± 0.392
1.869AsnPhe: 1.869 ± 0.837
1.335AsnGly: 1.335 ± 0.64
0.534AsnHis: 0.534 ± 0.352
1.335AsnIle: 1.335 ± 0.904
1.335AsnLys: 1.335 ± 0.826
2.403AsnLeu: 2.403 ± 0.532
0.801AsnMet: 0.801 ± 0.468
3.204AsnAsn: 3.204 ± 1.374
2.136AsnPro: 2.136 ± 0.583
0.801AsnGln: 0.801 ± 0.316
1.335AsnArg: 1.335 ± 0.565
2.67AsnSer: 2.67 ± 1.043
0.801AsnThr: 0.801 ± 0.405
3.471AsnVal: 3.471 ± 1.001
0.534AsnTrp: 0.534 ± 0.322
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
7.477ProAla: 7.477 ± 1.745
0.534ProCys: 0.534 ± 0.37
3.471ProAsp: 3.471 ± 0.868
2.136ProGlu: 2.136 ± 0.671
1.602ProPhe: 1.602 ± 0.625
4.539ProGly: 4.539 ± 1.019
1.335ProHis: 1.335 ± 0.563
3.738ProIle: 3.738 ± 0.985
0.801ProLys: 0.801 ± 0.475
5.34ProLeu: 5.34 ± 1.45
1.335ProMet: 1.335 ± 0.496
1.335ProAsn: 1.335 ± 0.653
1.335ProPro: 1.335 ± 0.716
1.335ProGln: 1.335 ± 0.719
2.67ProArg: 2.67 ± 1.087
5.073ProSer: 5.073 ± 1.851
3.204ProThr: 3.204 ± 0.841
2.67ProVal: 2.67 ± 0.827
0.267ProTrp: 0.267 ± 0.228
0.801ProTyr: 0.801 ± 0.375
0.0ProXaa: 0.0 ± 0.0
Gln
3.738GlnAla: 3.738 ± 1.556
0.0GlnCys: 0.0 ± 0.0
1.335GlnAsp: 1.335 ± 0.692
1.335GlnGlu: 1.335 ± 0.472
0.801GlnPhe: 0.801 ± 0.456
2.136GlnGly: 2.136 ± 0.926
0.801GlnHis: 0.801 ± 0.408
1.335GlnIle: 1.335 ± 0.763
1.068GlnLys: 1.068 ± 0.388
2.937GlnLeu: 2.937 ± 0.933
1.068GlnMet: 1.068 ± 0.655
0.801GlnAsn: 0.801 ± 0.451
0.267GlnPro: 0.267 ± 0.228
0.801GlnGln: 0.801 ± 0.475
1.602GlnArg: 1.602 ± 0.697
2.136GlnSer: 2.136 ± 0.87
2.403GlnThr: 2.403 ± 0.885
1.335GlnVal: 1.335 ± 0.385
1.068GlnTrp: 1.068 ± 0.65
0.801GlnTyr: 0.801 ± 0.537
0.0GlnXaa: 0.0 ± 0.0
Arg
6.676ArgAla: 6.676 ± 1.61
0.267ArgCys: 0.267 ± 0.27
2.67ArgAsp: 2.67 ± 0.904
2.136ArgGlu: 2.136 ± 1.143
2.937ArgPhe: 2.937 ± 1.137
2.937ArgGly: 2.937 ± 0.814
1.068ArgHis: 1.068 ± 0.489
2.403ArgIle: 2.403 ± 0.722
1.602ArgLys: 1.602 ± 0.477
5.874ArgLeu: 5.874 ± 1.219
2.403ArgMet: 2.403 ± 0.778
3.204ArgAsn: 3.204 ± 0.736
2.136ArgPro: 2.136 ± 0.79
1.335ArgGln: 1.335 ± 0.537
3.471ArgArg: 3.471 ± 0.668
5.874ArgSer: 5.874 ± 0.84
2.136ArgThr: 2.136 ± 0.574
4.005ArgVal: 4.005 ± 1.022
0.267ArgTrp: 0.267 ± 0.271
1.068ArgTyr: 1.068 ± 0.854
0.0ArgXaa: 0.0 ± 0.0
Ser
7.21SerAla: 7.21 ± 1.435
0.0SerCys: 0.0 ± 0.0
5.874SerAsp: 5.874 ± 0.861
2.403SerGlu: 2.403 ± 0.791
3.738SerPhe: 3.738 ± 1.249
5.34SerGly: 5.34 ± 1.049
1.869SerHis: 1.869 ± 0.864
3.204SerIle: 3.204 ± 1.111
3.471SerLys: 3.471 ± 0.77
7.477SerLeu: 7.477 ± 1.415
1.602SerMet: 1.602 ± 0.988
3.471SerAsn: 3.471 ± 1.235
4.272SerPro: 4.272 ± 1.004
1.602SerGln: 1.602 ± 0.573
3.204SerArg: 3.204 ± 0.912
8.278SerSer: 8.278 ± 1.169
5.34SerThr: 5.34 ± 0.845
8.011SerVal: 8.011 ± 1.337
1.869SerTrp: 1.869 ± 0.699
2.67SerTyr: 2.67 ± 0.935
0.0SerXaa: 0.0 ± 0.0
Thr
7.21ThrAla: 7.21 ± 1.591
0.0ThrCys: 0.0 ± 0.0
3.471ThrAsp: 3.471 ± 1.024
3.204ThrGlu: 3.204 ± 1.01
2.937ThrPhe: 2.937 ± 0.817
5.34ThrGly: 5.34 ± 1.164
0.267ThrHis: 0.267 ± 0.255
2.67ThrIle: 2.67 ± 0.758
2.136ThrLys: 2.136 ± 0.808
4.272ThrLeu: 4.272 ± 1.121
1.869ThrMet: 1.869 ± 0.654
1.335ThrAsn: 1.335 ± 0.6
3.471ThrPro: 3.471 ± 1.298
2.136ThrGln: 2.136 ± 0.903
3.738ThrArg: 3.738 ± 0.868
4.539ThrSer: 4.539 ± 1.235
2.67ThrThr: 2.67 ± 0.611
5.34ThrVal: 5.34 ± 0.712
0.801ThrTrp: 0.801 ± 0.459
1.335ThrTyr: 1.335 ± 0.464
0.0ThrXaa: 0.0 ± 0.0
Val
9.613ValAla: 9.613 ± 2.215
0.534ValCys: 0.534 ± 0.621
4.272ValAsp: 4.272 ± 0.881
3.204ValGlu: 3.204 ± 0.69
1.869ValPhe: 1.869 ± 0.641
6.142ValGly: 6.142 ± 1.03
1.068ValHis: 1.068 ± 0.489
4.005ValIle: 4.005 ± 0.937
3.471ValLys: 3.471 ± 1.666
4.539ValLeu: 4.539 ± 0.978
2.937ValMet: 2.937 ± 0.577
2.136ValAsn: 2.136 ± 0.973
5.607ValPro: 5.607 ± 1.025
1.869ValGln: 1.869 ± 0.786
4.005ValArg: 4.005 ± 1.386
8.545ValSer: 8.545 ± 0.955
5.607ValThr: 5.607 ± 1.769
8.278ValVal: 8.278 ± 2.644
1.602ValTrp: 1.602 ± 0.805
1.869ValTyr: 1.869 ± 0.643
0.0ValXaa: 0.0 ± 0.0
Trp
1.602TrpAla: 1.602 ± 0.671
0.0TrpCys: 0.0 ± 0.0
0.801TrpAsp: 0.801 ± 0.354
1.068TrpGlu: 1.068 ± 0.341
0.267TrpPhe: 0.267 ± 0.271
0.801TrpGly: 0.801 ± 0.496
0.0TrpHis: 0.0 ± 0.0
1.068TrpIle: 1.068 ± 0.508
1.068TrpLys: 1.068 ± 0.686
1.602TrpLeu: 1.602 ± 0.626
0.267TrpMet: 0.267 ± 0.228
1.602TrpAsn: 1.602 ± 0.608
0.801TrpPro: 0.801 ± 0.442
1.068TrpGln: 1.068 ± 0.707
0.534TrpArg: 0.534 ± 0.307
0.534TrpSer: 0.534 ± 0.312
0.801TrpThr: 0.801 ± 0.586
0.534TrpVal: 0.534 ± 0.374
0.801TrpTrp: 0.801 ± 0.497
0.534TrpTyr: 0.534 ± 0.307
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.602TyrAla: 1.602 ± 0.491
0.0TyrCys: 0.0 ± 0.0
2.403TyrAsp: 2.403 ± 0.805
1.869TyrGlu: 1.869 ± 0.436
0.267TyrPhe: 0.267 ± 0.255
1.068TyrGly: 1.068 ± 0.532
0.534TyrHis: 0.534 ± 0.373
0.267TyrIle: 0.267 ± 0.3
1.335TyrLys: 1.335 ± 0.969
2.403TyrLeu: 2.403 ± 0.77
0.534TyrMet: 0.534 ± 0.312
1.335TyrAsn: 1.335 ± 0.469
0.801TyrPro: 0.801 ± 0.629
0.801TyrGln: 0.801 ± 0.375
2.403TyrArg: 2.403 ± 0.61
1.602TyrSer: 1.602 ± 0.75
1.068TyrThr: 1.068 ± 0.477
3.204TyrVal: 3.204 ± 0.732
0.0TyrTrp: 0.0 ± 0.0
1.068TyrTyr: 1.068 ± 0.388
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (3746 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski