Amino acid dipepetide frequency for Pseudomonas phage phiNN

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.785AlaAla: 10.785 ± 2.881
0.83AlaCys: 0.83 ± 0.394
7.19AlaAsp: 7.19 ± 1.87
4.425AlaGlu: 4.425 ± 0.68
4.148AlaPhe: 4.148 ± 0.99
6.084AlaGly: 6.084 ± 1.33
1.106AlaHis: 1.106 ± 0.575
6.914AlaIle: 6.914 ± 1.269
4.701AlaLys: 4.701 ± 1.193
11.892AlaLeu: 11.892 ± 1.952
2.765AlaMet: 2.765 ± 0.83
3.042AlaAsn: 3.042 ± 1.202
5.808AlaPro: 5.808 ± 1.145
3.042AlaGln: 3.042 ± 0.681
4.978AlaArg: 4.978 ± 1.444
7.467AlaSer: 7.467 ± 0.961
5.808AlaThr: 5.808 ± 1.069
11.892AlaVal: 11.892 ± 2.345
1.383AlaTrp: 1.383 ± 1.2
5.254AlaTyr: 5.254 ± 0.736
0.0AlaXaa: 0.0 ± 0.0
Cys
0.277CysAla: 0.277 ± 0.329
0.277CysCys: 0.277 ± 0.214
0.553CysAsp: 0.553 ± 0.337
0.553CysGlu: 0.553 ± 0.281
0.277CysPhe: 0.277 ± 0.214
0.277CysGly: 0.277 ± 0.306
0.553CysHis: 0.553 ± 0.361
0.553CysIle: 0.553 ± 0.365
0.0CysLys: 0.0 ± 0.0
0.277CysLeu: 0.277 ± 0.214
0.0CysMet: 0.0 ± 0.0
0.553CysAsn: 0.553 ± 0.275
0.277CysPro: 0.277 ± 0.198
0.0CysGln: 0.0 ± 0.0
0.553CysArg: 0.553 ± 0.337
0.277CysSer: 0.277 ± 0.329
0.277CysThr: 0.277 ± 0.306
0.553CysVal: 0.553 ± 0.258
0.277CysTrp: 0.277 ± 0.198
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.914AspAla: 6.914 ± 1.299
0.553AspCys: 0.553 ± 0.356
1.936AspAsp: 1.936 ± 0.87
3.595AspGlu: 3.595 ± 1.005
1.383AspPhe: 1.383 ± 0.419
2.489AspGly: 2.489 ± 0.659
1.659AspHis: 1.659 ± 0.613
2.489AspIle: 2.489 ± 0.858
0.83AspLys: 0.83 ± 0.431
5.808AspLeu: 5.808 ± 1.238
2.489AspMet: 2.489 ± 0.803
1.936AspAsn: 1.936 ± 0.768
3.319AspPro: 3.319 ± 0.737
1.659AspGln: 1.659 ± 0.492
2.212AspArg: 2.212 ± 0.84
4.978AspSer: 4.978 ± 0.788
4.425AspThr: 4.425 ± 1.334
5.254AspVal: 5.254 ± 1.335
0.83AspTrp: 0.83 ± 0.552
0.277AspTyr: 0.277 ± 0.238
0.0AspXaa: 0.0 ± 0.0
Glu
3.872GluAla: 3.872 ± 0.98
0.0GluCys: 0.0 ± 0.0
2.765GluAsp: 2.765 ± 0.657
3.319GluGlu: 3.319 ± 0.825
1.936GluPhe: 1.936 ± 0.677
3.042GluGly: 3.042 ± 0.941
0.553GluHis: 0.553 ± 0.258
3.042GluIle: 3.042 ± 1.085
1.383GluLys: 1.383 ± 0.491
7.743GluLeu: 7.743 ± 1.781
1.106GluMet: 1.106 ± 0.507
0.83GluAsn: 0.83 ± 0.781
3.042GluPro: 3.042 ± 0.59
1.383GluGln: 1.383 ± 0.503
2.489GluArg: 2.489 ± 0.813
2.212GluSer: 2.212 ± 0.875
3.042GluThr: 3.042 ± 0.997
4.978GluVal: 4.978 ± 1.227
0.553GluTrp: 0.553 ± 0.281
1.659GluTyr: 1.659 ± 0.544
0.0GluXaa: 0.0 ± 0.0
Phe
3.595PheAla: 3.595 ± 1.037
0.553PheCys: 0.553 ± 0.281
1.659PheAsp: 1.659 ± 0.49
1.936PheGlu: 1.936 ± 0.666
1.659PhePhe: 1.659 ± 0.545
2.489PheGly: 2.489 ± 1.0
0.553PheHis: 0.553 ± 0.258
2.765PheIle: 2.765 ± 0.736
2.489PheLys: 2.489 ± 0.545
4.425PheLeu: 4.425 ± 1.188
0.0PheMet: 0.0 ± 0.0
1.936PheAsn: 1.936 ± 0.653
2.212PhePro: 2.212 ± 0.747
0.83PheGln: 0.83 ± 0.282
1.383PheArg: 1.383 ± 0.638
3.319PheSer: 3.319 ± 1.204
3.319PheThr: 3.319 ± 0.937
1.936PheVal: 1.936 ± 0.915
0.553PheTrp: 0.553 ± 0.429
0.83PheTyr: 0.83 ± 0.594
0.0PheXaa: 0.0 ± 0.0
Gly
8.85GlyAla: 8.85 ± 1.828
0.0GlyCys: 0.0 ± 0.0
2.489GlyAsp: 2.489 ± 0.897
3.042GlyGlu: 3.042 ± 0.637
3.042GlyPhe: 3.042 ± 0.583
6.637GlyGly: 6.637 ± 1.637
0.553GlyHis: 0.553 ± 0.397
4.701GlyIle: 4.701 ± 0.894
2.765GlyLys: 2.765 ± 0.437
9.403GlyLeu: 9.403 ± 1.524
1.659GlyMet: 1.659 ± 0.722
4.148GlyAsn: 4.148 ± 1.161
1.106GlyPro: 1.106 ± 0.562
4.425GlyGln: 4.425 ± 1.083
4.978GlyArg: 4.978 ± 0.785
6.361GlySer: 6.361 ± 1.383
4.148GlyThr: 4.148 ± 0.714
4.425GlyVal: 4.425 ± 1.167
1.106GlyTrp: 1.106 ± 0.501
1.936GlyTyr: 1.936 ± 0.681
0.0GlyXaa: 0.0 ± 0.0
His
0.83HisAla: 0.83 ± 0.325
0.0HisCys: 0.0 ± 0.0
0.553HisAsp: 0.553 ± 0.258
1.106HisGlu: 1.106 ± 0.407
0.277HisPhe: 0.277 ± 0.198
0.553HisGly: 0.553 ± 0.397
0.553HisHis: 0.553 ± 0.258
1.383HisIle: 1.383 ± 0.677
0.83HisLys: 0.83 ± 0.362
1.936HisLeu: 1.936 ± 0.599
0.553HisMet: 0.553 ± 0.258
0.83HisAsn: 0.83 ± 0.401
0.277HisPro: 0.277 ± 0.318
0.83HisGln: 0.83 ± 0.282
1.106HisArg: 1.106 ± 0.728
0.277HisSer: 0.277 ± 0.263
1.106HisThr: 1.106 ± 0.568
2.212HisVal: 2.212 ± 0.683
0.0HisTrp: 0.0 ± 0.0
0.277HisTyr: 0.277 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
5.808IleAla: 5.808 ± 1.613
0.277IleCys: 0.277 ± 0.198
3.319IleAsp: 3.319 ± 1.128
3.595IleGlu: 3.595 ± 1.227
1.106IlePhe: 1.106 ± 0.439
6.361IleGly: 6.361 ± 1.32
0.83IleHis: 0.83 ± 0.282
2.212IleIle: 2.212 ± 1.465
2.489IleLys: 2.489 ± 1.298
3.319IleLeu: 3.319 ± 0.59
1.383IleMet: 1.383 ± 0.598
1.936IleAsn: 1.936 ± 0.345
3.042IlePro: 3.042 ± 0.768
0.553IleGln: 0.553 ± 0.476
3.042IleArg: 3.042 ± 0.606
4.148IleSer: 4.148 ± 0.791
2.765IleThr: 2.765 ± 0.89
4.148IleVal: 4.148 ± 1.386
1.106IleTrp: 1.106 ± 0.689
1.383IleTyr: 1.383 ± 0.8
0.0IleXaa: 0.0 ± 0.0
Lys
6.361LysAla: 6.361 ± 1.749
0.0LysCys: 0.0 ± 0.0
3.319LysAsp: 3.319 ± 0.836
4.148LysGlu: 4.148 ± 0.747
1.383LysPhe: 1.383 ± 0.881
3.595LysGly: 3.595 ± 0.552
0.277LysHis: 0.277 ± 0.313
2.489LysIle: 2.489 ± 0.938
3.319LysLys: 3.319 ± 1.93
3.042LysLeu: 3.042 ± 0.91
1.383LysMet: 1.383 ± 0.478
1.383LysAsn: 1.383 ± 0.439
1.936LysPro: 1.936 ± 0.93
1.383LysGln: 1.383 ± 0.545
1.936LysArg: 1.936 ± 0.723
2.489LysSer: 2.489 ± 0.55
2.212LysThr: 2.212 ± 0.889
3.872LysVal: 3.872 ± 0.736
0.553LysTrp: 0.553 ± 0.281
0.83LysTyr: 0.83 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
12.168LeuAla: 12.168 ± 1.408
0.553LeuCys: 0.553 ± 0.365
5.254LeuAsp: 5.254 ± 1.627
5.254LeuGlu: 5.254 ± 1.219
4.425LeuPhe: 4.425 ± 1.112
8.573LeuGly: 8.573 ± 1.132
1.383LeuHis: 1.383 ± 0.365
5.808LeuIle: 5.808 ± 2.015
6.361LeuLys: 6.361 ± 1.068
9.679LeuLeu: 9.679 ± 1.548
3.319LeuMet: 3.319 ± 0.905
2.765LeuAsn: 2.765 ± 0.748
4.978LeuPro: 4.978 ± 1.486
3.872LeuGln: 3.872 ± 0.637
6.084LeuArg: 6.084 ± 1.787
6.084LeuSer: 6.084 ± 1.132
4.701LeuThr: 4.701 ± 0.891
5.808LeuVal: 5.808 ± 1.571
0.83LeuTrp: 0.83 ± 0.511
2.765LeuTyr: 2.765 ± 1.11
0.0LeuXaa: 0.0 ± 0.0
Met
2.765MetAla: 2.765 ± 0.553
0.0MetCys: 0.0 ± 0.0
1.659MetAsp: 1.659 ± 0.941
0.553MetGlu: 0.553 ± 0.429
1.383MetPhe: 1.383 ± 0.396
3.319MetGly: 3.319 ± 0.906
0.553MetHis: 0.553 ± 0.258
1.936MetIle: 1.936 ± 0.527
0.83MetLys: 0.83 ± 0.437
3.042MetLeu: 3.042 ± 0.971
0.83MetMet: 0.83 ± 0.36
0.277MetAsn: 0.277 ± 0.198
2.489MetPro: 2.489 ± 0.998
0.83MetGln: 0.83 ± 0.282
0.553MetArg: 0.553 ± 0.365
2.489MetSer: 2.489 ± 0.631
1.383MetThr: 1.383 ± 0.54
1.383MetVal: 1.383 ± 0.677
0.277MetTrp: 0.277 ± 0.238
0.277MetTyr: 0.277 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
2.765AsnAla: 2.765 ± 0.976
0.0AsnCys: 0.0 ± 0.0
1.936AsnAsp: 1.936 ± 0.882
1.936AsnGlu: 1.936 ± 0.65
1.936AsnPhe: 1.936 ± 0.628
3.319AsnGly: 3.319 ± 0.652
0.553AsnHis: 0.553 ± 0.369
1.383AsnIle: 1.383 ± 0.422
2.212AsnLys: 2.212 ± 0.549
4.701AsnLeu: 4.701 ± 0.65
0.83AsnMet: 0.83 ± 0.455
1.383AsnAsn: 1.383 ± 0.394
2.212AsnPro: 2.212 ± 0.573
1.383AsnGln: 1.383 ± 0.599
1.659AsnArg: 1.659 ± 0.915
2.212AsnSer: 2.212 ± 0.396
1.936AsnThr: 1.936 ± 0.83
2.212AsnVal: 2.212 ± 0.722
0.277AsnTrp: 0.277 ± 0.329
0.83AsnTyr: 0.83 ± 0.432
0.0AsnXaa: 0.0 ± 0.0
Pro
6.084ProAla: 6.084 ± 1.569
0.0ProCys: 0.0 ± 0.0
2.489ProAsp: 2.489 ± 0.559
2.212ProGlu: 2.212 ± 0.593
2.489ProPhe: 2.489 ± 0.817
2.765ProGly: 2.765 ± 0.828
0.553ProHis: 0.553 ± 0.274
3.319ProIle: 3.319 ± 0.906
1.936ProLys: 1.936 ± 0.716
5.254ProLeu: 5.254 ± 0.738
1.106ProMet: 1.106 ± 0.285
1.936ProAsn: 1.936 ± 0.434
1.106ProPro: 1.106 ± 0.433
0.83ProGln: 0.83 ± 0.468
1.659ProArg: 1.659 ± 0.618
4.978ProSer: 4.978 ± 0.892
3.595ProThr: 3.595 ± 0.833
4.148ProVal: 4.148 ± 0.861
0.83ProTrp: 0.83 ± 0.455
1.383ProTyr: 1.383 ± 0.888
0.0ProXaa: 0.0 ± 0.0
Gln
3.872GlnAla: 3.872 ± 0.689
0.277GlnCys: 0.277 ± 0.306
1.383GlnAsp: 1.383 ± 0.493
0.553GlnGlu: 0.553 ± 0.332
1.106GlnPhe: 1.106 ± 0.327
2.212GlnGly: 2.212 ± 0.713
0.553GlnHis: 0.553 ± 0.34
1.936GlnIle: 1.936 ± 0.68
0.83GlnLys: 0.83 ± 0.766
4.701GlnLeu: 4.701 ± 0.75
1.659GlnMet: 1.659 ± 0.647
0.553GlnAsn: 0.553 ± 0.275
1.659GlnPro: 1.659 ± 0.523
0.553GlnGln: 0.553 ± 0.281
2.212GlnArg: 2.212 ± 0.922
1.659GlnSer: 1.659 ± 0.645
2.212GlnThr: 2.212 ± 0.621
1.383GlnVal: 1.383 ± 0.721
0.0GlnTrp: 0.0 ± 0.0
2.489GlnTyr: 2.489 ± 1.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.701ArgAla: 4.701 ± 0.822
0.553ArgCys: 0.553 ± 0.337
1.936ArgAsp: 1.936 ± 0.887
1.936ArgGlu: 1.936 ± 0.537
1.936ArgPhe: 1.936 ± 0.708
3.319ArgGly: 3.319 ± 1.067
0.83ArgHis: 0.83 ± 0.394
2.765ArgIle: 2.765 ± 0.983
2.765ArgLys: 2.765 ± 0.793
3.595ArgLeu: 3.595 ± 0.533
1.936ArgMet: 1.936 ± 0.695
2.489ArgAsn: 2.489 ± 0.724
2.212ArgPro: 2.212 ± 1.21
1.659ArgGln: 1.659 ± 0.755
3.319ArgArg: 3.319 ± 0.738
5.254ArgSer: 5.254 ± 0.931
4.148ArgThr: 4.148 ± 1.385
4.148ArgVal: 4.148 ± 1.045
0.553ArgTrp: 0.553 ± 0.316
1.106ArgTyr: 1.106 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
8.85SerAla: 8.85 ± 1.473
0.277SerCys: 0.277 ± 0.306
5.254SerAsp: 5.254 ± 0.746
2.489SerGlu: 2.489 ± 0.765
3.042SerPhe: 3.042 ± 1.016
6.637SerGly: 6.637 ± 1.93
0.83SerHis: 0.83 ± 0.455
1.659SerIle: 1.659 ± 0.915
4.148SerLys: 4.148 ± 1.14
5.808SerLeu: 5.808 ± 1.603
1.659SerMet: 1.659 ± 0.571
4.148SerAsn: 4.148 ± 0.695
3.042SerPro: 3.042 ± 0.786
2.212SerGln: 2.212 ± 0.621
3.319SerArg: 3.319 ± 1.044
6.361SerSer: 6.361 ± 0.776
4.148SerThr: 4.148 ± 0.837
6.637SerVal: 6.637 ± 1.4
1.383SerTrp: 1.383 ± 0.851
1.936SerTyr: 1.936 ± 0.582
0.0SerXaa: 0.0 ± 0.0
Thr
6.914ThrAla: 6.914 ± 1.66
1.106ThrCys: 1.106 ± 0.46
4.425ThrAsp: 4.425 ± 1.084
2.765ThrGlu: 2.765 ± 0.784
2.765ThrPhe: 2.765 ± 0.814
4.148ThrGly: 4.148 ± 1.109
0.553ThrHis: 0.553 ± 0.33
2.765ThrIle: 2.765 ± 0.94
1.936ThrLys: 1.936 ± 0.519
6.637ThrLeu: 6.637 ± 1.006
0.553ThrMet: 0.553 ± 0.316
1.383ThrAsn: 1.383 ± 0.29
4.425ThrPro: 4.425 ± 1.092
2.489ThrGln: 2.489 ± 1.08
2.212ThrArg: 2.212 ± 0.525
4.148ThrSer: 4.148 ± 0.848
5.254ThrThr: 5.254 ± 1.433
3.872ThrVal: 3.872 ± 0.995
1.106ThrTrp: 1.106 ± 0.532
1.659ThrTyr: 1.659 ± 0.577
0.0ThrXaa: 0.0 ± 0.0
Val
8.573ValAla: 8.573 ± 1.702
0.553ValCys: 0.553 ± 0.356
5.254ValAsp: 5.254 ± 1.27
3.042ValGlu: 3.042 ± 0.804
3.042ValPhe: 3.042 ± 1.058
6.084ValGly: 6.084 ± 0.87
2.212ValHis: 2.212 ± 1.143
2.765ValIle: 2.765 ± 1.38
4.978ValLys: 4.978 ± 0.929
5.808ValLeu: 5.808 ± 1.794
2.765ValMet: 2.765 ± 1.022
2.489ValAsn: 2.489 ± 0.645
2.765ValPro: 2.765 ± 0.8
2.765ValGln: 2.765 ± 1.202
3.872ValArg: 3.872 ± 0.976
6.084ValSer: 6.084 ± 1.553
4.978ValThr: 4.978 ± 1.199
8.573ValVal: 8.573 ± 1.599
0.553ValTrp: 0.553 ± 0.397
2.765ValTyr: 2.765 ± 0.614
0.0ValXaa: 0.0 ± 0.0
Trp
2.212TrpAla: 2.212 ± 0.887
0.0TrpCys: 0.0 ± 0.0
0.277TrpAsp: 0.277 ± 0.335
0.277TrpGlu: 0.277 ± 0.238
0.553TrpPhe: 0.553 ± 0.476
1.106TrpGly: 1.106 ± 0.568
0.277TrpHis: 0.277 ± 0.238
0.553TrpIle: 0.553 ± 0.307
0.0TrpLys: 0.0 ± 0.0
1.106TrpLeu: 1.106 ± 1.172
0.0TrpMet: 0.0 ± 0.0
0.83TrpAsn: 0.83 ± 0.29
1.936TrpPro: 1.936 ± 0.409
0.553TrpGln: 0.553 ± 0.258
0.553TrpArg: 0.553 ± 0.275
1.106TrpSer: 1.106 ± 0.43
0.553TrpThr: 0.553 ± 0.397
0.277TrpVal: 0.277 ± 0.198
0.83TrpTrp: 0.83 ± 0.545
0.277TrpTyr: 0.277 ± 0.238
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.319TyrAla: 3.319 ± 0.616
0.553TyrCys: 0.553 ± 0.429
1.383TyrAsp: 1.383 ± 0.691
2.212TyrGlu: 2.212 ± 0.73
0.553TyrPhe: 0.553 ± 0.333
2.765TyrGly: 2.765 ± 1.064
0.553TyrHis: 0.553 ± 0.34
1.383TyrIle: 1.383 ± 0.478
1.106TyrLys: 1.106 ± 0.53
2.489TyrLeu: 2.489 ± 1.111
0.553TyrMet: 0.553 ± 0.274
0.83TyrAsn: 0.83 ± 0.468
0.83TyrPro: 0.83 ± 0.352
0.553TyrGln: 0.553 ± 0.369
3.042TyrArg: 3.042 ± 0.83
2.212TyrSer: 2.212 ± 0.706
1.106TyrThr: 1.106 ± 0.272
2.212TyrVal: 2.212 ± 0.654
0.277TyrTrp: 0.277 ± 0.198
0.83TyrTyr: 0.83 ± 0.323
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (3617 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski