Amino acid dipepetide frequency for Streptococcus phage CHPC954

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.715AlaAla: 2.715 ± 0.747
0.175AlaCys: 0.175 ± 0.127
4.904AlaAsp: 4.904 ± 0.723
3.765AlaGlu: 3.765 ± 0.562
2.452AlaPhe: 2.452 ± 0.834
3.678AlaGly: 3.678 ± 0.645
0.876AlaHis: 0.876 ± 0.283
4.904AlaIle: 4.904 ± 0.768
6.655AlaLys: 6.655 ± 1.151
5.867AlaLeu: 5.867 ± 0.673
1.751AlaMet: 1.751 ± 0.39
4.466AlaAsn: 4.466 ± 0.758
1.926AlaPro: 1.926 ± 0.355
2.715AlaGln: 2.715 ± 0.45
2.102AlaArg: 2.102 ± 0.396
3.853AlaSer: 3.853 ± 0.494
4.291AlaThr: 4.291 ± 0.774
3.853AlaVal: 3.853 ± 0.732
1.051AlaTrp: 1.051 ± 0.303
2.715AlaTyr: 2.715 ± 0.653
0.0AlaXaa: 0.0 ± 0.0
Cys
0.088CysAla: 0.088 ± 0.074
0.0CysCys: 0.0 ± 0.0
0.788CysAsp: 0.788 ± 0.357
0.263CysGlu: 0.263 ± 0.134
0.263CysPhe: 0.263 ± 0.199
0.175CysGly: 0.175 ± 0.134
0.088CysHis: 0.088 ± 0.091
0.35CysIle: 0.35 ± 0.195
0.263CysLys: 0.263 ± 0.169
0.35CysLeu: 0.35 ± 0.176
0.088CysMet: 0.088 ± 0.089
0.175CysAsn: 0.175 ± 0.138
0.175CysPro: 0.175 ± 0.119
0.263CysGln: 0.263 ± 0.16
0.263CysArg: 0.263 ± 0.212
0.175CysSer: 0.175 ± 0.114
0.35CysThr: 0.35 ± 0.181
0.175CysVal: 0.175 ± 0.089
0.175CysTrp: 0.175 ± 0.141
0.088CysTyr: 0.088 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
3.765AspAla: 3.765 ± 0.744
0.263AspCys: 0.263 ± 0.149
4.378AspAsp: 4.378 ± 0.615
4.729AspGlu: 4.729 ± 0.847
3.94AspPhe: 3.94 ± 0.644
6.392AspGly: 6.392 ± 1.096
0.788AspHis: 0.788 ± 0.363
5.254AspIle: 5.254 ± 0.702
4.553AspLys: 4.553 ± 0.592
3.94AspLeu: 3.94 ± 0.718
2.452AspMet: 2.452 ± 0.544
3.59AspAsn: 3.59 ± 0.577
2.277AspPro: 2.277 ± 0.46
1.576AspGln: 1.576 ± 0.273
2.715AspArg: 2.715 ± 0.486
4.203AspSer: 4.203 ± 0.72
4.291AspThr: 4.291 ± 0.64
3.765AspVal: 3.765 ± 0.673
1.138AspTrp: 1.138 ± 0.3
3.415AspTyr: 3.415 ± 0.602
0.0AspXaa: 0.0 ± 0.0
Glu
4.466GluAla: 4.466 ± 0.675
0.263GluCys: 0.263 ± 0.126
3.59GluAsp: 3.59 ± 0.546
4.991GluGlu: 4.991 ± 0.969
2.364GluPhe: 2.364 ± 0.516
3.59GluGly: 3.59 ± 0.395
1.138GluHis: 1.138 ± 0.348
6.13GluIle: 6.13 ± 0.726
4.466GluLys: 4.466 ± 1.08
6.305GluLeu: 6.305 ± 0.862
2.189GluMet: 2.189 ± 0.516
4.641GluAsn: 4.641 ± 0.615
1.664GluPro: 1.664 ± 0.436
2.89GluGln: 2.89 ± 0.462
3.065GluArg: 3.065 ± 0.516
2.802GluSer: 2.802 ± 0.484
3.327GluThr: 3.327 ± 0.588
5.429GluVal: 5.429 ± 0.757
1.138GluTrp: 1.138 ± 0.292
3.59GluTyr: 3.59 ± 0.709
0.0GluXaa: 0.0 ± 0.0
Phe
2.977PheAla: 2.977 ± 0.514
0.175PheCys: 0.175 ± 0.112
3.24PheAsp: 3.24 ± 0.459
2.189PheGlu: 2.189 ± 0.558
1.751PhePhe: 1.751 ± 0.421
3.678PheGly: 3.678 ± 0.59
0.35PheHis: 0.35 ± 0.139
2.452PheIle: 2.452 ± 0.586
4.378PheLys: 4.378 ± 0.595
2.627PheLeu: 2.627 ± 0.487
0.525PheMet: 0.525 ± 0.244
3.678PheAsn: 3.678 ± 0.674
0.35PhePro: 0.35 ± 0.181
1.401PheGln: 1.401 ± 0.305
1.751PheArg: 1.751 ± 0.342
3.065PheSer: 3.065 ± 0.642
2.977PheThr: 2.977 ± 0.555
2.627PheVal: 2.627 ± 0.417
0.525PheTrp: 0.525 ± 0.177
1.664PheTyr: 1.664 ± 0.364
0.0PheXaa: 0.0 ± 0.0
Gly
3.678GlyAla: 3.678 ± 0.662
0.263GlyCys: 0.263 ± 0.145
4.378GlyAsp: 4.378 ± 0.629
3.24GlyGlu: 3.24 ± 0.598
2.89GlyPhe: 2.89 ± 0.44
4.553GlyGly: 4.553 ± 0.86
0.788GlyHis: 0.788 ± 0.253
5.342GlyIle: 5.342 ± 0.664
6.567GlyLys: 6.567 ± 0.912
5.867GlyLeu: 5.867 ± 0.812
1.401GlyMet: 1.401 ± 0.295
3.94GlyAsn: 3.94 ± 0.709
1.576GlyPro: 1.576 ± 0.426
2.977GlyGln: 2.977 ± 0.578
3.152GlyArg: 3.152 ± 0.581
4.904GlySer: 4.904 ± 0.762
4.291GlyThr: 4.291 ± 0.715
3.24GlyVal: 3.24 ± 0.692
1.401GlyTrp: 1.401 ± 0.336
3.152GlyTyr: 3.152 ± 0.472
0.0GlyXaa: 0.0 ± 0.0
His
0.525HisAla: 0.525 ± 0.278
0.0HisCys: 0.0 ± 0.0
0.788HisAsp: 0.788 ± 0.244
0.525HisGlu: 0.525 ± 0.191
0.701HisPhe: 0.701 ± 0.194
0.788HisGly: 0.788 ± 0.243
0.438HisHis: 0.438 ± 0.18
1.051HisIle: 1.051 ± 0.302
1.051HisLys: 1.051 ± 0.311
1.051HisLeu: 1.051 ± 0.273
0.525HisMet: 0.525 ± 0.223
0.613HisAsn: 0.613 ± 0.305
0.701HisPro: 0.701 ± 0.215
0.613HisGln: 0.613 ± 0.227
0.613HisArg: 0.613 ± 0.217
0.876HisSer: 0.876 ± 0.244
0.613HisThr: 0.613 ± 0.19
1.138HisVal: 1.138 ± 0.254
0.088HisTrp: 0.088 ± 0.109
0.788HisTyr: 0.788 ± 0.347
0.0HisXaa: 0.0 ± 0.0
Ile
5.254IleAla: 5.254 ± 0.827
0.438IleCys: 0.438 ± 0.186
4.991IleAsp: 4.991 ± 0.6
5.429IleGlu: 5.429 ± 0.927
1.751IlePhe: 1.751 ± 0.354
4.378IleGly: 4.378 ± 0.775
0.788IleHis: 0.788 ± 0.236
3.94IleIle: 3.94 ± 0.672
6.918IleLys: 6.918 ± 0.66
3.94IleLeu: 3.94 ± 0.736
1.489IleMet: 1.489 ± 0.445
4.291IleAsn: 4.291 ± 0.547
3.327IlePro: 3.327 ± 0.463
3.24IleGln: 3.24 ± 0.462
3.327IleArg: 3.327 ± 0.64
3.678IleSer: 3.678 ± 0.522
3.415IleThr: 3.415 ± 0.572
3.152IleVal: 3.152 ± 0.576
1.138IleTrp: 1.138 ± 0.275
2.102IleTyr: 2.102 ± 0.514
0.0IleXaa: 0.0 ± 0.0
Lys
6.392LysAla: 6.392 ± 0.809
0.263LysCys: 0.263 ± 0.177
4.729LysAsp: 4.729 ± 0.819
7.706LysGlu: 7.706 ± 1.141
3.853LysPhe: 3.853 ± 0.762
5.692LysGly: 5.692 ± 0.664
1.051LysHis: 1.051 ± 0.413
5.166LysIle: 5.166 ± 0.722
7.356LysLys: 7.356 ± 1.263
7.443LysLeu: 7.443 ± 0.994
1.926LysMet: 1.926 ± 0.439
5.254LysAsn: 5.254 ± 0.608
3.152LysPro: 3.152 ± 0.455
3.678LysGln: 3.678 ± 0.52
3.94LysArg: 3.94 ± 0.544
3.853LysSer: 3.853 ± 0.411
5.429LysThr: 5.429 ± 0.557
4.203LysVal: 4.203 ± 0.569
1.401LysTrp: 1.401 ± 0.255
3.327LysTyr: 3.327 ± 0.63
0.0LysXaa: 0.0 ± 0.0
Leu
6.305LeuAla: 6.305 ± 0.64
0.35LeuCys: 0.35 ± 0.228
5.867LeuAsp: 5.867 ± 0.674
7.268LeuGlu: 7.268 ± 0.932
2.89LeuPhe: 2.89 ± 0.351
5.429LeuGly: 5.429 ± 1.028
0.701LeuHis: 0.701 ± 0.23
4.816LeuIle: 4.816 ± 0.657
6.48LeuLys: 6.48 ± 0.801
5.779LeuLeu: 5.779 ± 0.674
2.277LeuMet: 2.277 ± 0.434
5.954LeuAsn: 5.954 ± 0.735
2.539LeuPro: 2.539 ± 0.413
2.627LeuGln: 2.627 ± 0.516
3.59LeuArg: 3.59 ± 0.623
4.729LeuSer: 4.729 ± 0.74
5.954LeuThr: 5.954 ± 0.89
3.853LeuVal: 3.853 ± 0.578
0.613LeuTrp: 0.613 ± 0.278
2.014LeuTyr: 2.014 ± 0.37
0.0LeuXaa: 0.0 ± 0.0
Met
1.839MetAla: 1.839 ± 0.319
0.088MetCys: 0.088 ± 0.082
1.138MetAsp: 1.138 ± 0.336
1.313MetGlu: 1.313 ± 0.278
1.138MetPhe: 1.138 ± 0.275
1.138MetGly: 1.138 ± 0.315
0.438MetHis: 0.438 ± 0.219
1.401MetIle: 1.401 ± 0.283
3.327MetLys: 3.327 ± 0.645
1.751MetLeu: 1.751 ± 0.302
0.438MetMet: 0.438 ± 0.283
1.051MetAsn: 1.051 ± 0.256
1.138MetPro: 1.138 ± 0.261
0.788MetGln: 0.788 ± 0.208
0.876MetArg: 0.876 ± 0.228
1.751MetSer: 1.751 ± 0.374
1.489MetThr: 1.489 ± 0.308
1.751MetVal: 1.751 ± 0.412
0.088MetTrp: 0.088 ± 0.07
1.051MetTyr: 1.051 ± 0.281
0.0MetXaa: 0.0 ± 0.0
Asn
4.553AsnAla: 4.553 ± 1.073
0.263AsnCys: 0.263 ± 0.149
3.94AsnAsp: 3.94 ± 0.517
3.765AsnGlu: 3.765 ± 0.661
2.802AsnPhe: 2.802 ± 0.48
6.655AsnGly: 6.655 ± 0.85
0.876AsnHis: 0.876 ± 0.281
3.503AsnIle: 3.503 ± 0.509
4.291AsnLys: 4.291 ± 0.632
5.342AsnLeu: 5.342 ± 0.537
1.138AsnMet: 1.138 ± 0.278
4.291AsnAsn: 4.291 ± 0.847
2.89AsnPro: 2.89 ± 0.557
2.627AsnGln: 2.627 ± 0.434
2.452AsnArg: 2.452 ± 0.614
3.59AsnSer: 3.59 ± 0.49
3.503AsnThr: 3.503 ± 0.535
3.503AsnVal: 3.503 ± 0.502
1.138AsnTrp: 1.138 ± 0.288
2.014AsnTyr: 2.014 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
1.489ProAla: 1.489 ± 0.371
0.0ProCys: 0.0 ± 0.0
1.576ProAsp: 1.576 ± 0.466
2.189ProGlu: 2.189 ± 0.437
1.051ProPhe: 1.051 ± 0.313
1.664ProGly: 1.664 ± 0.553
0.438ProHis: 0.438 ± 0.174
1.226ProIle: 1.226 ± 0.33
3.94ProLys: 3.94 ± 0.512
2.452ProLeu: 2.452 ± 0.443
0.35ProMet: 0.35 ± 0.157
2.189ProAsn: 2.189 ± 0.432
0.525ProPro: 0.525 ± 0.249
1.576ProGln: 1.576 ± 0.328
0.788ProArg: 0.788 ± 0.318
2.627ProSer: 2.627 ± 0.44
2.452ProThr: 2.452 ± 0.439
1.839ProVal: 1.839 ± 0.512
0.613ProTrp: 0.613 ± 0.218
0.963ProTyr: 0.963 ± 0.434
0.0ProXaa: 0.0 ± 0.0
Gln
3.503GlnAla: 3.503 ± 0.573
0.263GlnCys: 0.263 ± 0.144
2.364GlnAsp: 2.364 ± 0.501
2.539GlnGlu: 2.539 ± 0.566
1.576GlnPhe: 1.576 ± 0.341
3.415GlnGly: 3.415 ± 0.718
0.438GlnHis: 0.438 ± 0.166
2.102GlnIle: 2.102 ± 0.564
3.24GlnLys: 3.24 ± 0.542
3.59GlnLeu: 3.59 ± 0.415
1.226GlnMet: 1.226 ± 0.352
2.627GlnAsn: 2.627 ± 0.428
0.263GlnPro: 0.263 ± 0.159
2.539GlnGln: 2.539 ± 0.461
1.313GlnArg: 1.313 ± 0.25
2.277GlnSer: 2.277 ± 0.541
2.364GlnThr: 2.364 ± 0.518
2.014GlnVal: 2.014 ± 0.415
0.788GlnTrp: 0.788 ± 0.292
2.189GlnTyr: 2.189 ± 0.556
0.0GlnXaa: 0.0 ± 0.0
Arg
2.102ArgAla: 2.102 ± 0.42
0.088ArgCys: 0.088 ± 0.091
2.802ArgAsp: 2.802 ± 0.479
2.364ArgGlu: 2.364 ± 0.381
2.452ArgPhe: 2.452 ± 0.387
2.89ArgGly: 2.89 ± 0.615
0.701ArgHis: 0.701 ± 0.248
3.152ArgIle: 3.152 ± 0.668
2.977ArgLys: 2.977 ± 0.592
3.503ArgLeu: 3.503 ± 0.662
1.138ArgMet: 1.138 ± 0.354
2.539ArgAsn: 2.539 ± 0.361
0.963ArgPro: 0.963 ± 0.243
1.839ArgGln: 1.839 ± 0.372
1.313ArgArg: 1.313 ± 0.267
1.839ArgSer: 1.839 ± 0.395
3.152ArgThr: 3.152 ± 0.685
2.539ArgVal: 2.539 ± 0.401
1.138ArgTrp: 1.138 ± 0.285
2.102ArgTyr: 2.102 ± 0.559
0.0ArgXaa: 0.0 ± 0.0
Ser
3.152SerAla: 3.152 ± 0.559
0.613SerCys: 0.613 ± 0.277
5.079SerAsp: 5.079 ± 0.521
3.678SerGlu: 3.678 ± 0.571
2.627SerPhe: 2.627 ± 0.534
4.116SerGly: 4.116 ± 0.6
0.613SerHis: 0.613 ± 0.225
4.291SerIle: 4.291 ± 0.59
4.991SerLys: 4.991 ± 0.575
4.816SerLeu: 4.816 ± 0.487
1.751SerMet: 1.751 ± 0.329
4.378SerAsn: 4.378 ± 0.51
1.576SerPro: 1.576 ± 0.29
2.715SerGln: 2.715 ± 0.528
2.452SerArg: 2.452 ± 0.748
3.415SerSer: 3.415 ± 0.638
3.59SerThr: 3.59 ± 0.475
4.904SerVal: 4.904 ± 0.672
0.788SerTrp: 0.788 ± 0.288
1.751SerTyr: 1.751 ± 0.431
0.0SerXaa: 0.0 ± 0.0
Thr
4.641ThrAla: 4.641 ± 0.731
0.35ThrCys: 0.35 ± 0.137
3.94ThrAsp: 3.94 ± 0.684
3.678ThrGlu: 3.678 ± 0.506
2.977ThrPhe: 2.977 ± 0.55
3.415ThrGly: 3.415 ± 0.488
1.313ThrHis: 1.313 ± 0.286
4.291ThrIle: 4.291 ± 0.65
5.166ThrLys: 5.166 ± 0.603
6.48ThrLeu: 6.48 ± 0.844
0.701ThrMet: 0.701 ± 0.221
4.203ThrAsn: 4.203 ± 0.725
1.926ThrPro: 1.926 ± 0.507
2.364ThrGln: 2.364 ± 0.487
1.926ThrArg: 1.926 ± 0.286
4.291ThrSer: 4.291 ± 0.645
3.415ThrThr: 3.415 ± 0.591
3.94ThrVal: 3.94 ± 0.774
0.788ThrTrp: 0.788 ± 0.248
3.152ThrTyr: 3.152 ± 0.618
0.0ThrXaa: 0.0 ± 0.0
Val
3.853ValAla: 3.853 ± 0.542
0.175ValCys: 0.175 ± 0.096
5.079ValAsp: 5.079 ± 0.554
4.466ValGlu: 4.466 ± 0.689
2.539ValPhe: 2.539 ± 0.411
3.853ValGly: 3.853 ± 0.548
0.788ValHis: 0.788 ± 0.189
4.116ValIle: 4.116 ± 0.564
5.254ValLys: 5.254 ± 0.568
3.853ValLeu: 3.853 ± 0.719
1.313ValMet: 1.313 ± 0.314
2.89ValAsn: 2.89 ± 0.494
1.664ValPro: 1.664 ± 0.429
1.489ValGln: 1.489 ± 0.389
2.277ValArg: 2.277 ± 0.577
5.166ValSer: 5.166 ± 0.821
4.291ValThr: 4.291 ± 0.703
3.59ValVal: 3.59 ± 0.674
0.788ValTrp: 0.788 ± 0.283
1.839ValTyr: 1.839 ± 0.391
0.0ValXaa: 0.0 ± 0.0
Trp
0.525TrpAla: 0.525 ± 0.216
0.088TrpCys: 0.088 ± 0.091
1.051TrpAsp: 1.051 ± 0.332
1.051TrpGlu: 1.051 ± 0.244
0.613TrpPhe: 0.613 ± 0.25
0.438TrpGly: 0.438 ± 0.172
0.175TrpHis: 0.175 ± 0.109
0.876TrpIle: 0.876 ± 0.236
1.226TrpLys: 1.226 ± 0.375
1.401TrpLeu: 1.401 ± 0.354
0.263TrpMet: 0.263 ± 0.14
0.963TrpAsn: 0.963 ± 0.293
0.088TrpPro: 0.088 ± 0.09
0.701TrpGln: 0.701 ± 0.251
0.788TrpArg: 0.788 ± 0.229
1.839TrpSer: 1.839 ± 0.486
1.401TrpThr: 1.401 ± 0.584
1.226TrpVal: 1.226 ± 0.259
0.263TrpTrp: 0.263 ± 0.198
0.35TrpTyr: 0.35 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.89TyrAla: 2.89 ± 0.557
0.438TyrCys: 0.438 ± 0.246
2.802TyrAsp: 2.802 ± 0.53
2.977TyrGlu: 2.977 ± 0.522
1.751TyrPhe: 1.751 ± 0.444
1.664TyrGly: 1.664 ± 0.637
0.701TyrHis: 0.701 ± 0.213
2.627TyrIle: 2.627 ± 0.513
2.715TyrLys: 2.715 ± 0.431
3.678TyrLeu: 3.678 ± 0.509
0.963TyrMet: 0.963 ± 0.27
1.401TyrAsn: 1.401 ± 0.294
1.226TyrPro: 1.226 ± 0.427
2.014TyrGln: 2.014 ± 0.334
2.89TyrArg: 2.89 ± 0.616
2.364TyrSer: 2.364 ± 0.518
2.364TyrThr: 2.364 ± 0.598
2.539TyrVal: 2.539 ± 0.446
0.263TyrTrp: 0.263 ± 0.138
2.539TyrTyr: 2.539 ± 0.774
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (11421 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski