Amino acid dipeptide frequency for Staphylococcus phage phiJB

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
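The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_frequencies`, the use of one-letter codes, counting overlapping pairs within each protein, and normalizing by the total number of dipeptides are all choices made here, and Proteome-pI may count or normalize differently. The toy sequences are made up.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over all given proteins.

    Assumptions of this sketch: overlapping pairs are counted within each
    protein only, and any letter outside the 20 standard residues is
    mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)

toy = ["MKKLA", "AKK"]  # hypothetical sequences, for illustration only
freqs = dipeptide_frequencies(toy)
```

Here `freqs["KK"]` is two out of six dipeptides, far above `EXPECTED_PER_MILLE`, which is the kind of deviation from the uniform expectation that the table highlights.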

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
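To make the units concrete, the short sketch below converts a few values taken from the table to plain fractions and compares them with the uniform expectation of 2.5‰. The helper names and the simple threshold rule are assumptions for illustration; the page does not state exactly how the red/blue marking is decided (for example, whether the error estimate is taken into account).

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille under the uniform assumption

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

# A few values copied from the table (per mille).
sample = {"IleLys": 8.312, "LysLys": 8.09, "AlaAsp": 2.375, "TrpTrp": 0.0}
labels = {pair: classify(v) for pair, v in sample.items()}
```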

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.891 ± 0.239
AlaCys: 0.297 ± 0.132
AlaAsp: 2.375 ± 0.427
AlaGlu: 4.379 ± 0.557
AlaPhe: 2.969 ± 0.538
AlaGly: 3.711 ± 0.658
AlaHis: 1.262 ± 0.303
AlaIle: 4.527 ± 0.548
AlaLys: 5.715 ± 0.612
AlaLeu: 4.601 ± 0.71
AlaMet: 1.559 ± 0.319
AlaAsn: 3.711 ± 0.504
AlaPro: 1.484 ± 0.353
AlaGln: 2.227 ± 0.397
AlaArg: 2.523 ± 0.349
AlaSer: 3.34 ± 0.578
AlaThr: 3.934 ± 0.63
AlaVal: 3.785 ± 0.71
AlaTrp: 1.039 ± 0.389
AlaTyr: 2.449 ± 0.397
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.074 ± 0.077
CysCys: 0.074 ± 0.085
CysAsp: 0.223 ± 0.118
CysGlu: 0.223 ± 0.15
CysPhe: 0.371 ± 0.162
CysGly: 0.371 ± 0.13
CysHis: 0.074 ± 0.068
CysIle: 0.223 ± 0.115
CysLys: 0.52 ± 0.193
CysLeu: 0.148 ± 0.105
CysMet: 0.074 ± 0.063
CysAsn: 0.297 ± 0.152
CysPro: 0.148 ± 0.09
CysGln: 0.148 ± 0.101
CysArg: 0.371 ± 0.188
CysSer: 0.445 ± 0.194
CysThr: 0.074 ± 0.071
CysVal: 0.371 ± 0.147
CysTrp: 0.074 ± 0.066
CysTyr: 0.445 ± 0.167
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.711 ± 0.692
AspCys: 0.371 ± 0.15
AspAsp: 5.121 ± 0.738
AspGlu: 5.047 ± 0.673
AspPhe: 3.562 ± 0.599
AspGly: 4.082 ± 0.559
AspHis: 0.297 ± 0.137
AspIle: 4.453 ± 0.53
AspLys: 5.566 ± 0.806
AspLeu: 5.344 ± 0.507
AspMet: 1.93 ± 0.332
AspAsn: 3.34 ± 0.433
AspPro: 1.187 ± 0.248
AspGln: 0.891 ± 0.258
AspArg: 2.523 ± 0.358
AspSer: 4.082 ± 0.51
AspThr: 3.785 ± 0.442
AspVal: 4.305 ± 0.718
AspTrp: 0.891 ± 0.281
AspTyr: 3.117 ± 0.422
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.527 ± 0.624
GluCys: 0.52 ± 0.203
GluAsp: 3.785 ± 0.668
GluGlu: 5.566 ± 0.816
GluPhe: 3.043 ± 0.453
GluGly: 3.637 ± 0.564
GluHis: 1.707 ± 0.33
GluIle: 5.195 ± 0.649
GluLys: 7.125 ± 0.871
GluLeu: 7.051 ± 0.756
GluMet: 2.449 ± 0.397
GluAsn: 4.75 ± 0.572
GluPro: 1.855 ± 0.334
GluGln: 3.711 ± 0.555
GluArg: 2.82 ± 0.341
GluSer: 4.156 ± 0.598
GluThr: 2.82 ± 0.331
GluVal: 5.566 ± 0.683
GluTrp: 1.262 ± 0.323
GluTyr: 3.934 ± 0.542
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.301 ± 0.355
PheCys: 0.297 ± 0.131
PheAsp: 3.859 ± 0.462
PheGlu: 3.414 ± 0.479
PhePhe: 1.113 ± 0.261
PheGly: 2.598 ± 0.598
PheHis: 0.742 ± 0.195
PheIle: 2.969 ± 0.429
PheLys: 4.453 ± 0.637
PheLeu: 2.894 ± 0.392
PheMet: 1.336 ± 0.275
PheAsn: 3.414 ± 0.414
PhePro: 0.742 ± 0.262
PheGln: 0.816 ± 0.272
PheArg: 1.633 ± 0.301
PheSer: 3.043 ± 0.537
PheThr: 3.043 ± 0.487
PheVal: 3.117 ± 0.445
PheTrp: 0.445 ± 0.193
PheTyr: 2.301 ± 0.405
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.934 ± 0.533
GlyCys: 0.297 ± 0.157
GlyAsp: 3.488 ± 0.525
GlyGlu: 3.34 ± 0.401
GlyPhe: 2.894 ± 0.497
GlyGly: 2.672 ± 0.484
GlyHis: 1.855 ± 0.381
GlyIle: 4.676 ± 0.504
GlyLys: 5.195 ± 0.501
GlyLeu: 3.934 ± 0.514
GlyMet: 1.781 ± 0.361
GlyAsn: 3.266 ± 0.484
GlyPro: 0.594 ± 0.195
GlyGln: 2.969 ± 0.397
GlyArg: 2.375 ± 0.448
GlySer: 2.672 ± 0.459
GlyThr: 3.934 ± 0.453
GlyVal: 5.047 ± 0.725
GlyTrp: 0.891 ± 0.258
GlyTyr: 3.117 ± 0.538
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.262 ± 0.269
HisCys: 0.0 ± 0.0
HisAsp: 0.742 ± 0.218
HisGlu: 1.039 ± 0.27
HisPhe: 0.891 ± 0.253
HisGly: 1.187 ± 0.254
HisHis: 0.445 ± 0.194
HisIle: 1.262 ± 0.302
HisLys: 1.262 ± 0.228
HisLeu: 1.41 ± 0.297
HisMet: 0.371 ± 0.161
HisAsn: 0.965 ± 0.241
HisPro: 1.039 ± 0.298
HisGln: 0.742 ± 0.241
HisArg: 0.742 ± 0.265
HisSer: 1.187 ± 0.306
HisThr: 1.336 ± 0.318
HisVal: 1.187 ± 0.282
HisTrp: 0.148 ± 0.099
HisTyr: 0.816 ± 0.284
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.23 ± 0.503
IleCys: 0.223 ± 0.126
IleAsp: 5.492 ± 0.723
IleGlu: 7.644 ± 0.933
IlePhe: 2.82 ± 0.438
IleGly: 4.824 ± 0.795
IleHis: 0.891 ± 0.265
IleIle: 3.043 ± 0.406
IleLys: 8.312 ± 0.658
IleLeu: 4.75 ± 0.571
IleMet: 2.523 ± 0.418
IleAsn: 5.269 ± 0.622
IlePro: 2.004 ± 0.285
IleGln: 2.969 ± 0.329
IleArg: 3.117 ± 0.61
IleSer: 4.156 ± 0.533
IleThr: 4.75 ± 0.629
IleVal: 4.156 ± 0.503
IleTrp: 0.742 ± 0.308
IleTyr: 3.191 ± 0.641
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.086 ± 0.655
LysCys: 0.594 ± 0.214
LysAsp: 5.121 ± 0.629
LysGlu: 7.422 ± 0.654
LysPhe: 4.305 ± 0.455
LysGly: 5.492 ± 0.756
LysHis: 2.004 ± 0.461
LysIle: 6.68 ± 0.784
LysLys: 8.09 ± 0.979
LysLeu: 6.754 ± 0.714
LysMet: 3.191 ± 0.329
LysAsn: 5.863 ± 0.827
LysPro: 2.82 ± 0.489
LysGln: 3.859 ± 0.555
LysArg: 4.23 ± 0.661
LysSer: 5.121 ± 0.558
LysThr: 5.269 ± 0.696
LysVal: 4.824 ± 0.544
LysTrp: 1.039 ± 0.268
LysTyr: 4.453 ± 0.73
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.934 ± 0.576
LeuCys: 0.297 ± 0.146
LeuAsp: 5.566 ± 0.555
LeuGlu: 5.566 ± 0.702
LeuPhe: 3.488 ± 0.429
LeuGly: 3.637 ± 0.581
LeuHis: 1.262 ± 0.336
LeuIle: 6.086 ± 0.595
LeuLys: 7.273 ± 0.64
LeuLeu: 5.121 ± 0.649
LeuMet: 1.559 ± 0.371
LeuAsn: 5.121 ± 0.565
LeuPro: 2.152 ± 0.319
LeuGln: 3.043 ± 0.402
LeuArg: 3.043 ± 0.491
LeuSer: 4.601 ± 0.536
LeuThr: 4.601 ± 0.585
LeuVal: 3.934 ± 0.486
LeuTrp: 0.371 ± 0.177
LeuTyr: 4.082 ± 0.556
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.152 ± 0.527
MetCys: 0.223 ± 0.117
MetAsp: 1.187 ± 0.268
MetGlu: 1.262 ± 0.321
MetPhe: 0.965 ± 0.196
MetGly: 1.113 ± 0.289
MetHis: 0.445 ± 0.178
MetIle: 1.707 ± 0.372
MetLys: 2.449 ± 0.367
MetLeu: 2.301 ± 0.362
MetMet: 0.594 ± 0.262
MetAsn: 1.855 ± 0.327
MetPro: 1.187 ± 0.285
MetGln: 1.484 ± 0.486
MetArg: 1.262 ± 0.334
MetSer: 1.781 ± 0.407
MetThr: 2.82 ± 0.428
MetVal: 0.742 ± 0.178
MetTrp: 0.445 ± 0.163
MetTyr: 1.039 ± 0.247
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.488 ± 0.606
AsnCys: 0.223 ± 0.131
AsnAsp: 5.121 ± 0.614
AsnGlu: 4.527 ± 0.513
AsnPhe: 2.746 ± 0.449
AsnGly: 4.824 ± 0.606
AsnHis: 0.891 ± 0.294
AsnIle: 4.676 ± 0.467
AsnLys: 6.086 ± 0.657
AsnLeu: 3.637 ± 0.511
AsnMet: 1.855 ± 0.324
AsnAsn: 4.973 ± 0.685
AsnPro: 3.117 ± 0.368
AsnGln: 2.227 ± 0.406
AsnArg: 2.375 ± 0.423
AsnSer: 3.562 ± 0.415
AsnThr: 3.117 ± 0.418
AsnVal: 4.305 ± 0.587
AsnTrp: 0.668 ± 0.192
AsnTyr: 2.375 ± 0.474
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.484 ± 0.269
ProCys: 0.074 ± 0.074
ProAsp: 1.41 ± 0.277
ProGlu: 2.078 ± 0.363
ProPhe: 1.336 ± 0.32
ProGly: 1.781 ± 0.44
ProHis: 0.297 ± 0.143
ProIle: 2.672 ± 0.485
ProLys: 2.969 ± 0.519
ProLeu: 1.633 ± 0.429
ProMet: 0.668 ± 0.169
ProAsn: 2.152 ± 0.372
ProPro: 0.445 ± 0.152
ProGln: 1.039 ± 0.276
ProArg: 0.445 ± 0.148
ProSer: 1.633 ± 0.353
ProThr: 2.746 ± 0.436
ProVal: 1.187 ± 0.251
ProTrp: 0.148 ± 0.101
ProTyr: 1.633 ± 0.364
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.266 ± 0.458
GlnCys: 0.297 ± 0.166
GlnAsp: 1.633 ± 0.353
GlnGlu: 2.969 ± 0.601
GlnPhe: 2.301 ± 0.419
GlnGly: 2.598 ± 0.412
GlnHis: 0.891 ± 0.231
GlnIle: 2.672 ± 0.304
GlnLys: 2.598 ± 0.459
GlnLeu: 2.82 ± 0.432
GlnMet: 0.668 ± 0.261
GlnAsn: 2.301 ± 0.387
GlnPro: 1.113 ± 0.283
GlnGln: 1.559 ± 0.481
GlnArg: 2.078 ± 0.427
GlnSer: 2.152 ± 0.355
GlnThr: 1.781 ± 0.334
GlnVal: 2.375 ± 0.472
GlnTrp: 0.297 ± 0.126
GlnTyr: 1.633 ± 0.397
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.855 ± 0.297
ArgCys: 0.371 ± 0.173
ArgAsp: 2.894 ± 0.509
ArgGlu: 3.414 ± 0.571
ArgPhe: 1.93 ± 0.419
ArgGly: 1.93 ± 0.399
ArgHis: 1.336 ± 0.302
ArgIle: 3.785 ± 0.558
ArgLys: 4.008 ± 0.452
ArgLeu: 4.008 ± 0.507
ArgMet: 0.965 ± 0.24
ArgAsn: 2.598 ± 0.393
ArgPro: 1.113 ± 0.241
ArgGln: 1.559 ± 0.411
ArgArg: 1.559 ± 0.372
ArgSer: 1.113 ± 0.322
ArgThr: 1.113 ± 0.302
ArgVal: 1.781 ± 0.285
ArgTrp: 0.445 ± 0.173
ArgTyr: 2.672 ± 0.443
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.934 ± 0.549
SerCys: 0.0 ± 0.0
SerAsp: 4.23 ± 0.539
SerGlu: 3.266 ± 0.516
SerPhe: 3.043 ± 0.417
SerGly: 4.082 ± 0.605
SerHis: 0.594 ± 0.176
SerIle: 5.64 ± 0.677
SerLys: 5.195 ± 0.663
SerLeu: 3.711 ± 0.484
SerMet: 2.004 ± 0.313
SerAsn: 3.488 ± 0.545
SerPro: 1.262 ± 0.304
SerGln: 2.82 ± 0.591
SerArg: 2.078 ± 0.367
SerSer: 3.414 ± 0.633
SerThr: 3.562 ± 0.453
SerVal: 3.488 ± 0.543
SerTrp: 0.52 ± 0.2
SerTyr: 1.781 ± 0.334
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.672 ± 0.463
ThrCys: 0.148 ± 0.095
ThrAsp: 3.711 ± 0.452
ThrGlu: 3.859 ± 0.553
ThrPhe: 2.746 ± 0.509
ThrGly: 3.859 ± 0.45
ThrHis: 1.41 ± 0.292
ThrIle: 5.492 ± 0.759
ThrLys: 4.824 ± 0.603
ThrLeu: 5.121 ± 0.644
ThrMet: 0.965 ± 0.265
ThrAsn: 3.934 ± 0.676
ThrPro: 1.781 ± 0.407
ThrGln: 2.152 ± 0.485
ThrArg: 2.672 ± 0.486
ThrSer: 4.453 ± 0.785
ThrThr: 3.711 ± 0.581
ThrVal: 3.043 ± 0.411
ThrTrp: 0.742 ± 0.255
ThrTyr: 2.449 ± 0.408
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.711 ± 0.628
ValCys: 0.148 ± 0.102
ValAsp: 4.75 ± 0.651
ValGlu: 6.012 ± 0.667
ValPhe: 1.707 ± 0.4
ValGly: 3.043 ± 0.471
ValHis: 0.742 ± 0.242
ValIle: 4.898 ± 0.55
ValLys: 6.457 ± 0.64
ValLeu: 4.75 ± 0.564
ValMet: 1.484 ± 0.324
ValAsn: 3.266 ± 0.535
ValPro: 2.598 ± 0.457
ValGln: 1.559 ± 0.368
ValArg: 2.078 ± 0.336
ValSer: 3.562 ± 0.642
ValThr: 3.637 ± 0.52
ValVal: 4.453 ± 0.535
ValTrp: 0.594 ± 0.171
ValTyr: 2.152 ± 0.487
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.891 ± 0.249
TrpCys: 0.074 ± 0.066
TrpAsp: 0.52 ± 0.179
TrpGlu: 0.594 ± 0.2
TrpPhe: 0.594 ± 0.161
TrpGly: 0.742 ± 0.233
TrpHis: 0.223 ± 0.115
TrpIle: 0.668 ± 0.211
TrpLys: 0.891 ± 0.241
TrpLeu: 1.113 ± 0.296
TrpMet: 0.297 ± 0.134
TrpAsn: 0.816 ± 0.186
TrpPro: 0.074 ± 0.057
TrpGln: 0.594 ± 0.228
TrpArg: 0.445 ± 0.193
TrpSer: 0.668 ± 0.283
TrpThr: 0.816 ± 0.206
TrpVal: 0.891 ± 0.226
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.52 ± 0.203
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.375 ± 0.423
TyrCys: 0.223 ± 0.11
TyrAsp: 2.375 ± 0.545
TyrGlu: 3.859 ± 0.59
TyrPhe: 1.707 ± 0.432
TyrGly: 2.82 ± 0.616
TyrHis: 0.668 ± 0.22
TyrIle: 3.785 ± 0.627
TyrLys: 4.23 ± 0.564
TyrLeu: 3.637 ± 0.548
TyrMet: 0.668 ± 0.239
TyrAsn: 3.414 ± 0.502
TyrPro: 1.113 ± 0.325
TyrGln: 1.633 ± 0.357
TyrArg: 2.078 ± 0.463
TyrSer: 2.969 ± 0.419
TyrThr: 2.894 ± 0.466
TyrVal: 3.043 ± 0.415
TyrTrp: 0.668 ± 0.187
TyrTyr: 1.707 ± 0.34
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13475 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
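A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: it assumes that "x100" means 100 resamples, that whole proteins are resampled with replacement, and that the reported error is the standard deviation across resamples. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so residues within a protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, pair))
    return statistics.stdev(estimates)

toy = ["MKKLA", "AKK", "MLLKA", "KKKA"]  # hypothetical sequences
error = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues in the same protein, which is presumably why the error is estimated at the protein level.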

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski