Amino acid dipeptide frequency for Rhodoferax phage P26218

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
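As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and normalizes by the total number of pairs. It assumes one-letter amino acid codes and a hypothetical helper name, dipeptide_frequencies; the exact counting procedure used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all adjacent residue pairs.

    Pairs are counted within each protein only, so no pair spans two
    different sequences.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: "MKKA" -> MK, KK, KA
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy sequences (hypothetical, not taken from the phage proteome)
freqs = dipeptide_frequencies(["MKKA", "AKK"])
for pair, fraction in sorted(freqs.items()):
    print(pair, round(fraction, 3))
```

Because the order of the two residues matters, AK and KA are counted as different dipeptides, which is why the table has a separate cell for each ordered pair.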

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
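The uniform expectation and the per mille conversion can be checked with a few lines of Python. This is a minimal sketch that assumes the uniform baseline described above (1/400 per dipeptide) is the reference for "more than expected"; the function names are hypothetical, and the three observed values are copied from the table for this proteome.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def expected_uniform_permille(n_residues=N_STANDARD):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)


def classify(value_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"


expected = expected_uniform_permille()  # 1000 / 400 = 2.5 per mille = 0.25%

# Values in per mille, taken from the table
observed = {"AlaAla": 14.953, "LeuTyr": 3.115, "CysCys": 0.178}
for name, value in observed.items():
    fraction = value * 1e-3  # per mille -> fraction
    print(name, fraction, classify(value, expected))
```

With 21 symbols (including Xaa) the uniform expectation drops slightly, to 1000/441, or about 2.27‰.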

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue (Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa)

Ala
AlaAla: 14.953 ± 1.639
AlaCys: 0.979 ± 0.302
AlaAsp: 7.566 ± 0.869
AlaGlu: 6.32 ± 0.839
AlaPhe: 2.848 ± 0.479
AlaGly: 10.325 ± 1.32
AlaHis: 1.691 ± 0.41
AlaIle: 4.361 ± 0.629
AlaLys: 7.299 ± 1.157
AlaLeu: 8.723 ± 1.002
AlaMet: 4.005 ± 0.728
AlaAsn: 4.183 ± 0.696
AlaPro: 5.785 ± 0.758
AlaGln: 5.874 ± 0.834
AlaArg: 6.854 ± 0.967
AlaSer: 5.518 ± 0.915
AlaThr: 8.1 ± 0.974
AlaVal: 6.765 ± 0.609
AlaTrp: 1.157 ± 0.295
AlaTyr: 2.492 ± 0.503
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.068 ± 0.262
CysCys: 0.178 ± 0.147
CysAsp: 0.356 ± 0.142
CysGlu: 0.623 ± 0.195
CysPhe: 0.267 ± 0.14
CysGly: 0.356 ± 0.145
CysHis: 0.267 ± 0.133
CysIle: 0.89 ± 0.271
CysLys: 0.534 ± 0.217
CysLeu: 0.623 ± 0.219
CysMet: 0.267 ± 0.108
CysAsn: 0.712 ± 0.268
CysPro: 0.534 ± 0.223
CysGln: 0.267 ± 0.152
CysArg: 0.356 ± 0.219
CysSer: 0.445 ± 0.193
CysThr: 0.267 ± 0.161
CysVal: 0.267 ± 0.178
CysTrp: 0.178 ± 0.119
CysTyr: 0.356 ± 0.162
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.587 ± 0.667
AspCys: 0.178 ± 0.113
AspAsp: 3.56 ± 0.672
AspGlu: 4.361 ± 0.639
AspPhe: 2.047 ± 0.465
AspGly: 4.361 ± 0.63
AspHis: 0.979 ± 0.212
AspIle: 3.382 ± 0.562
AspLys: 2.937 ± 0.583
AspLeu: 4.094 ± 0.642
AspMet: 1.869 ± 0.378
AspAsn: 2.225 ± 0.495
AspPro: 3.115 ± 0.596
AspGln: 2.136 ± 0.584
AspArg: 2.759 ± 0.557
AspSer: 1.78 ± 0.401
AspThr: 3.293 ± 0.588
AspVal: 3.738 ± 0.612
AspTrp: 1.335 ± 0.257
AspTyr: 1.869 ± 0.353
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.21 ± 0.962
GluCys: 0.445 ± 0.149
GluAsp: 3.293 ± 0.567
GluGlu: 3.827 ± 0.721
GluPhe: 1.78 ± 0.42
GluGly: 4.005 ± 0.565
GluHis: 1.157 ± 0.329
GluIle: 2.225 ± 0.52
GluLys: 2.136 ± 0.48
GluLeu: 4.717 ± 0.792
GluMet: 1.246 ± 0.314
GluAsn: 2.136 ± 0.382
GluPro: 1.424 ± 0.336
GluGln: 4.005 ± 0.571
GluArg: 3.293 ± 0.513
GluSer: 2.581 ± 0.507
GluThr: 3.115 ± 0.503
GluVal: 3.115 ± 0.528
GluTrp: 0.534 ± 0.179
GluTyr: 1.869 ± 0.414
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.848 ± 0.538
PheCys: 0.356 ± 0.153
PheAsp: 2.047 ± 0.462
PheGlu: 1.157 ± 0.264
PhePhe: 0.534 ± 0.187
PheGly: 2.759 ± 0.479
PheHis: 0.534 ± 0.214
PheIle: 1.78 ± 0.311
PheLys: 1.869 ± 0.488
PheLeu: 1.78 ± 0.319
PheMet: 0.267 ± 0.132
PheAsn: 1.335 ± 0.532
PhePro: 1.424 ± 0.504
PheGln: 0.979 ± 0.258
PheArg: 1.246 ± 0.236
PheSer: 1.602 ± 0.42
PheThr: 2.937 ± 0.58
PheVal: 2.225 ± 0.338
PheTrp: 0.356 ± 0.157
PheTyr: 1.068 ± 0.411
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 9.613 ± 1.23
GlyCys: 0.623 ± 0.18
GlyAsp: 3.115 ± 0.59
GlyGlu: 2.937 ± 0.532
GlyPhe: 2.403 ± 0.432
GlyGly: 6.409 ± 0.691
GlyHis: 1.157 ± 0.303
GlyIle: 4.361 ± 0.764
GlyLys: 5.874 ± 0.687
GlyLeu: 6.32 ± 0.649
GlyMet: 3.115 ± 0.52
GlyAsn: 2.403 ± 0.527
GlyPro: 3.026 ± 0.478
GlyGln: 3.56 ± 0.609
GlyArg: 3.738 ± 0.565
GlySer: 4.717 ± 0.687
GlyThr: 6.32 ± 1.048
GlyVal: 6.765 ± 0.649
GlyTrp: 1.513 ± 0.372
GlyTyr: 3.471 ± 0.555
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.958 ± 0.418
HisCys: 0.178 ± 0.131
HisAsp: 1.335 ± 0.321
HisGlu: 0.534 ± 0.178
HisPhe: 0.712 ± 0.251
HisGly: 0.979 ± 0.246
HisHis: 0.267 ± 0.15
HisIle: 0.712 ± 0.243
HisLys: 0.534 ± 0.197
HisLeu: 0.979 ± 0.226
HisMet: 0.712 ± 0.241
HisAsn: 0.801 ± 0.271
HisPro: 1.068 ± 0.313
HisGln: 0.623 ± 0.187
HisArg: 1.424 ± 0.392
HisSer: 0.623 ± 0.251
HisThr: 1.068 ± 0.281
HisVal: 1.602 ± 0.381
HisTrp: 0.534 ± 0.194
HisTyr: 0.801 ± 0.234
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.696 ± 0.769
IleCys: 0.356 ± 0.178
IleAsp: 2.67 ± 0.299
IleGlu: 3.026 ± 0.6
IlePhe: 0.801 ± 0.3
IleGly: 4.005 ± 0.74
IleHis: 0.445 ± 0.192
IleIle: 1.424 ± 0.253
IleLys: 2.492 ± 0.394
IleLeu: 3.382 ± 0.47
IleMet: 1.068 ± 0.283
IleAsn: 1.958 ± 0.379
IlePro: 2.403 ± 0.401
IleGln: 1.602 ± 0.359
IleArg: 2.759 ± 0.611
IleSer: 2.759 ± 0.417
IleThr: 3.916 ± 0.697
IleVal: 2.848 ± 0.537
IleTrp: 0.534 ± 0.182
IleTyr: 0.712 ± 0.231
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.498 ± 1.105
LysCys: 0.356 ± 0.141
LysAsp: 2.937 ± 0.51
LysGlu: 2.225 ± 0.425
LysPhe: 1.78 ± 0.444
LysGly: 4.806 ± 0.725
LysHis: 1.246 ± 0.388
LysIle: 1.157 ± 0.372
LysLys: 3.115 ± 0.716
LysLeu: 6.053 ± 0.908
LysMet: 1.869 ± 0.419
LysAsn: 1.869 ± 0.38
LysPro: 3.649 ± 0.604
LysGln: 2.67 ± 0.434
LysArg: 3.471 ± 0.561
LysSer: 2.225 ± 0.438
LysThr: 4.005 ± 0.587
LysVal: 4.183 ± 0.56
LysTrp: 0.712 ± 0.266
LysTyr: 0.801 ± 0.264
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.723 ± 1.24
LeuCys: 0.445 ± 0.205
LeuAsp: 5.874 ± 0.6
LeuGlu: 4.539 ± 0.644
LeuPhe: 1.513 ± 0.302
LeuGly: 6.32 ± 0.935
LeuHis: 1.424 ± 0.34
LeuIle: 3.026 ± 0.616
LeuLys: 5.162 ± 0.776
LeuLeu: 6.142 ± 0.729
LeuMet: 2.314 ± 0.542
LeuAsn: 3.204 ± 0.546
LeuPro: 4.183 ± 0.612
LeuGln: 3.204 ± 0.573
LeuArg: 4.539 ± 0.56
LeuSer: 4.717 ± 0.576
LeuThr: 4.361 ± 0.718
LeuVal: 5.429 ± 0.625
LeuTrp: 0.89 ± 0.323
LeuTyr: 3.115 ± 0.58
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.115 ± 0.384
MetCys: 0.089 ± 0.078
MetAsp: 1.691 ± 0.523
MetGlu: 1.602 ± 0.357
MetPhe: 1.335 ± 0.397
MetGly: 1.869 ± 0.461
MetHis: 0.356 ± 0.168
MetIle: 1.157 ± 0.361
MetLys: 1.157 ± 0.308
MetLeu: 3.026 ± 0.577
MetMet: 0.801 ± 0.261
MetAsn: 1.513 ± 0.409
MetPro: 2.225 ± 0.36
MetGln: 1.869 ± 0.509
MetArg: 2.047 ± 0.334
MetSer: 2.492 ± 0.58
MetThr: 2.047 ± 0.48
MetVal: 1.78 ± 0.371
MetTrp: 0.356 ± 0.147
MetTyr: 0.534 ± 0.236
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.717 ± 0.802
AsnCys: 0.089 ± 0.099
AsnAsp: 1.958 ± 0.423
AsnGlu: 2.047 ± 0.471
AsnPhe: 1.068 ± 0.285
AsnGly: 3.738 ± 0.671
AsnHis: 0.534 ± 0.243
AsnIle: 1.513 ± 0.373
AsnLys: 0.979 ± 0.269
AsnLeu: 2.225 ± 0.595
AsnMet: 1.246 ± 0.299
AsnAsn: 1.691 ± 0.411
AsnPro: 3.115 ± 0.562
AsnGln: 1.691 ± 0.475
AsnArg: 1.78 ± 0.553
AsnSer: 1.869 ± 0.484
AsnThr: 2.759 ± 0.573
AsnVal: 3.827 ± 0.588
AsnTrp: 0.534 ± 0.263
AsnTyr: 0.801 ± 0.272
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.765 ± 0.87
ProCys: 0.534 ± 0.262
ProAsp: 2.937 ± 0.497
ProGlu: 3.026 ± 0.516
ProPhe: 1.602 ± 0.38
ProGly: 4.005 ± 0.555
ProHis: 0.712 ± 0.226
ProIle: 2.581 ± 0.508
ProLys: 3.382 ± 0.682
ProLeu: 3.738 ± 0.621
ProMet: 1.157 ± 0.304
ProAsn: 1.691 ± 0.372
ProPro: 1.869 ± 0.503
ProGln: 1.513 ± 0.398
ProArg: 2.492 ± 0.481
ProSer: 2.225 ± 0.426
ProThr: 3.827 ± 0.839
ProVal: 3.916 ± 0.582
ProTrp: 0.356 ± 0.171
ProTyr: 1.958 ± 0.532
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.518 ± 0.768
GlnCys: 0.356 ± 0.158
GlnAsp: 2.403 ± 0.295
GlnGlu: 3.115 ± 0.408
GlnPhe: 1.602 ± 0.396
GlnGly: 4.005 ± 0.667
GlnHis: 0.979 ± 0.212
GlnIle: 1.78 ± 0.413
GlnLys: 1.335 ± 0.515
GlnLeu: 3.649 ± 0.499
GlnMet: 1.513 ± 0.453
GlnAsn: 1.157 ± 0.318
GlnPro: 1.78 ± 0.373
GlnGln: 3.115 ± 0.975
GlnArg: 3.738 ± 0.541
GlnSer: 2.314 ± 0.458
GlnThr: 2.492 ± 0.391
GlnVal: 3.916 ± 0.603
GlnTrp: 0.89 ± 0.289
GlnTyr: 1.691 ± 0.401
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 6.231 ± 0.839
ArgCys: 0.267 ± 0.131
ArgAsp: 2.759 ± 0.744
ArgGlu: 3.115 ± 0.569
ArgPhe: 1.869 ± 0.348
ArgGly: 3.916 ± 0.5
ArgHis: 1.068 ± 0.269
ArgIle: 2.67 ± 0.527
ArgLys: 4.094 ± 0.668
ArgLeu: 3.649 ± 0.591
ArgMet: 2.492 ± 0.437
ArgAsn: 1.068 ± 0.27
ArgPro: 2.136 ± 0.378
ArgGln: 2.937 ± 0.622
ArgArg: 1.869 ± 0.448
ArgSer: 2.67 ± 0.429
ArgThr: 3.204 ± 0.507
ArgVal: 3.649 ± 0.56
ArgTrp: 0.712 ± 0.203
ArgTyr: 2.403 ± 0.463
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.251 ± 0.752
SerCys: 0.356 ± 0.164
SerAsp: 2.581 ± 0.45
SerGlu: 1.78 ± 0.436
SerPhe: 1.513 ± 0.308
SerGly: 4.539 ± 0.692
SerHis: 0.534 ± 0.259
SerIle: 2.67 ± 0.415
SerLys: 3.827 ± 0.544
SerLeu: 3.56 ± 0.413
SerMet: 1.869 ± 0.402
SerAsn: 2.403 ± 0.632
SerPro: 2.492 ± 0.513
SerGln: 2.314 ± 0.319
SerArg: 1.691 ± 0.377
SerSer: 2.67 ± 0.564
SerThr: 3.56 ± 0.61
SerVal: 4.183 ± 0.608
SerTrp: 0.979 ± 0.26
SerTyr: 1.513 ± 0.373
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.498 ± 0.782
ThrCys: 0.623 ± 0.2
ThrAsp: 2.759 ± 0.488
ThrGlu: 2.937 ± 0.51
ThrPhe: 1.78 ± 0.399
ThrGly: 6.765 ± 0.95
ThrHis: 1.068 ± 0.266
ThrIle: 4.183 ± 0.744
ThrLys: 3.382 ± 0.459
ThrLeu: 6.409 ± 0.928
ThrMet: 2.047 ± 0.437
ThrAsn: 2.937 ± 0.604
ThrPro: 5.073 ± 0.7
ThrGln: 3.115 ± 0.438
ThrArg: 2.403 ± 0.449
ThrSer: 3.382 ± 0.556
ThrThr: 5.251 ± 1.002
ThrVal: 5.785 ± 0.692
ThrTrp: 0.979 ± 0.257
ThrTyr: 2.047 ± 0.476
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.1 ± 1.15
ValCys: 1.157 ± 0.313
ValAsp: 4.094 ± 0.55
ValGlu: 5.162 ± 0.81
ValPhe: 1.869 ± 0.406
ValGly: 5.696 ± 0.725
ValHis: 1.691 ± 0.451
ValIle: 2.848 ± 0.531
ValLys: 3.026 ± 0.49
ValLeu: 7.032 ± 0.659
ValMet: 1.869 ± 0.418
ValAsn: 3.026 ± 0.606
ValPro: 2.67 ± 0.417
ValGln: 3.382 ± 0.718
ValArg: 3.56 ± 0.499
ValSer: 3.738 ± 0.632
ValThr: 5.696 ± 0.8
ValVal: 5.785 ± 0.688
ValTrp: 0.801 ± 0.255
ValTyr: 1.958 ± 0.384
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.424 ± 0.324
TrpCys: 0.623 ± 0.21
TrpAsp: 0.712 ± 0.237
TrpGlu: 0.267 ± 0.157
TrpPhe: 0.445 ± 0.275
TrpGly: 0.623 ± 0.253
TrpHis: 0.534 ± 0.194
TrpIle: 0.89 ± 0.308
TrpLys: 0.712 ± 0.218
TrpLeu: 1.335 ± 0.319
TrpMet: 0.267 ± 0.133
TrpAsn: 0.534 ± 0.183
TrpPro: 0.712 ± 0.284
TrpGln: 0.89 ± 0.336
TrpArg: 0.979 ± 0.238
TrpSer: 0.712 ± 0.224
TrpThr: 0.712 ± 0.252
TrpVal: 1.068 ± 0.368
TrpTrp: 0.178 ± 0.119
TrpTyr: 0.356 ± 0.158
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.026 ± 0.434
TyrCys: 0.623 ± 0.291
TyrAsp: 1.958 ± 0.386
TyrGlu: 1.513 ± 0.401
TyrPhe: 1.335 ± 0.327
TyrGly: 1.958 ± 0.421
TyrHis: 0.801 ± 0.227
TyrIle: 1.246 ± 0.329
TyrLys: 1.869 ± 0.411
TyrLeu: 1.691 ± 0.357
TyrMet: 0.979 ± 0.286
TyrAsn: 1.335 ± 0.325
TyrPro: 1.602 ± 0.402
TyrGln: 1.602 ± 0.35
TyrArg: 1.78 ± 0.397
TyrSer: 1.246 ± 0.377
TyrThr: 2.492 ± 0.477
TyrVal: 2.314 ± 0.479
TyrTrp: 0.445 ± 0.181
TyrTyr: 0.712 ± 0.337
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 44 proteins (11236 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
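A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the resulting estimates is reported. This is only an illustrative sketch; the function names, the use of the standard deviation as the error measure, and the fixed random seed are assumptions, not details confirmed by the database.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (same number as input)
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of replicates
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not taken from the phage proteome)
toy = ["MKKAAL", "AAKKAA", "MAAAK"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.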

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski