Amino acid dipepetide frequency for Paracoccus phage vB_PthS_Pthi1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.313AlaAla: 18.313 ± 1.848
1.446AlaCys: 1.446 ± 0.372
6.747AlaAsp: 6.747 ± 0.716
7.871AlaGlu: 7.871 ± 0.795
3.775AlaPhe: 3.775 ± 0.693
10.361AlaGly: 10.361 ± 1.022
1.928AlaHis: 1.928 ± 0.35
5.141AlaIle: 5.141 ± 0.628
5.542AlaLys: 5.542 ± 0.96
10.602AlaLeu: 10.602 ± 0.999
3.855AlaMet: 3.855 ± 0.719
2.731AlaAsn: 2.731 ± 0.704
4.98AlaPro: 4.98 ± 0.736
3.936AlaGln: 3.936 ± 0.596
9.719AlaArg: 9.719 ± 1.148
6.426AlaSer: 6.426 ± 0.836
5.141AlaThr: 5.141 ± 0.533
7.068AlaVal: 7.068 ± 0.634
2.088AlaTrp: 2.088 ± 0.41
1.526AlaTyr: 1.526 ± 0.242
0.0AlaXaa: 0.0 ± 0.0
Cys
1.687CysAla: 1.687 ± 0.422
0.08CysCys: 0.08 ± 0.076
0.402CysAsp: 0.402 ± 0.15
0.321CysGlu: 0.321 ± 0.126
0.161CysPhe: 0.161 ± 0.102
1.205CysGly: 1.205 ± 0.4
0.241CysHis: 0.241 ± 0.189
0.562CysIle: 0.562 ± 0.192
0.08CysLys: 0.08 ± 0.073
0.723CysLeu: 0.723 ± 0.279
0.08CysMet: 0.08 ± 0.068
0.321CysAsn: 0.321 ± 0.133
0.964CysPro: 0.964 ± 0.235
0.321CysGln: 0.321 ± 0.188
0.884CysArg: 0.884 ± 0.295
0.321CysSer: 0.321 ± 0.192
0.562CysThr: 0.562 ± 0.217
0.562CysVal: 0.562 ± 0.242
0.08CysTrp: 0.08 ± 0.089
0.161CysTyr: 0.161 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
6.667AspAla: 6.667 ± 0.743
0.643AspCys: 0.643 ± 0.198
3.855AspAsp: 3.855 ± 0.525
3.614AspGlu: 3.614 ± 0.464
2.41AspPhe: 2.41 ± 0.34
5.863AspGly: 5.863 ± 0.584
1.365AspHis: 1.365 ± 0.311
2.972AspIle: 2.972 ± 0.504
2.249AspLys: 2.249 ± 0.52
6.827AspLeu: 6.827 ± 0.701
2.008AspMet: 2.008 ± 0.426
1.044AspAsn: 1.044 ± 0.284
4.177AspPro: 4.177 ± 0.556
3.133AspGln: 3.133 ± 0.468
5.542AspArg: 5.542 ± 0.565
1.606AspSer: 1.606 ± 0.295
2.972AspThr: 2.972 ± 0.396
4.177AspVal: 4.177 ± 0.74
1.526AspTrp: 1.526 ± 0.32
1.446AspTyr: 1.446 ± 0.388
0.0AspXaa: 0.0 ± 0.0
Glu
8.434GluAla: 8.434 ± 0.746
0.482GluCys: 0.482 ± 0.183
2.651GluAsp: 2.651 ± 0.508
3.133GluGlu: 3.133 ± 0.602
2.651GluPhe: 2.651 ± 0.499
4.659GluGly: 4.659 ± 0.65
0.643GluHis: 0.643 ± 0.256
3.052GluIle: 3.052 ± 0.48
2.249GluLys: 2.249 ± 0.523
5.382GluLeu: 5.382 ± 0.661
1.847GluMet: 1.847 ± 0.305
2.088GluAsn: 2.088 ± 0.483
2.49GluPro: 2.49 ± 0.464
1.847GluGln: 1.847 ± 0.469
4.659GluArg: 4.659 ± 0.701
2.41GluSer: 2.41 ± 0.487
3.133GluThr: 3.133 ± 0.353
3.534GluVal: 3.534 ± 0.545
1.205GluTrp: 1.205 ± 0.385
1.446GluTyr: 1.446 ± 0.271
0.0GluXaa: 0.0 ± 0.0
Phe
3.293PheAla: 3.293 ± 0.454
0.0PheCys: 0.0 ± 0.0
2.731PheAsp: 2.731 ± 0.448
1.606PheGlu: 1.606 ± 0.342
1.446PhePhe: 1.446 ± 0.256
4.096PheGly: 4.096 ± 0.557
0.402PheHis: 0.402 ± 0.148
1.124PheIle: 1.124 ± 0.281
1.365PheLys: 1.365 ± 0.384
2.249PheLeu: 2.249 ± 0.392
1.205PheMet: 1.205 ± 0.252
0.803PheAsn: 0.803 ± 0.152
1.044PhePro: 1.044 ± 0.233
1.365PheGln: 1.365 ± 0.322
1.606PheArg: 1.606 ± 0.354
1.928PheSer: 1.928 ± 0.475
1.285PheThr: 1.285 ± 0.411
1.928PheVal: 1.928 ± 0.312
1.285PheTrp: 1.285 ± 0.282
0.241PheTyr: 0.241 ± 0.109
0.0PheXaa: 0.0 ± 0.0
Gly
8.112GlyAla: 8.112 ± 0.784
1.044GlyCys: 1.044 ± 0.357
6.667GlyAsp: 6.667 ± 0.844
5.141GlyGlu: 5.141 ± 0.742
2.892GlyPhe: 2.892 ± 0.476
8.434GlyGly: 8.434 ± 0.72
1.446GlyHis: 1.446 ± 0.385
4.9GlyIle: 4.9 ± 0.514
3.855GlyLys: 3.855 ± 0.528
7.149GlyLeu: 7.149 ± 0.885
1.687GlyMet: 1.687 ± 0.362
3.133GlyAsn: 3.133 ± 0.515
4.578GlyPro: 4.578 ± 0.607
4.819GlyGln: 4.819 ± 0.946
6.345GlyArg: 6.345 ± 0.74
4.739GlySer: 4.739 ± 0.675
4.739GlyThr: 4.739 ± 0.633
6.345GlyVal: 6.345 ± 0.644
1.847GlyTrp: 1.847 ± 0.407
2.972GlyTyr: 2.972 ± 0.482
0.0GlyXaa: 0.0 ± 0.0
His
2.008HisAla: 2.008 ± 0.439
0.241HisCys: 0.241 ± 0.113
1.044HisAsp: 1.044 ± 0.298
1.285HisGlu: 1.285 ± 0.331
0.562HisPhe: 0.562 ± 0.197
1.687HisGly: 1.687 ± 0.315
0.562HisHis: 0.562 ± 0.209
0.321HisIle: 0.321 ± 0.17
0.562HisLys: 0.562 ± 0.212
2.088HisLeu: 2.088 ± 0.548
0.562HisMet: 0.562 ± 0.169
0.402HisAsn: 0.402 ± 0.159
1.446HisPro: 1.446 ± 0.437
0.321HisGln: 0.321 ± 0.123
1.767HisArg: 1.767 ± 0.401
0.402HisSer: 0.402 ± 0.149
0.562HisThr: 0.562 ± 0.194
1.446HisVal: 1.446 ± 0.282
0.482HisTrp: 0.482 ± 0.255
0.402HisTyr: 0.402 ± 0.165
0.0HisXaa: 0.0 ± 0.0
Ile
5.382IleAla: 5.382 ± 0.543
0.723IleCys: 0.723 ± 0.247
3.695IleAsp: 3.695 ± 0.55
4.257IleGlu: 4.257 ± 0.601
1.606IlePhe: 1.606 ± 0.314
5.141IleGly: 5.141 ± 0.63
0.884IleHis: 0.884 ± 0.242
1.446IleIle: 1.446 ± 0.345
1.767IleLys: 1.767 ± 0.358
2.731IleLeu: 2.731 ± 0.532
1.365IleMet: 1.365 ± 0.447
1.767IleAsn: 1.767 ± 0.288
3.373IlePro: 3.373 ± 0.487
1.446IleGln: 1.446 ± 0.383
3.855IleArg: 3.855 ± 0.477
3.293IleSer: 3.293 ± 0.571
3.373IleThr: 3.373 ± 0.654
2.249IleVal: 2.249 ± 0.365
1.205IleTrp: 1.205 ± 0.25
0.723IleTyr: 0.723 ± 0.262
0.0IleXaa: 0.0 ± 0.0
Lys
6.506LysAla: 6.506 ± 0.844
0.241LysCys: 0.241 ± 0.136
1.847LysAsp: 1.847 ± 0.509
1.606LysGlu: 1.606 ± 0.415
0.884LysPhe: 0.884 ± 0.226
3.293LysGly: 3.293 ± 0.631
0.723LysHis: 0.723 ± 0.236
1.687LysIle: 1.687 ± 0.308
1.847LysLys: 1.847 ± 0.498
3.454LysLeu: 3.454 ± 0.554
0.884LysMet: 0.884 ± 0.194
0.964LysAsn: 0.964 ± 0.301
1.767LysPro: 1.767 ± 0.318
1.365LysGln: 1.365 ± 0.524
2.329LysArg: 2.329 ± 0.512
1.687LysSer: 1.687 ± 0.516
2.651LysThr: 2.651 ± 0.578
3.373LysVal: 3.373 ± 0.526
0.643LysTrp: 0.643 ± 0.243
0.321LysTyr: 0.321 ± 0.144
0.0LysXaa: 0.0 ± 0.0
Leu
8.835LeuAla: 8.835 ± 1.041
1.205LeuCys: 1.205 ± 0.269
6.426LeuAsp: 6.426 ± 0.766
4.096LeuGlu: 4.096 ± 0.637
2.249LeuPhe: 2.249 ± 0.469
5.863LeuGly: 5.863 ± 0.676
1.285LeuHis: 1.285 ± 0.377
5.141LeuIle: 5.141 ± 0.681
3.855LeuLys: 3.855 ± 0.662
5.301LeuLeu: 5.301 ± 0.539
2.731LeuMet: 2.731 ± 0.54
2.329LeuAsn: 2.329 ± 0.308
4.659LeuPro: 4.659 ± 0.572
3.293LeuGln: 3.293 ± 0.45
7.068LeuArg: 7.068 ± 0.711
6.506LeuSer: 6.506 ± 0.632
5.622LeuThr: 5.622 ± 0.527
4.98LeuVal: 4.98 ± 0.719
0.884LeuTrp: 0.884 ± 0.212
1.526LeuTyr: 1.526 ± 0.288
0.0LeuXaa: 0.0 ± 0.0
Met
4.096MetAla: 4.096 ± 0.514
0.08MetCys: 0.08 ± 0.075
1.928MetAsp: 1.928 ± 0.297
1.365MetGlu: 1.365 ± 0.275
0.562MetPhe: 0.562 ± 0.274
1.285MetGly: 1.285 ± 0.321
0.402MetHis: 0.402 ± 0.161
1.928MetIle: 1.928 ± 0.371
0.803MetLys: 0.803 ± 0.264
2.49MetLeu: 2.49 ± 0.538
0.482MetMet: 0.482 ± 0.198
1.044MetAsn: 1.044 ± 0.288
1.124MetPro: 1.124 ± 0.252
1.767MetGln: 1.767 ± 0.43
2.249MetArg: 2.249 ± 0.462
2.329MetSer: 2.329 ± 0.509
1.767MetThr: 1.767 ± 0.323
1.767MetVal: 1.767 ± 0.459
0.402MetTrp: 0.402 ± 0.165
0.482MetTyr: 0.482 ± 0.199
0.0MetXaa: 0.0 ± 0.0
Asn
2.49AsnAla: 2.49 ± 0.514
0.08AsnCys: 0.08 ± 0.08
1.606AsnAsp: 1.606 ± 0.377
1.526AsnGlu: 1.526 ± 0.314
0.723AsnPhe: 0.723 ± 0.334
3.373AsnGly: 3.373 ± 0.589
0.402AsnHis: 0.402 ± 0.178
1.687AsnIle: 1.687 ± 0.373
0.964AsnLys: 0.964 ± 0.324
2.49AsnLeu: 2.49 ± 0.427
1.205AsnMet: 1.205 ± 0.302
0.643AsnAsn: 0.643 ± 0.258
1.847AsnPro: 1.847 ± 0.339
1.044AsnGln: 1.044 ± 0.318
2.41AsnArg: 2.41 ± 0.339
1.606AsnSer: 1.606 ± 0.36
1.285AsnThr: 1.285 ± 0.404
0.964AsnVal: 0.964 ± 0.251
0.803AsnTrp: 0.803 ± 0.212
0.723AsnTyr: 0.723 ± 0.238
0.0AsnXaa: 0.0 ± 0.0
Pro
5.221ProAla: 5.221 ± 0.654
0.482ProCys: 0.482 ± 0.177
4.498ProAsp: 4.498 ± 0.611
3.614ProGlu: 3.614 ± 0.699
1.767ProPhe: 1.767 ± 0.345
4.9ProGly: 4.9 ± 0.724
1.124ProHis: 1.124 ± 0.337
3.133ProIle: 3.133 ± 0.63
1.767ProLys: 1.767 ± 0.407
3.213ProLeu: 3.213 ± 0.387
1.285ProMet: 1.285 ± 0.314
2.088ProAsn: 2.088 ± 0.346
3.133ProPro: 3.133 ± 0.579
2.249ProGln: 2.249 ± 0.357
3.213ProArg: 3.213 ± 0.44
2.972ProSer: 2.972 ± 0.433
1.767ProThr: 1.767 ± 0.405
4.578ProVal: 4.578 ± 0.557
0.482ProTrp: 0.482 ± 0.231
0.643ProTyr: 0.643 ± 0.226
0.0ProXaa: 0.0 ± 0.0
Gln
4.9GlnAla: 4.9 ± 0.643
0.321GlnCys: 0.321 ± 0.146
1.847GlnAsp: 1.847 ± 0.367
2.249GlnGlu: 2.249 ± 0.38
0.643GlnPhe: 0.643 ± 0.218
3.293GlnGly: 3.293 ± 0.377
0.964GlnHis: 0.964 ± 0.223
2.651GlnIle: 2.651 ± 0.331
1.205GlnLys: 1.205 ± 0.333
2.731GlnLeu: 2.731 ± 0.459
1.205GlnMet: 1.205 ± 0.303
0.964GlnAsn: 0.964 ± 0.313
2.329GlnPro: 2.329 ± 0.466
1.928GlnGln: 1.928 ± 0.355
3.775GlnArg: 3.775 ± 0.7
2.169GlnSer: 2.169 ± 0.473
2.008GlnThr: 2.008 ± 0.372
2.41GlnVal: 2.41 ± 0.525
0.402GlnTrp: 0.402 ± 0.155
0.803GlnTyr: 0.803 ± 0.254
0.0GlnXaa: 0.0 ± 0.0
Arg
8.193ArgAla: 8.193 ± 0.897
1.044ArgCys: 1.044 ± 0.263
5.141ArgAsp: 5.141 ± 0.773
4.498ArgGlu: 4.498 ± 0.899
2.811ArgPhe: 2.811 ± 0.428
7.068ArgGly: 7.068 ± 0.683
1.526ArgHis: 1.526 ± 0.324
4.659ArgIle: 4.659 ± 0.524
2.57ArgLys: 2.57 ± 0.497
8.594ArgLeu: 8.594 ± 0.895
3.133ArgMet: 3.133 ± 0.471
1.365ArgAsn: 1.365 ± 0.266
3.293ArgPro: 3.293 ± 0.486
2.49ArgGln: 2.49 ± 0.423
6.345ArgArg: 6.345 ± 0.688
4.337ArgSer: 4.337 ± 0.709
3.373ArgThr: 3.373 ± 0.606
4.819ArgVal: 4.819 ± 0.61
1.847ArgTrp: 1.847 ± 0.385
2.329ArgTyr: 2.329 ± 0.507
0.0ArgXaa: 0.0 ± 0.0
Ser
7.149SerAla: 7.149 ± 0.927
0.161SerCys: 0.161 ± 0.124
2.731SerAsp: 2.731 ± 0.463
3.133SerGlu: 3.133 ± 0.528
1.446SerPhe: 1.446 ± 0.421
6.345SerGly: 6.345 ± 0.902
1.205SerHis: 1.205 ± 0.387
2.731SerIle: 2.731 ± 0.517
2.008SerLys: 2.008 ± 0.287
4.016SerLeu: 4.016 ± 0.506
1.044SerMet: 1.044 ± 0.322
2.169SerAsn: 2.169 ± 0.313
3.133SerPro: 3.133 ± 0.613
1.928SerGln: 1.928 ± 0.423
4.016SerArg: 4.016 ± 0.521
2.892SerSer: 2.892 ± 0.584
2.731SerThr: 2.731 ± 0.438
3.614SerVal: 3.614 ± 0.533
1.124SerTrp: 1.124 ± 0.231
1.446SerTyr: 1.446 ± 0.255
0.0SerXaa: 0.0 ± 0.0
Thr
6.586ThrAla: 6.586 ± 0.581
0.643ThrCys: 0.643 ± 0.301
2.811ThrAsp: 2.811 ± 0.466
2.57ThrGlu: 2.57 ± 0.397
1.446ThrPhe: 1.446 ± 0.334
6.185ThrGly: 6.185 ± 0.936
0.964ThrHis: 0.964 ± 0.274
2.57ThrIle: 2.57 ± 0.38
1.767ThrLys: 1.767 ± 0.318
5.382ThrLeu: 5.382 ± 0.479
0.964ThrMet: 0.964 ± 0.193
1.285ThrAsn: 1.285 ± 0.347
2.892ThrPro: 2.892 ± 0.458
1.205ThrGln: 1.205 ± 0.246
3.775ThrArg: 3.775 ± 0.494
2.169ThrSer: 2.169 ± 0.458
1.687ThrThr: 1.687 ± 0.311
3.855ThrVal: 3.855 ± 0.543
1.044ThrTrp: 1.044 ± 0.301
1.205ThrTyr: 1.205 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
6.988ValAla: 6.988 ± 0.761
0.562ValCys: 0.562 ± 0.166
5.382ValAsp: 5.382 ± 0.588
3.936ValGlu: 3.936 ± 0.626
1.928ValPhe: 1.928 ± 0.386
4.578ValGly: 4.578 ± 0.702
1.124ValHis: 1.124 ± 0.25
2.088ValIle: 2.088 ± 0.425
2.811ValLys: 2.811 ± 0.313
4.659ValLeu: 4.659 ± 0.511
1.847ValMet: 1.847 ± 0.345
2.008ValAsn: 2.008 ± 0.363
3.454ValPro: 3.454 ± 0.604
2.651ValGln: 2.651 ± 0.44
6.104ValArg: 6.104 ± 0.654
4.337ValSer: 4.337 ± 0.529
4.257ValThr: 4.257 ± 0.519
3.293ValVal: 3.293 ± 0.632
1.205ValTrp: 1.205 ± 0.242
0.884ValTyr: 0.884 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
1.928TrpAla: 1.928 ± 0.401
0.161TrpCys: 0.161 ± 0.11
1.205TrpAsp: 1.205 ± 0.297
0.884TrpGlu: 0.884 ± 0.256
0.723TrpPhe: 0.723 ± 0.238
1.606TrpGly: 1.606 ± 0.369
0.482TrpHis: 0.482 ± 0.222
1.124TrpIle: 1.124 ± 0.28
0.482TrpLys: 0.482 ± 0.213
1.687TrpLeu: 1.687 ± 0.45
0.402TrpMet: 0.402 ± 0.231
0.241TrpAsn: 0.241 ± 0.115
0.803TrpPro: 0.803 ± 0.269
0.964TrpGln: 0.964 ± 0.295
1.847TrpArg: 1.847 ± 0.327
1.687TrpSer: 1.687 ± 0.377
1.044TrpThr: 1.044 ± 0.281
1.526TrpVal: 1.526 ± 0.284
0.482TrpTrp: 0.482 ± 0.17
0.402TrpTyr: 0.402 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.41TyrAla: 2.41 ± 0.378
0.08TyrCys: 0.08 ± 0.073
0.884TyrAsp: 0.884 ± 0.359
1.285TyrGlu: 1.285 ± 0.358
0.321TyrPhe: 0.321 ± 0.218
1.606TyrGly: 1.606 ± 0.382
0.482TyrHis: 0.482 ± 0.193
1.365TyrIle: 1.365 ± 0.387
0.321TyrLys: 0.321 ± 0.165
2.088TyrLeu: 2.088 ± 0.377
0.482TyrMet: 0.482 ± 0.214
0.482TyrAsn: 0.482 ± 0.167
0.643TyrPro: 0.643 ± 0.209
0.723TyrGln: 0.723 ± 0.323
1.928TyrArg: 1.928 ± 0.379
1.205TyrSer: 1.205 ± 0.302
1.044TyrThr: 1.044 ± 0.257
1.687TyrVal: 1.687 ± 0.28
0.643TyrTrp: 0.643 ± 0.252
0.723TyrTyr: 0.723 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (12451 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski