Amino acid dipeptide frequency for Pelagibacter phage HTVC010P

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
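As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the code used to build this page: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries), that any non-standard letter is mapped to Xaa, and that counts are normalized by the total number of dipeptides. The database may normalize differently.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table (Ala, Cys, Asp, ..., Tyr).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = AMINO_ACIDS + "X"  # "X" stands for Xaa (non-standard or unknown)


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein sequence."""
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        for first, second in zip(seq, seq[1:]):
            # Assumption: every letter outside the 20 standard ones becomes X.
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    # Two toy sequences, not taken from the phage proteome.
    toy = ["MKTAYIAKQR", "MTTTSGSA"]
    freqs = dipeptide_per_mille(toy)
    print(freqs["TT"], freqs["AK"])
```

Counting within each protein avoids creating artificial dipeptides from the last residue of one protein and the first residue of the next.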

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins uniformly at random, each of them would be present with a frequency of 1/400, i.e. 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰), and therefore need to be multiplied by 10^-3 to obtain fractions.
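The comparison against the uniform expectation and the unit conversion can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the plain "greater than or less than 2.5‰" rule is an assumption, since the page does not state whether its colouring also takes the error into account.

```python
# Uniform expectation for 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple threshold comparison, ignoring the error estimate.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Three values copied from the table on this page.
    for name, value in {"ThrThr": 9.058, "AlaAla": 4.529, "AlaCys": 0.289}.items():
        print(name, per_mille_to_fraction(value), classify(value))
```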

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue. Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.529 ± 0.815
AlaCys: 0.289 ± 0.177
AlaAsp: 5.107 ± 0.848
AlaGlu: 3.854 ± 0.826
AlaPhe: 2.987 ± 0.618
AlaGly: 5.396 ± 0.793
AlaHis: 0.385 ± 0.18
AlaIle: 4.432 ± 0.608
AlaLys: 5.107 ± 0.981
AlaLeu: 4.432 ± 0.662
AlaMet: 2.602 ± 0.611
AlaAsn: 4.24 ± 0.798
AlaPro: 2.794 ± 0.76
AlaGln: 3.373 ± 0.804
AlaArg: 4.625 ± 0.813
AlaSer: 4.818 ± 0.989
AlaThr: 5.492 ± 1.025
AlaVal: 4.722 ± 0.812
AlaTrp: 1.06 ± 0.287
AlaTyr: 2.794 ± 0.366
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.096 ± 0.088
CysCys: 0.096 ± 0.104
CysAsp: 0.385 ± 0.213
CysGlu: 0.771 ± 0.26
CysPhe: 0.482 ± 0.214
CysGly: 0.096 ± 0.087
CysHis: 0.289 ± 0.173
CysIle: 0.482 ± 0.209
CysLys: 0.867 ± 0.269
CysLeu: 0.771 ± 0.363
CysMet: 0.482 ± 0.223
CysAsn: 0.385 ± 0.22
CysPro: 0.289 ± 0.186
CysGln: 0.289 ± 0.16
CysArg: 0.289 ± 0.166
CysSer: 0.771 ± 0.3
CysThr: 0.289 ± 0.183
CysVal: 0.675 ± 0.251
CysTrp: 0.193 ± 0.147
CysTyr: 0.385 ± 0.257
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.589 ± 0.877
AspCys: 0.385 ± 0.193
AspAsp: 4.432 ± 0.686
AspGlu: 2.602 ± 0.398
AspPhe: 2.409 ± 0.397
AspGly: 4.336 ± 0.688
AspHis: 1.349 ± 0.463
AspIle: 5.974 ± 0.577
AspLys: 5.589 ± 0.684
AspLeu: 5.011 ± 0.628
AspMet: 1.542 ± 0.499
AspAsn: 3.373 ± 0.61
AspPro: 2.505 ± 0.453
AspGln: 2.409 ± 0.456
AspArg: 2.024 ± 0.428
AspSer: 4.047 ± 0.633
AspThr: 3.469 ± 0.856
AspVal: 3.276 ± 0.452
AspTrp: 1.156 ± 0.4
AspTyr: 3.18 ± 0.5
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.373 ± 0.655
GluCys: 0.578 ± 0.255
GluAsp: 2.505 ± 0.56
GluGlu: 4.24 ± 0.842
GluPhe: 2.891 ± 0.453
GluGly: 3.565 ± 0.61
GluHis: 0.964 ± 0.332
GluIle: 4.143 ± 0.746
GluLys: 4.625 ± 0.675
GluLeu: 5.396 ± 0.767
GluMet: 1.349 ± 0.351
GluAsn: 2.698 ± 0.506
GluPro: 1.542 ± 0.385
GluGln: 3.565 ± 0.722
GluArg: 1.927 ± 0.361
GluSer: 2.987 ± 0.584
GluThr: 3.565 ± 0.591
GluVal: 4.625 ± 0.73
GluTrp: 0.675 ± 0.317
GluTyr: 2.794 ± 0.524
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.469 ± 0.515
PheCys: 0.193 ± 0.13
PheAsp: 3.662 ± 0.533
PheGlu: 3.276 ± 0.574
PhePhe: 1.06 ± 0.435
PheGly: 1.445 ± 0.376
PheHis: 0.193 ± 0.131
PheIle: 2.794 ± 0.519
PheLys: 3.565 ± 0.635
PheLeu: 2.891 ± 0.565
PheMet: 1.349 ± 0.352
PheAsn: 2.987 ± 0.503
PhePro: 0.771 ± 0.299
PheGln: 1.927 ± 0.41
PheArg: 2.024 ± 0.557
PheSer: 2.602 ± 0.52
PheThr: 4.432 ± 0.656
PheVal: 2.409 ± 0.544
PheTrp: 0.193 ± 0.131
PheTyr: 1.927 ± 0.482
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.011 ± 0.743
GlyCys: 0.482 ± 0.193
GlyAsp: 4.432 ± 0.526
GlyGlu: 2.216 ± 0.454
GlyPhe: 2.698 ± 0.482
GlyGly: 3.373 ± 0.616
GlyHis: 0.578 ± 0.237
GlyIle: 3.662 ± 0.585
GlyLys: 4.625 ± 0.59
GlyLeu: 4.818 ± 0.737
GlyMet: 1.638 ± 0.312
GlyAsn: 3.276 ± 0.72
GlyPro: 1.253 ± 0.314
GlyGln: 2.024 ± 0.376
GlyArg: 2.409 ± 0.456
GlySer: 5.878 ± 0.955
GlyThr: 4.818 ± 0.761
GlyVal: 3.662 ± 0.614
GlyTrp: 0.482 ± 0.2
GlyTyr: 2.794 ± 0.483
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.867 ± 0.341
HisCys: 0.193 ± 0.139
HisAsp: 1.445 ± 0.518
HisGlu: 0.578 ± 0.218
HisPhe: 0.964 ± 0.273
HisGly: 0.867 ± 0.298
HisHis: 0.385 ± 0.171
HisIle: 1.542 ± 0.358
HisLys: 1.06 ± 0.281
HisLeu: 1.06 ± 0.259
HisMet: 0.289 ± 0.181
HisAsn: 1.06 ± 0.335
HisPro: 1.253 ± 0.424
HisGln: 0.385 ± 0.176
HisArg: 0.771 ± 0.305
HisSer: 0.675 ± 0.234
HisThr: 0.675 ± 0.301
HisVal: 0.385 ± 0.191
HisTrp: 0.289 ± 0.161
HisTyr: 0.867 ± 0.269
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.492 ± 0.809
IleCys: 0.482 ± 0.257
IleAsp: 5.781 ± 0.701
IleGlu: 4.529 ± 0.875
IlePhe: 2.602 ± 0.431
IleGly: 4.047 ± 0.621
IleHis: 1.156 ± 0.272
IleIle: 4.529 ± 0.755
IleLys: 7.323 ± 1.095
IleLeu: 5.685 ± 0.74
IleMet: 1.156 ± 0.311
IleAsn: 4.432 ± 0.678
IlePro: 2.313 ± 0.481
IleGln: 2.313 ± 0.452
IleArg: 2.698 ± 0.581
IleSer: 3.469 ± 0.689
IleThr: 5.396 ± 0.675
IleVal: 2.987 ± 0.434
IleTrp: 0.482 ± 0.229
IleTyr: 2.987 ± 0.618
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.432 ± 0.677
LysCys: 0.482 ± 0.238
LysAsp: 5.107 ± 0.619
LysGlu: 5.3 ± 0.911
LysPhe: 2.505 ± 0.39
LysGly: 3.758 ± 0.629
LysHis: 1.445 ± 0.313
LysIle: 7.227 ± 0.932
LysLys: 7.805 ± 1.133
LysLeu: 6.649 ± 0.823
LysMet: 3.373 ± 0.574
LysAsn: 5.589 ± 0.951
LysPro: 2.409 ± 0.481
LysGln: 4.047 ± 0.648
LysArg: 2.987 ± 0.467
LysSer: 3.951 ± 0.775
LysThr: 5.781 ± 0.689
LysVal: 4.625 ± 0.709
LysTrp: 0.385 ± 0.195
LysTyr: 3.565 ± 0.622
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.071 ± 0.767
LeuCys: 0.867 ± 0.306
LeuAsp: 4.625 ± 0.603
LeuGlu: 4.914 ± 0.73
LeuPhe: 3.469 ± 0.588
LeuGly: 4.818 ± 0.658
LeuHis: 1.06 ± 0.287
LeuIle: 4.529 ± 0.616
LeuLys: 5.107 ± 0.927
LeuLeu: 4.336 ± 0.88
LeuMet: 1.638 ± 0.364
LeuAsn: 5.107 ± 0.879
LeuPro: 2.794 ± 0.676
LeuGln: 2.698 ± 0.442
LeuArg: 3.373 ± 0.589
LeuSer: 5.781 ± 0.683
LeuThr: 5.203 ± 0.668
LeuVal: 3.758 ± 0.544
LeuTrp: 0.193 ± 0.12
LeuTyr: 2.505 ± 0.442
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.313 ± 0.475
MetCys: 0.193 ± 0.133
MetAsp: 1.349 ± 0.314
MetGlu: 2.024 ± 0.425
MetPhe: 0.964 ± 0.238
MetGly: 1.253 ± 0.446
MetHis: 0.385 ± 0.241
MetIle: 1.734 ± 0.392
MetLys: 2.12 ± 0.551
MetLeu: 2.794 ± 0.591
MetMet: 0.482 ± 0.218
MetAsn: 1.253 ± 0.37
MetPro: 0.675 ± 0.276
MetGln: 1.156 ± 0.516
MetArg: 0.964 ± 0.345
MetSer: 2.024 ± 0.595
MetThr: 2.505 ± 0.384
MetVal: 1.156 ± 0.37
MetTrp: 0.0 ± 0.0
MetTyr: 0.867 ± 0.271
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.047 ± 0.655
AsnCys: 0.675 ± 0.264
AsnAsp: 3.854 ± 0.65
AsnGlu: 4.432 ± 0.796
AsnPhe: 3.083 ± 0.473
AsnGly: 4.047 ± 0.596
AsnHis: 0.867 ± 0.21
AsnIle: 5.203 ± 0.638
AsnLys: 4.722 ± 0.73
AsnLeu: 4.336 ± 0.604
AsnMet: 1.638 ± 0.431
AsnAsn: 4.914 ± 0.875
AsnPro: 2.313 ± 0.4
AsnGln: 2.216 ± 0.523
AsnArg: 2.313 ± 0.55
AsnSer: 3.18 ± 0.527
AsnThr: 3.951 ± 0.714
AsnVal: 2.794 ± 0.45
AsnTrp: 0.193 ± 0.143
AsnTyr: 2.987 ± 0.703
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.216 ± 0.552
ProCys: 0.0 ± 0.0
ProAsp: 1.927 ± 0.398
ProGlu: 1.638 ± 0.513
ProPhe: 2.216 ± 0.508
ProGly: 1.638 ± 0.348
ProHis: 0.482 ± 0.191
ProIle: 2.216 ± 0.585
ProLys: 2.024 ± 0.528
ProLeu: 2.216 ± 0.718
ProMet: 0.385 ± 0.175
ProAsn: 2.216 ± 0.414
ProPro: 0.867 ± 0.415
ProGln: 1.349 ± 0.269
ProArg: 1.831 ± 0.461
ProSer: 2.987 ± 0.508
ProThr: 2.409 ± 0.513
ProVal: 1.734 ± 0.421
ProTrp: 0.193 ± 0.118
ProTyr: 1.349 ± 0.427
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.891 ± 0.668
GlnCys: 0.385 ± 0.19
GlnAsp: 2.602 ± 0.468
GlnGlu: 3.18 ± 0.585
GlnPhe: 1.06 ± 0.252
GlnGly: 2.409 ± 0.562
GlnHis: 0.482 ± 0.193
GlnIle: 2.216 ± 0.44
GlnLys: 4.143 ± 0.834
GlnLeu: 3.565 ± 0.602
GlnMet: 1.638 ± 0.391
GlnAsn: 1.927 ± 0.409
GlnPro: 1.253 ± 0.335
GlnGln: 3.854 ± 0.921
GlnArg: 1.831 ± 0.542
GlnSer: 2.698 ± 0.511
GlnThr: 2.891 ± 0.661
GlnVal: 2.602 ± 0.565
GlnTrp: 0.289 ± 0.164
GlnTyr: 0.771 ± 0.277
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.602 ± 0.631
ArgCys: 0.385 ± 0.197
ArgAsp: 1.927 ± 0.552
ArgGlu: 2.024 ± 0.402
ArgPhe: 2.794 ± 0.43
ArgGly: 2.505 ± 0.422
ArgHis: 1.06 ± 0.305
ArgIle: 3.662 ± 0.556
ArgLys: 4.047 ± 0.73
ArgLeu: 2.891 ± 0.545
ArgMet: 1.06 ± 0.288
ArgAsn: 1.831 ± 0.422
ArgPro: 1.445 ± 0.362
ArgGln: 2.12 ± 0.429
ArgArg: 2.505 ± 0.501
ArgSer: 2.891 ± 0.535
ArgThr: 1.831 ± 0.473
ArgVal: 1.927 ± 0.379
ArgTrp: 0.289 ± 0.166
ArgTyr: 1.445 ± 0.341
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.589 ± 1.044
SerCys: 0.771 ± 0.246
SerAsp: 3.854 ± 0.726
SerGlu: 2.891 ± 0.669
SerPhe: 3.083 ± 0.492
SerGly: 6.456 ± 1.37
SerHis: 1.253 ± 0.3
SerIle: 4.336 ± 0.591
SerLys: 3.758 ± 0.71
SerLeu: 4.432 ± 0.651
SerMet: 1.349 ± 0.354
SerAsn: 4.722 ± 0.649
SerPro: 1.542 ± 0.374
SerGln: 2.12 ± 0.373
SerArg: 2.024 ± 0.457
SerSer: 6.167 ± 0.977
SerThr: 6.552 ± 1.414
SerVal: 3.565 ± 0.558
SerTrp: 0.771 ± 0.248
SerTyr: 2.505 ± 0.402
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.42 ± 0.936
ThrCys: 0.385 ± 0.189
ThrAsp: 5.011 ± 1.016
ThrGlu: 3.565 ± 0.647
ThrPhe: 3.565 ± 0.642
ThrGly: 5.107 ± 1.171
ThrHis: 1.156 ± 0.318
ThrIle: 4.722 ± 0.86
ThrLys: 5.396 ± 0.627
ThrLeu: 5.781 ± 0.757
ThrMet: 1.349 ± 0.29
ThrAsn: 4.432 ± 0.719
ThrPro: 2.313 ± 0.381
ThrGln: 2.313 ± 0.467
ThrArg: 2.698 ± 0.421
ThrSer: 4.914 ± 0.928
ThrThr: 9.058 ± 1.618
ThrVal: 4.143 ± 0.924
ThrTrp: 0.675 ± 0.317
ThrTyr: 2.602 ± 0.436
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.625 ± 0.696
ValCys: 0.771 ± 0.261
ValAsp: 3.373 ± 0.618
ValGlu: 3.662 ± 0.73
ValPhe: 2.794 ± 0.58
ValGly: 2.505 ± 0.612
ValHis: 0.578 ± 0.216
ValIle: 2.891 ± 0.413
ValLys: 4.529 ± 0.572
ValLeu: 3.373 ± 0.58
ValMet: 1.06 ± 0.293
ValAsn: 3.565 ± 0.576
ValPro: 2.505 ± 0.615
ValGln: 1.831 ± 0.382
ValArg: 2.12 ± 0.452
ValSer: 5.203 ± 0.864
ValThr: 4.529 ± 1.004
ValVal: 3.373 ± 0.466
ValTrp: 0.289 ± 0.171
ValTyr: 1.927 ± 0.359
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.096 ± 0.102
TrpAsp: 0.771 ± 0.324
TrpGlu: 0.385 ± 0.19
TrpPhe: 0.289 ± 0.156
TrpGly: 0.289 ± 0.16
TrpHis: 0.193 ± 0.121
TrpIle: 0.675 ± 0.295
TrpLys: 0.771 ± 0.258
TrpLeu: 0.771 ± 0.316
TrpMet: 0.193 ± 0.142
TrpAsn: 0.771 ± 0.272
TrpPro: 0.0 ± 0.0
TrpGln: 0.675 ± 0.273
TrpArg: 0.289 ± 0.181
TrpSer: 0.482 ± 0.189
TrpThr: 0.578 ± 0.238
TrpVal: 0.578 ± 0.233
TrpTrp: 0.193 ± 0.134
TrpTyr: 0.578 ± 0.249
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.409 ± 0.401
TyrCys: 0.675 ± 0.228
TyrAsp: 2.505 ± 0.524
TyrGlu: 1.927 ± 0.401
TyrPhe: 1.253 ± 0.294
TyrGly: 2.216 ± 0.515
TyrHis: 1.349 ± 0.353
TyrIle: 2.891 ± 0.573
TyrLys: 4.24 ± 0.747
TyrLeu: 1.638 ± 0.385
TyrMet: 1.542 ± 0.36
TyrAsn: 2.987 ± 0.717
TyrPro: 1.06 ± 0.26
TyrGln: 2.024 ± 0.501
TyrArg: 1.542 ± 0.375
TyrSer: 2.216 ± 0.378
TyrThr: 3.18 ± 0.442
TyrVal: 2.505 ± 0.477
TyrTrp: 0.578 ± 0.207
TyrTyr: 1.927 ± 0.578
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 64 proteins (10379 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
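A protein-level bootstrap of this kind can be sketched in Python as shown below. This is an assumption-laden illustration rather than the procedure actually used for this page: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the sample standard deviation of the replicate frequencies.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes, e.g. "TT") in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping pairs within a single protein only.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Each replicate draws as many proteins as the original set contains.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    toy = ["MKTTAYIAKQR", "MTTTSGSA", "MKKLLTT", "MSSGA"]
    print(dipeptide_per_mille(toy, "TT"), bootstrap_error(toy, "TT"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the statistic depends on which proteins are in the set. A dipeptide that never occurs yields 0.0 in every replicate, which matches the 0.0 ± 0.0 entries in the table.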

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski