Amino acid dipepetide frequency for Streptococcus phage Javan580

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.623AlaAla: 3.623 ± 1.091
0.315AlaCys: 0.315 ± 0.276
4.568AlaAsp: 4.568 ± 1.003
6.459AlaGlu: 6.459 ± 1.045
3.308AlaPhe: 3.308 ± 0.653
6.931AlaGly: 6.931 ± 1.896
0.473AlaHis: 0.473 ± 0.534
7.246AlaIle: 7.246 ± 1.377
7.404AlaLys: 7.404 ± 1.007
7.246AlaLeu: 7.246 ± 0.912
1.575AlaMet: 1.575 ± 0.587
5.198AlaAsn: 5.198 ± 0.846
1.418AlaPro: 1.418 ± 0.418
3.938AlaGln: 3.938 ± 0.643
2.993AlaArg: 2.993 ± 0.735
6.301AlaSer: 6.301 ± 1.226
7.876AlaThr: 7.876 ± 1.418
4.726AlaVal: 4.726 ± 0.759
0.788AlaTrp: 0.788 ± 0.446
1.733AlaTyr: 1.733 ± 0.487
0.0AlaXaa: 0.0 ± 0.0
Cys
0.315CysAla: 0.315 ± 0.234
0.0CysCys: 0.0 ± 0.0
0.473CysAsp: 0.473 ± 0.251
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.473CysGly: 0.473 ± 0.251
0.473CysHis: 0.473 ± 0.311
0.473CysIle: 0.473 ± 0.297
0.473CysLys: 0.473 ± 0.294
0.158CysLeu: 0.158 ± 0.181
0.158CysMet: 0.158 ± 0.171
0.158CysAsn: 0.158 ± 0.177
0.158CysPro: 0.158 ± 0.112
0.158CysGln: 0.158 ± 0.178
0.473CysArg: 0.473 ± 0.253
0.158CysSer: 0.158 ± 0.181
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.158CysTrp: 0.158 ± 0.112
0.473CysTyr: 0.473 ± 0.382
0.0CysXaa: 0.0 ± 0.0
Asp
3.938AspAla: 3.938 ± 0.742
0.63AspCys: 0.63 ± 0.397
1.26AspAsp: 1.26 ± 0.687
3.151AspGlu: 3.151 ± 0.853
2.993AspPhe: 2.993 ± 0.533
5.356AspGly: 5.356 ± 1.303
0.315AspHis: 0.315 ± 0.285
4.411AspIle: 4.411 ± 0.655
3.308AspLys: 3.308 ± 0.804
4.726AspLeu: 4.726 ± 0.766
2.205AspMet: 2.205 ± 0.726
1.89AspAsn: 1.89 ± 0.575
0.473AspPro: 0.473 ± 0.407
1.26AspGln: 1.26 ± 0.478
1.733AspArg: 1.733 ± 0.567
4.411AspSer: 4.411 ± 0.762
4.096AspThr: 4.096 ± 1.002
4.411AspVal: 4.411 ± 0.687
0.945AspTrp: 0.945 ± 0.33
2.205AspTyr: 2.205 ± 0.615
0.0AspXaa: 0.0 ± 0.0
Glu
5.514GluAla: 5.514 ± 0.933
0.473GluCys: 0.473 ± 0.242
3.308GluAsp: 3.308 ± 0.812
3.938GluGlu: 3.938 ± 1.517
1.733GluPhe: 1.733 ± 0.508
3.466GluGly: 3.466 ± 0.899
1.733GluHis: 1.733 ± 0.752
5.041GluIle: 5.041 ± 0.867
4.096GluLys: 4.096 ± 1.237
7.561GluLeu: 7.561 ± 1.669
2.048GluMet: 2.048 ± 0.813
3.781GluAsn: 3.781 ± 0.749
1.418GluPro: 1.418 ± 0.894
2.993GluGln: 2.993 ± 0.463
3.466GluArg: 3.466 ± 0.698
5.041GluSer: 5.041 ± 0.581
3.623GluThr: 3.623 ± 0.631
5.356GluVal: 5.356 ± 0.993
0.63GluTrp: 0.63 ± 0.425
2.048GluTyr: 2.048 ± 0.743
0.0GluXaa: 0.0 ± 0.0
Phe
1.89PheAla: 1.89 ± 0.473
0.0PheCys: 0.0 ± 0.0
2.363PheAsp: 2.363 ± 0.597
3.623PheGlu: 3.623 ± 1.04
1.418PhePhe: 1.418 ± 0.435
3.781PheGly: 3.781 ± 0.94
0.473PheHis: 0.473 ± 0.252
1.575PheIle: 1.575 ± 0.389
3.151PheLys: 3.151 ± 0.401
3.308PheLeu: 3.308 ± 1.041
0.788PheMet: 0.788 ± 0.357
1.26PheAsn: 1.26 ± 0.511
1.103PhePro: 1.103 ± 0.429
0.945PheGln: 0.945 ± 0.455
1.103PheArg: 1.103 ± 0.431
2.205PheSer: 2.205 ± 0.735
1.26PheThr: 1.26 ± 0.443
2.678PheVal: 2.678 ± 0.672
0.158PheTrp: 0.158 ± 0.184
1.89PheTyr: 1.89 ± 0.474
0.0PheXaa: 0.0 ± 0.0
Gly
5.829GlyAla: 5.829 ± 1.704
0.158GlyCys: 0.158 ± 0.192
4.253GlyAsp: 4.253 ± 0.907
4.883GlyGlu: 4.883 ± 0.895
2.678GlyPhe: 2.678 ± 0.951
4.568GlyGly: 4.568 ± 1.313
0.788GlyHis: 0.788 ± 0.348
6.459GlyIle: 6.459 ± 1.246
5.356GlyLys: 5.356 ± 1.046
5.356GlyLeu: 5.356 ± 1.345
1.418GlyMet: 1.418 ± 0.652
4.096GlyAsn: 4.096 ± 1.055
1.26GlyPro: 1.26 ± 0.4
3.308GlyGln: 3.308 ± 0.573
3.623GlyArg: 3.623 ± 0.759
5.829GlySer: 5.829 ± 1.491
4.568GlyThr: 4.568 ± 1.05
3.938GlyVal: 3.938 ± 1.449
1.103GlyTrp: 1.103 ± 0.4
3.623GlyTyr: 3.623 ± 0.951
0.0GlyXaa: 0.0 ± 0.0
His
0.158HisAla: 0.158 ± 0.178
0.0HisCys: 0.0 ± 0.0
0.63HisAsp: 0.63 ± 0.35
0.473HisGlu: 0.473 ± 0.244
0.63HisPhe: 0.63 ± 0.447
1.418HisGly: 1.418 ± 0.458
0.315HisHis: 0.315 ± 0.237
0.945HisIle: 0.945 ± 0.507
0.945HisLys: 0.945 ± 0.488
1.418HisLeu: 1.418 ± 0.421
0.158HisMet: 0.158 ± 0.192
1.575HisAsn: 1.575 ± 0.524
0.63HisPro: 0.63 ± 0.462
0.315HisGln: 0.315 ± 0.232
0.63HisArg: 0.63 ± 0.311
0.788HisSer: 0.788 ± 0.254
1.26HisThr: 1.26 ± 0.433
1.103HisVal: 1.103 ± 0.481
0.158HisTrp: 0.158 ± 0.178
0.63HisTyr: 0.63 ± 0.361
0.0HisXaa: 0.0 ± 0.0
Ile
6.774IleAla: 6.774 ± 0.916
0.0IleCys: 0.0 ± 0.0
5.356IleAsp: 5.356 ± 0.656
4.883IleGlu: 4.883 ± 0.803
1.89IlePhe: 1.89 ± 0.291
5.041IleGly: 5.041 ± 0.893
1.418IleHis: 1.418 ± 0.452
4.726IleIle: 4.726 ± 0.998
4.568IleLys: 4.568 ± 0.859
4.253IleLeu: 4.253 ± 1.04
1.89IleMet: 1.89 ± 0.925
4.253IleAsn: 4.253 ± 0.993
2.205IlePro: 2.205 ± 0.653
2.048IleGln: 2.048 ± 0.876
2.993IleArg: 2.993 ± 0.608
5.198IleSer: 5.198 ± 1.04
5.041IleThr: 5.041 ± 0.745
3.781IleVal: 3.781 ± 0.887
0.158IleTrp: 0.158 ± 0.161
2.048IleTyr: 2.048 ± 0.542
0.0IleXaa: 0.0 ± 0.0
Lys
5.514LysAla: 5.514 ± 1.041
0.473LysCys: 0.473 ± 0.297
4.411LysAsp: 4.411 ± 0.795
2.993LysGlu: 2.993 ± 0.677
1.418LysPhe: 1.418 ± 0.434
4.568LysGly: 4.568 ± 1.015
0.315LysHis: 0.315 ± 0.192
3.151LysIle: 3.151 ± 0.683
4.568LysLys: 4.568 ± 1.072
5.356LysLeu: 5.356 ± 0.824
2.048LysMet: 2.048 ± 0.615
3.938LysAsn: 3.938 ± 0.967
2.836LysPro: 2.836 ± 1.064
2.52LysGln: 2.52 ± 0.465
5.671LysArg: 5.671 ± 1.552
4.883LysSer: 4.883 ± 1.191
6.144LysThr: 6.144 ± 1.202
4.726LysVal: 4.726 ± 0.771
0.473LysTrp: 0.473 ± 0.252
1.575LysTyr: 1.575 ± 0.57
0.0LysXaa: 0.0 ± 0.0
Leu
10.397LeuAla: 10.397 ± 1.058
0.315LeuCys: 0.315 ± 0.224
5.041LeuAsp: 5.041 ± 1.093
6.301LeuGlu: 6.301 ± 1.62
2.205LeuPhe: 2.205 ± 0.363
6.144LeuGly: 6.144 ± 1.982
1.575LeuHis: 1.575 ± 0.504
4.568LeuIle: 4.568 ± 1.627
6.144LeuLys: 6.144 ± 1.052
5.829LeuLeu: 5.829 ± 1.28
0.945LeuMet: 0.945 ± 0.358
5.671LeuAsn: 5.671 ± 1.055
2.363LeuPro: 2.363 ± 0.724
2.52LeuGln: 2.52 ± 0.556
3.938LeuArg: 3.938 ± 1.042
7.089LeuSer: 7.089 ± 1.534
5.986LeuThr: 5.986 ± 1.164
3.938LeuVal: 3.938 ± 0.703
0.788LeuTrp: 0.788 ± 0.444
1.575LeuTyr: 1.575 ± 0.424
0.0LeuXaa: 0.0 ± 0.0
Met
2.048MetAla: 2.048 ± 0.675
0.158MetCys: 0.158 ± 0.178
0.63MetAsp: 0.63 ± 0.353
1.26MetGlu: 1.26 ± 0.497
1.103MetPhe: 1.103 ± 0.348
1.733MetGly: 1.733 ± 1.222
1.26MetHis: 1.26 ± 0.348
1.575MetIle: 1.575 ± 0.499
1.575MetLys: 1.575 ± 0.708
1.89MetLeu: 1.89 ± 0.664
0.315MetMet: 0.315 ± 0.434
0.63MetAsn: 0.63 ± 0.313
0.473MetPro: 0.473 ± 0.295
0.788MetGln: 0.788 ± 0.289
1.103MetArg: 1.103 ± 0.457
0.788MetSer: 0.788 ± 0.323
2.678MetThr: 2.678 ± 0.945
1.733MetVal: 1.733 ± 0.455
0.158MetTrp: 0.158 ± 0.143
0.63MetTyr: 0.63 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
2.993AsnAla: 2.993 ± 0.782
0.158AsnCys: 0.158 ± 0.178
1.575AsnAsp: 1.575 ± 0.377
2.52AsnGlu: 2.52 ± 0.683
1.575AsnPhe: 1.575 ± 0.624
4.883AsnGly: 4.883 ± 0.874
0.63AsnHis: 0.63 ± 0.326
4.253AsnIle: 4.253 ± 0.832
2.993AsnLys: 2.993 ± 0.77
5.829AsnLeu: 5.829 ± 1.23
0.315AsnMet: 0.315 ± 0.21
1.418AsnAsn: 1.418 ± 0.508
2.363AsnPro: 2.363 ± 0.841
2.993AsnGln: 2.993 ± 0.612
3.151AsnArg: 3.151 ± 0.997
3.151AsnSer: 3.151 ± 0.752
3.781AsnThr: 3.781 ± 0.842
3.466AsnVal: 3.466 ± 0.707
0.473AsnTrp: 0.473 ± 0.258
2.048AsnTyr: 2.048 ± 0.576
0.0AsnXaa: 0.0 ± 0.0
Pro
1.89ProAla: 1.89 ± 0.535
0.788ProCys: 0.788 ± 0.434
1.575ProAsp: 1.575 ± 0.812
2.836ProGlu: 2.836 ± 1.017
1.418ProPhe: 1.418 ± 0.493
1.418ProGly: 1.418 ± 0.458
0.63ProHis: 0.63 ± 0.375
2.205ProIle: 2.205 ± 0.709
1.103ProLys: 1.103 ± 0.563
2.048ProLeu: 2.048 ± 0.607
1.26ProMet: 1.26 ± 0.347
1.103ProAsn: 1.103 ± 0.413
0.788ProPro: 0.788 ± 0.331
0.473ProGln: 0.473 ± 0.293
0.315ProArg: 0.315 ± 0.181
1.89ProSer: 1.89 ± 0.542
2.205ProThr: 2.205 ± 0.758
1.418ProVal: 1.418 ± 0.438
0.315ProTrp: 0.315 ± 0.253
0.63ProTyr: 0.63 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
4.253GlnAla: 4.253 ± 0.754
0.0GlnCys: 0.0 ± 0.0
1.418GlnAsp: 1.418 ± 0.488
3.308GlnGlu: 3.308 ± 0.75
0.63GlnPhe: 0.63 ± 0.207
1.103GlnGly: 1.103 ± 0.477
0.0GlnHis: 0.0 ± 0.0
2.678GlnIle: 2.678 ± 0.558
1.733GlnLys: 1.733 ± 0.588
3.151GlnLeu: 3.151 ± 0.798
0.473GlnMet: 0.473 ± 0.277
2.678GlnAsn: 2.678 ± 0.657
0.945GlnPro: 0.945 ± 0.303
1.103GlnGln: 1.103 ± 0.345
1.26GlnArg: 1.26 ± 0.603
5.041GlnSer: 5.041 ± 1.096
4.883GlnThr: 4.883 ± 0.835
3.466GlnVal: 3.466 ± 0.727
0.473GlnTrp: 0.473 ± 0.244
1.26GlnTyr: 1.26 ± 0.346
0.0GlnXaa: 0.0 ± 0.0
Arg
3.151ArgAla: 3.151 ± 0.913
0.315ArgCys: 0.315 ± 0.169
1.733ArgAsp: 1.733 ± 0.492
1.418ArgGlu: 1.418 ± 0.398
2.205ArgPhe: 2.205 ± 0.524
2.048ArgGly: 2.048 ± 0.597
1.26ArgHis: 1.26 ± 0.429
2.363ArgIle: 2.363 ± 0.601
3.623ArgLys: 3.623 ± 1.02
6.301ArgLeu: 6.301 ± 1.35
0.63ArgMet: 0.63 ± 0.483
2.678ArgAsn: 2.678 ± 0.648
0.788ArgPro: 0.788 ± 0.338
2.52ArgGln: 2.52 ± 0.893
1.733ArgArg: 1.733 ± 0.664
1.575ArgSer: 1.575 ± 0.447
3.938ArgThr: 3.938 ± 0.83
3.781ArgVal: 3.781 ± 0.817
0.473ArgTrp: 0.473 ± 0.273
2.048ArgTyr: 2.048 ± 0.682
0.0ArgXaa: 0.0 ± 0.0
Ser
8.034SerAla: 8.034 ± 1.789
0.158SerCys: 0.158 ± 0.178
2.678SerAsp: 2.678 ± 0.653
6.144SerGlu: 6.144 ± 1.07
4.096SerPhe: 4.096 ± 1.1
7.089SerGly: 7.089 ± 1.195
0.63SerHis: 0.63 ± 0.261
4.411SerIle: 4.411 ± 0.972
5.041SerLys: 5.041 ± 0.962
5.986SerLeu: 5.986 ± 1.605
1.733SerMet: 1.733 ± 0.417
2.363SerAsn: 2.363 ± 0.543
2.363SerPro: 2.363 ± 0.522
2.678SerGln: 2.678 ± 0.875
2.205SerArg: 2.205 ± 0.809
6.144SerSer: 6.144 ± 1.258
5.986SerThr: 5.986 ± 0.599
4.883SerVal: 4.883 ± 1.102
0.473SerTrp: 0.473 ± 0.255
2.993SerTyr: 2.993 ± 0.87
0.0SerXaa: 0.0 ± 0.0
Thr
6.931ThrAla: 6.931 ± 2.274
0.0ThrCys: 0.0 ± 0.0
4.253ThrAsp: 4.253 ± 0.944
4.253ThrGlu: 4.253 ± 0.986
2.363ThrPhe: 2.363 ± 0.531
5.671ThrGly: 5.671 ± 1.141
0.158ThrHis: 0.158 ± 0.21
6.144ThrIle: 6.144 ± 0.956
4.883ThrLys: 4.883 ± 0.944
5.198ThrLeu: 5.198 ± 0.708
1.733ThrMet: 1.733 ± 0.401
3.466ThrAsn: 3.466 ± 0.62
2.048ThrPro: 2.048 ± 0.491
3.308ThrGln: 3.308 ± 0.988
2.993ThrArg: 2.993 ± 0.561
6.301ThrSer: 6.301 ± 1.009
5.671ThrThr: 5.671 ± 1.362
7.876ThrVal: 7.876 ± 1.391
0.158ThrTrp: 0.158 ± 0.112
2.993ThrTyr: 2.993 ± 0.705
0.0ThrXaa: 0.0 ± 0.0
Val
5.986ValAla: 5.986 ± 0.857
0.315ValCys: 0.315 ± 0.244
4.411ValAsp: 4.411 ± 1.054
5.829ValGlu: 5.829 ± 1.279
2.363ValPhe: 2.363 ± 0.553
4.411ValGly: 4.411 ± 0.844
1.26ValHis: 1.26 ± 0.503
3.623ValIle: 3.623 ± 0.768
4.568ValLys: 4.568 ± 1.003
5.356ValLeu: 5.356 ± 1.697
1.418ValMet: 1.418 ± 0.416
2.836ValAsn: 2.836 ± 0.57
1.89ValPro: 1.89 ± 0.542
3.308ValGln: 3.308 ± 0.773
2.993ValArg: 2.993 ± 0.811
6.616ValSer: 6.616 ± 0.968
4.568ValThr: 4.568 ± 1.14
3.466ValVal: 3.466 ± 1.239
0.63ValTrp: 0.63 ± 0.227
2.52ValTyr: 2.52 ± 0.542
0.0ValXaa: 0.0 ± 0.0
Trp
0.63TrpAla: 0.63 ± 0.488
0.0TrpCys: 0.0 ± 0.0
0.315TrpAsp: 0.315 ± 0.172
0.473TrpGlu: 0.473 ± 0.341
0.0TrpPhe: 0.0 ± 0.0
0.945TrpGly: 0.945 ± 0.458
0.158TrpHis: 0.158 ± 0.112
1.103TrpIle: 1.103 ± 0.446
0.473TrpLys: 0.473 ± 0.215
0.0TrpLeu: 0.0 ± 0.0
0.158TrpMet: 0.158 ± 0.143
0.63TrpAsn: 0.63 ± 0.265
0.158TrpPro: 0.158 ± 0.178
0.63TrpGln: 0.63 ± 0.264
0.315TrpArg: 0.315 ± 0.275
0.788TrpSer: 0.788 ± 0.405
0.945TrpThr: 0.945 ± 0.332
1.103TrpVal: 1.103 ± 0.487
0.0TrpTrp: 0.0 ± 0.0
0.473TrpTyr: 0.473 ± 0.315
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.253TyrAla: 4.253 ± 0.612
0.473TyrCys: 0.473 ± 0.311
3.466TyrAsp: 3.466 ± 0.831
2.52TyrGlu: 2.52 ± 0.62
0.945TyrPhe: 0.945 ± 0.433
2.048TyrGly: 2.048 ± 0.65
0.315TyrHis: 0.315 ± 0.181
1.575TyrIle: 1.575 ± 0.628
1.575TyrLys: 1.575 ± 0.431
2.363TyrLeu: 2.363 ± 0.412
1.103TyrMet: 1.103 ± 0.649
0.945TyrAsn: 0.945 ± 0.381
0.63TyrPro: 0.63 ± 0.265
2.048TyrGln: 2.048 ± 0.42
2.048TyrArg: 2.048 ± 0.583
1.89TyrSer: 1.89 ± 0.527
1.89TyrThr: 1.89 ± 0.593
2.52TyrVal: 2.52 ± 0.595
0.788TyrTrp: 0.788 ± 0.262
1.103TyrTyr: 1.103 ± 0.463
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (6349 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski