Amino acid dipeptide frequency for Streptococcus phage Javan37

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
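The counting described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the function names, the toy sequences, the use of overlapping windows within each protein, and normalisation by the total number of dipeptide positions are all assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptide positions.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation: 1 / 400 of all dipeptides, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)

def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value as over- or under-represented versus the expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"

# Hypothetical toy sequences, not taken from the Javan37 proteome.
toy = ["MKKLA", "MKXA"]
freqs = dipeptide_per_mille(toy)
```

Under these assumptions, a table value such as AlaLys (6.457‰) would be labelled "over" and AlaCys (0.092‰) "under" relative to the uniform 2.5‰.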

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
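As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the sample value is the AlaLys entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_lys = 6.457  # AlaLys frequency in per mille, taken from the table
fraction = per_mille_to_fraction(ala_lys)   # about 0.006457
percent = per_mille_to_percent(ala_lys)     # about 0.6457 %
```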

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.228 ± 0.669
AlaCys: 0.092 ± 0.102
AlaAsp: 3.597 ± 0.435
AlaGlu: 4.243 ± 0.651
AlaPhe: 2.306 ± 0.396
AlaGly: 3.966 ± 0.934
AlaHis: 1.107 ± 0.324
AlaIle: 5.442 ± 0.842
AlaLys: 6.457 ± 0.789
AlaLeu: 5.811 ± 0.697
AlaMet: 1.384 ± 0.333
AlaAsn: 3.782 ± 0.677
AlaPro: 1.199 ± 0.358
AlaGln: 2.306 ± 0.435
AlaArg: 2.122 ± 0.515
AlaSer: 4.243 ± 0.64
AlaThr: 3.782 ± 0.598
AlaVal: 3.69 ± 0.632
AlaTrp: 0.922 ± 0.246
AlaTyr: 2.767 ± 0.446
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.184 ± 0.159
CysCys: 0.184 ± 0.12
CysAsp: 0.83 ± 0.331
CysGlu: 0.738 ± 0.314
CysPhe: 0.369 ± 0.23
CysGly: 0.646 ± 0.252
CysHis: 0.277 ± 0.165
CysIle: 0.369 ± 0.184
CysLys: 0.277 ± 0.159
CysLeu: 0.83 ± 0.313
CysMet: 0.184 ± 0.15
CysAsn: 0.369 ± 0.167
CysPro: 0.0 ± 0.0
CysGln: 0.184 ± 0.117
CysArg: 0.184 ± 0.13
CysSer: 0.461 ± 0.22
CysThr: 0.184 ± 0.142
CysVal: 0.461 ± 0.204
CysTrp: 0.092 ± 0.101
CysTyr: 0.553 ± 0.298
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.782 ± 0.673
AspCys: 0.553 ± 0.207
AspAsp: 3.597 ± 0.579
AspGlu: 5.073 ± 0.696
AspPhe: 3.136 ± 0.501
AspGly: 4.428 ± 0.661
AspHis: 0.738 ± 0.224
AspIle: 5.627 ± 0.65
AspLys: 6.918 ± 0.811
AspLeu: 4.52 ± 0.643
AspMet: 1.568 ± 0.326
AspAsn: 4.151 ± 0.516
AspPro: 0.738 ± 0.286
AspGln: 1.291 ± 0.357
AspArg: 2.952 ± 0.45
AspSer: 3.69 ± 0.621
AspThr: 3.413 ± 0.561
AspVal: 3.321 ± 0.519
AspTrp: 0.922 ± 0.25
AspTyr: 3.69 ± 0.672
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.704 ± 0.766
GluCys: 0.461 ± 0.243
GluAsp: 3.505 ± 0.491
GluGlu: 5.811 ± 0.821
GluPhe: 3.228 ± 0.398
GluGly: 2.767 ± 0.459
GluHis: 1.291 ± 0.414
GluIle: 5.904 ± 0.859
GluLys: 6.457 ± 0.847
GluLeu: 7.841 ± 1.074
GluMet: 2.214 ± 0.576
GluAsn: 4.335 ± 0.78
GluPro: 2.306 ± 0.554
GluGln: 2.767 ± 0.483
GluArg: 3.966 ± 0.7
GluSer: 4.151 ± 0.55
GluThr: 3.874 ± 0.752
GluVal: 3.505 ± 0.557
GluTrp: 1.291 ± 0.342
GluTyr: 2.86 ± 0.472
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.952 ± 0.561
PheCys: 0.184 ± 0.126
PheAsp: 3.69 ± 0.481
PheGlu: 3.505 ± 0.618
PhePhe: 0.646 ± 0.234
PheGly: 2.86 ± 0.452
PheHis: 0.553 ± 0.229
PheIle: 2.583 ± 0.5
PheLys: 3.966 ± 0.501
PheLeu: 3.597 ± 0.599
PheMet: 0.922 ± 0.328
PheAsn: 1.568 ± 0.394
PhePro: 0.922 ± 0.361
PheGln: 0.646 ± 0.232
PheArg: 1.66 ± 0.47
PheSer: 2.86 ± 0.498
PheThr: 2.122 ± 0.465
PheVal: 3.69 ± 0.565
PheTrp: 0.277 ± 0.17
PheTyr: 1.199 ± 0.301
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.413 ± 0.726
GlyCys: 0.277 ± 0.166
GlyAsp: 4.704 ± 0.802
GlyGlu: 3.966 ± 0.489
GlyPhe: 3.505 ± 0.719
GlyGly: 4.335 ± 0.733
GlyHis: 0.922 ± 0.203
GlyIle: 5.258 ± 0.754
GlyLys: 6.088 ± 0.938
GlyLeu: 4.059 ± 0.602
GlyMet: 2.122 ± 0.524
GlyAsn: 4.52 ± 0.635
GlyPro: 0.646 ± 0.216
GlyGln: 2.675 ± 0.567
GlyArg: 2.491 ± 0.456
GlySer: 3.413 ± 0.769
GlyThr: 4.704 ± 0.713
GlyVal: 3.228 ± 0.468
GlyTrp: 1.199 ± 0.278
GlyTyr: 3.966 ± 0.582
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.738 ± 0.216
HisCys: 0.369 ± 0.188
HisAsp: 0.553 ± 0.254
HisGlu: 1.199 ± 0.342
HisPhe: 0.738 ± 0.233
HisGly: 0.646 ± 0.2
HisHis: 0.184 ± 0.171
HisIle: 1.107 ± 0.403
HisLys: 1.291 ± 0.306
HisLeu: 1.291 ± 0.51
HisMet: 0.277 ± 0.17
HisAsn: 0.738 ± 0.314
HisPro: 0.277 ± 0.151
HisGln: 0.83 ± 0.287
HisArg: 0.461 ± 0.202
HisSer: 1.291 ± 0.534
HisThr: 1.107 ± 0.397
HisVal: 0.922 ± 0.299
HisTrp: 0.092 ± 0.092
HisTyr: 1.015 ± 0.238
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.166 ± 0.622
IleCys: 0.461 ± 0.249
IleAsp: 6.641 ± 0.759
IleGlu: 5.166 ± 0.68
IlePhe: 2.398 ± 0.64
IleGly: 4.335 ± 0.701
IleHis: 1.384 ± 0.319
IleIle: 4.151 ± 0.511
IleLys: 7.287 ± 0.884
IleLeu: 4.612 ± 0.595
IleMet: 1.753 ± 0.357
IleAsn: 5.166 ± 0.66
IlePro: 2.491 ± 0.434
IleGln: 3.136 ± 0.474
IleArg: 2.583 ± 0.586
IleSer: 7.195 ± 1.016
IleThr: 3.874 ± 0.634
IleVal: 4.243 ± 0.767
IleTrp: 0.922 ± 0.391
IleTyr: 2.767 ± 0.567
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.904 ± 0.807
LysCys: 0.738 ± 0.272
LysAsp: 5.166 ± 0.834
LysGlu: 6.549 ± 1.054
LysPhe: 2.029 ± 0.368
LysGly: 6.365 ± 0.666
LysHis: 0.922 ± 0.3
LysIle: 6.18 ± 0.917
LysLys: 7.01 ± 1.042
LysLeu: 6.18 ± 0.712
LysMet: 2.398 ± 0.661
LysAsn: 5.166 ± 0.602
LysPro: 3.228 ± 0.48
LysGln: 3.782 ± 0.616
LysArg: 4.151 ± 0.549
LysSer: 5.627 ± 0.779
LysThr: 5.258 ± 0.535
LysVal: 5.442 ± 0.638
LysTrp: 1.199 ± 0.365
LysTyr: 4.151 ± 0.535
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.981 ± 0.66
LeuCys: 0.184 ± 0.121
LeuAsp: 5.258 ± 0.648
LeuGlu: 6.272 ± 0.766
LeuPhe: 3.136 ± 0.626
LeuGly: 5.35 ± 0.779
LeuHis: 1.107 ± 0.42
LeuIle: 7.564 ± 0.9
LeuLys: 7.564 ± 0.856
LeuLeu: 6.734 ± 0.809
LeuMet: 1.845 ± 0.397
LeuAsn: 5.442 ± 0.697
LeuPro: 2.952 ± 0.586
LeuGln: 3.136 ± 0.578
LeuArg: 3.044 ± 0.462
LeuSer: 5.904 ± 0.684
LeuThr: 4.797 ± 0.627
LeuVal: 3.874 ± 0.583
LeuTrp: 0.83 ± 0.305
LeuTyr: 1.937 ± 0.419
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.491 ± 0.481
MetCys: 0.277 ± 0.159
MetAsp: 1.476 ± 0.554
MetGlu: 1.291 ± 0.271
MetPhe: 1.199 ± 0.353
MetGly: 1.199 ± 0.314
MetHis: 0.092 ± 0.087
MetIle: 1.476 ± 0.432
MetLys: 1.199 ± 0.301
MetLeu: 2.583 ± 0.42
MetMet: 0.646 ± 0.246
MetAsn: 1.66 ± 0.434
MetPro: 0.83 ± 0.274
MetGln: 1.384 ± 0.421
MetArg: 0.738 ± 0.219
MetSer: 2.214 ± 0.446
MetThr: 1.937 ± 0.418
MetVal: 1.291 ± 0.345
MetTrp: 0.184 ± 0.13
MetTyr: 0.738 ± 0.295
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.86 ± 0.604
AsnCys: 0.369 ± 0.183
AsnAsp: 2.952 ± 0.604
AsnGlu: 4.335 ± 0.58
AsnPhe: 3.228 ± 0.495
AsnGly: 4.797 ± 0.72
AsnHis: 1.384 ± 0.41
AsnIle: 3.413 ± 0.507
AsnLys: 4.612 ± 0.703
AsnLeu: 5.627 ± 0.894
AsnMet: 1.291 ± 0.388
AsnAsn: 3.69 ± 0.53
AsnPro: 2.767 ± 0.474
AsnGln: 1.845 ± 0.446
AsnArg: 2.491 ± 0.513
AsnSer: 3.228 ± 0.565
AsnThr: 3.228 ± 0.563
AsnVal: 2.952 ± 0.563
AsnTrp: 0.738 ± 0.273
AsnTyr: 2.952 ± 0.443
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.568 ± 0.328
ProCys: 0.092 ± 0.105
ProAsp: 2.122 ± 0.497
ProGlu: 2.029 ± 0.511
ProPhe: 1.753 ± 0.374
ProGly: 1.107 ± 0.458
ProHis: 0.092 ± 0.093
ProIle: 2.675 ± 0.64
ProLys: 2.767 ± 0.611
ProLeu: 1.845 ± 0.474
ProMet: 0.83 ± 0.303
ProAsn: 1.015 ± 0.332
ProPro: 1.199 ± 0.299
ProGln: 0.922 ± 0.288
ProArg: 0.922 ± 0.316
ProSer: 2.398 ± 0.418
ProThr: 1.291 ± 0.429
ProVal: 1.753 ± 0.361
ProTrp: 0.184 ± 0.141
ProTyr: 0.738 ± 0.283
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.044 ± 0.54
GlnCys: 0.461 ± 0.218
GlnAsp: 2.398 ± 0.583
GlnGlu: 3.136 ± 0.712
GlnPhe: 1.107 ± 0.238
GlnGly: 1.753 ± 0.436
GlnHis: 0.553 ± 0.2
GlnIle: 3.228 ± 0.681
GlnLys: 3.874 ± 0.499
GlnLeu: 3.966 ± 0.49
GlnMet: 1.199 ± 0.318
GlnAsn: 1.937 ± 0.468
GlnPro: 0.922 ± 0.35
GlnGln: 1.199 ± 0.316
GlnArg: 1.568 ± 0.412
GlnSer: 2.306 ± 0.499
GlnThr: 2.214 ± 0.508
GlnVal: 1.66 ± 0.463
GlnTrp: 0.646 ± 0.18
GlnTyr: 0.738 ± 0.278
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.767 ± 0.528
ArgCys: 0.184 ± 0.13
ArgAsp: 2.675 ± 0.599
ArgGlu: 2.86 ± 0.53
ArgPhe: 0.922 ± 0.235
ArgGly: 2.491 ± 0.526
ArgHis: 0.922 ± 0.382
ArgIle: 4.059 ± 0.577
ArgLys: 3.136 ± 0.463
ArgLeu: 3.597 ± 0.667
ArgMet: 1.015 ± 0.257
ArgAsn: 2.122 ± 0.356
ArgPro: 1.384 ± 0.339
ArgGln: 1.66 ± 0.352
ArgArg: 1.66 ± 0.384
ArgSer: 1.568 ± 0.424
ArgThr: 2.122 ± 0.42
ArgVal: 1.845 ± 0.42
ArgTrp: 0.184 ± 0.118
ArgTyr: 1.66 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.321 ± 0.775
SerCys: 0.461 ± 0.249
SerAsp: 4.52 ± 0.707
SerGlu: 4.52 ± 0.686
SerPhe: 3.321 ± 0.601
SerGly: 5.442 ± 0.646
SerHis: 1.199 ± 0.327
SerIle: 3.874 ± 0.472
SerLys: 5.166 ± 0.707
SerLeu: 5.442 ± 0.718
SerMet: 1.568 ± 0.338
SerAsn: 4.428 ± 0.763
SerPro: 1.107 ± 0.256
SerGln: 2.214 ± 0.4
SerArg: 1.476 ± 0.367
SerSer: 4.704 ± 0.675
SerThr: 4.059 ± 0.579
SerVal: 4.52 ± 0.753
SerTrp: 1.199 ± 0.29
SerTyr: 3.413 ± 0.661
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.151 ± 0.539
ThrCys: 0.092 ± 0.086
ThrAsp: 3.136 ± 0.406
ThrGlu: 3.874 ± 0.599
ThrPhe: 2.583 ± 0.498
ThrGly: 5.258 ± 0.744
ThrHis: 0.646 ± 0.31
ThrIle: 4.981 ± 0.932
ThrLys: 4.797 ± 0.622
ThrLeu: 4.981 ± 0.727
ThrMet: 1.291 ± 0.316
ThrAsn: 2.583 ± 0.462
ThrPro: 1.568 ± 0.309
ThrGln: 3.136 ± 0.561
ThrArg: 2.122 ± 0.482
ThrSer: 4.151 ± 0.596
ThrThr: 3.505 ± 0.501
ThrVal: 4.889 ± 0.73
ThrTrp: 0.646 ± 0.27
ThrTyr: 2.029 ± 0.545
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.505 ± 0.385
ValCys: 0.461 ± 0.193
ValAsp: 4.151 ± 0.526
ValGlu: 4.52 ± 0.63
ValPhe: 2.122 ± 0.484
ValGly: 3.782 ± 0.618
ValHis: 0.553 ± 0.209
ValIle: 4.612 ± 0.657
ValLys: 4.612 ± 0.66
ValLeu: 4.151 ± 0.497
ValMet: 1.015 ± 0.249
ValAsn: 2.952 ± 0.755
ValPro: 1.384 ± 0.36
ValGln: 2.306 ± 0.443
ValArg: 2.122 ± 0.468
ValSer: 3.505 ± 0.486
ValThr: 4.981 ± 0.739
ValVal: 3.782 ± 0.553
ValTrp: 0.646 ± 0.237
ValTyr: 2.583 ± 0.537
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.922 ± 0.385
TrpCys: 0.277 ± 0.151
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.291 ± 0.394
TrpPhe: 0.646 ± 0.211
TrpGly: 1.107 ± 0.365
TrpHis: 0.092 ± 0.093
TrpIle: 0.922 ± 0.273
TrpLys: 0.738 ± 0.334
TrpLeu: 1.199 ± 0.449
TrpMet: 0.277 ± 0.165
TrpAsn: 0.369 ± 0.181
TrpPro: 0.092 ± 0.092
TrpGln: 0.553 ± 0.191
TrpArg: 0.553 ± 0.238
TrpSer: 1.199 ± 0.368
TrpThr: 1.476 ± 0.373
TrpVal: 0.369 ± 0.205
TrpTrp: 0.092 ± 0.086
TrpTyr: 0.461 ± 0.277
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.767 ± 0.49
TyrCys: 1.199 ± 0.308
TyrAsp: 3.136 ± 0.478
TyrGlu: 2.86 ± 0.673
TyrPhe: 2.029 ± 0.413
TyrGly: 2.86 ± 0.496
TyrHis: 1.107 ± 0.295
TyrIle: 2.306 ± 0.448
TyrLys: 3.228 ± 0.538
TyrLeu: 3.044 ± 0.553
TyrMet: 1.107 ± 0.298
TyrAsn: 2.767 ± 0.53
TyrPro: 1.291 ± 0.321
TyrGln: 1.937 ± 0.358
TyrArg: 1.568 ± 0.403
TyrSer: 1.845 ± 0.464
TyrThr: 2.491 ± 0.512
TyrVal: 2.306 ± 0.378
TyrTrp: 0.369 ± 0.219
TyrTyr: 2.029 ± 0.43
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (10842 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
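A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so all dipeptides of a protein enter or leave a replicate together.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome, not taken from the Javan37 data.
toy = ["MKKLA", "MKXA", "AAAA"]
mean_mk, err_mk = bootstrap_error(toy, "MK")
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps residues from the same protein together, so a single protein with an unusual composition widens the error estimate.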

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski