Amino acid dipepetide frequency for Streptococcus phage Javan119

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.681AlaAla: 4.681 ± 1.245
0.576AlaCys: 0.576 ± 0.194
4.033AlaAsp: 4.033 ± 0.466
4.825AlaGlu: 4.825 ± 0.604
2.737AlaPhe: 2.737 ± 0.681
4.753AlaGly: 4.753 ± 1.271
1.08AlaHis: 1.08 ± 0.326
5.977AlaIle: 5.977 ± 1.003
6.625AlaLys: 6.625 ± 0.908
5.545AlaLeu: 5.545 ± 0.884
2.809AlaMet: 2.809 ± 0.517
2.737AlaAsn: 2.737 ± 0.472
0.864AlaPro: 0.864 ± 0.278
2.881AlaGln: 2.881 ± 0.549
2.881AlaArg: 2.881 ± 0.396
4.393AlaSer: 4.393 ± 0.736
4.537AlaThr: 4.537 ± 1.13
5.617AlaVal: 5.617 ± 1.244
0.576AlaTrp: 0.576 ± 0.18
1.8AlaTyr: 1.8 ± 0.257
0.0AlaXaa: 0.0 ± 0.0
Cys
0.36CysAla: 0.36 ± 0.15
0.144CysCys: 0.144 ± 0.107
0.432CysAsp: 0.432 ± 0.162
0.648CysGlu: 0.648 ± 0.204
0.216CysPhe: 0.216 ± 0.12
0.432CysGly: 0.432 ± 0.189
0.0CysHis: 0.0 ± 0.0
0.36CysIle: 0.36 ± 0.167
0.432CysLys: 0.432 ± 0.165
0.504CysLeu: 0.504 ± 0.185
0.072CysMet: 0.072 ± 0.086
0.216CysAsn: 0.216 ± 0.134
0.144CysPro: 0.144 ± 0.091
0.288CysGln: 0.288 ± 0.146
0.144CysArg: 0.144 ± 0.088
0.36CysSer: 0.36 ± 0.193
0.288CysThr: 0.288 ± 0.191
0.144CysVal: 0.144 ± 0.107
0.288CysTrp: 0.288 ± 0.125
0.072CysTyr: 0.072 ± 0.068
0.0CysXaa: 0.0 ± 0.0
Asp
2.809AspAla: 2.809 ± 0.435
0.72AspCys: 0.72 ± 0.197
3.961AspAsp: 3.961 ± 0.567
4.825AspGlu: 4.825 ± 0.639
2.737AspPhe: 2.737 ± 0.531
5.329AspGly: 5.329 ± 0.647
0.504AspHis: 0.504 ± 0.164
4.969AspIle: 4.969 ± 0.801
5.689AspLys: 5.689 ± 0.727
6.481AspLeu: 6.481 ± 0.725
1.656AspMet: 1.656 ± 0.358
3.457AspAsn: 3.457 ± 0.586
1.08AspPro: 1.08 ± 0.249
1.368AspGln: 1.368 ± 0.328
2.304AspArg: 2.304 ± 0.428
4.105AspSer: 4.105 ± 0.443
3.457AspThr: 3.457 ± 0.522
3.529AspVal: 3.529 ± 0.487
0.792AspTrp: 0.792 ± 0.3
3.601AspTyr: 3.601 ± 0.529
0.0AspXaa: 0.0 ± 0.0
Glu
5.833GluAla: 5.833 ± 0.685
0.216GluCys: 0.216 ± 0.123
2.593GluAsp: 2.593 ± 0.552
6.193GluGlu: 6.193 ± 0.966
2.953GluPhe: 2.953 ± 0.491
3.529GluGly: 3.529 ± 0.448
1.656GluHis: 1.656 ± 0.347
5.689GluIle: 5.689 ± 0.659
6.481GluLys: 6.481 ± 0.82
7.922GluLeu: 7.922 ± 0.891
2.737GluMet: 2.737 ± 0.38
4.033GluAsn: 4.033 ± 0.774
1.728GluPro: 1.728 ± 0.345
3.601GluGln: 3.601 ± 0.567
2.809GluArg: 2.809 ± 0.431
4.177GluSer: 4.177 ± 0.492
3.241GluThr: 3.241 ± 0.498
5.545GluVal: 5.545 ± 0.726
0.72GluTrp: 0.72 ± 0.324
2.593GluTyr: 2.593 ± 0.444
0.0GluXaa: 0.0 ± 0.0
Phe
2.232PheAla: 2.232 ± 0.453
0.36PheCys: 0.36 ± 0.158
3.673PheAsp: 3.673 ± 0.579
4.033PheGlu: 4.033 ± 0.619
1.512PhePhe: 1.512 ± 0.306
2.953PheGly: 2.953 ± 0.491
0.792PheHis: 0.792 ± 0.246
2.304PheIle: 2.304 ± 0.41
5.185PheLys: 5.185 ± 0.585
1.872PheLeu: 1.872 ± 0.494
0.864PheMet: 0.864 ± 0.262
3.097PheAsn: 3.097 ± 0.626
0.576PhePro: 0.576 ± 0.211
0.936PheGln: 0.936 ± 0.279
2.232PheArg: 2.232 ± 0.394
2.881PheSer: 2.881 ± 0.447
2.088PheThr: 2.088 ± 0.329
2.016PheVal: 2.016 ± 0.279
0.36PheTrp: 0.36 ± 0.182
1.368PheTyr: 1.368 ± 0.41
0.0PheXaa: 0.0 ± 0.0
Gly
5.041GlyAla: 5.041 ± 1.152
0.144GlyCys: 0.144 ± 0.105
3.889GlyAsp: 3.889 ± 0.594
4.105GlyGlu: 4.105 ± 0.399
3.385GlyPhe: 3.385 ± 0.54
3.169GlyGly: 3.169 ± 0.54
0.576GlyHis: 0.576 ± 0.221
4.249GlyIle: 4.249 ± 0.772
5.473GlyLys: 5.473 ± 0.664
5.689GlyLeu: 5.689 ± 0.972
2.16GlyMet: 2.16 ± 0.356
3.241GlyAsn: 3.241 ± 0.431
2.16GlyPro: 2.16 ± 1.252
3.169GlyGln: 3.169 ± 0.462
2.953GlyArg: 2.953 ± 0.541
3.529GlySer: 3.529 ± 0.591
4.249GlyThr: 4.249 ± 0.567
3.961GlyVal: 3.961 ± 0.685
0.792GlyTrp: 0.792 ± 0.23
2.881GlyTyr: 2.881 ± 0.445
0.0GlyXaa: 0.0 ± 0.0
His
0.576HisAla: 0.576 ± 0.216
0.072HisCys: 0.072 ± 0.067
1.008HisAsp: 1.008 ± 0.28
1.512HisGlu: 1.512 ± 0.33
0.72HisPhe: 0.72 ± 0.254
0.936HisGly: 0.936 ± 0.234
0.216HisHis: 0.216 ± 0.116
0.936HisIle: 0.936 ± 0.316
0.864HisLys: 0.864 ± 0.282
0.936HisLeu: 0.936 ± 0.3
0.216HisMet: 0.216 ± 0.121
1.296HisAsn: 1.296 ± 0.338
0.576HisPro: 0.576 ± 0.188
0.576HisGln: 0.576 ± 0.208
0.648HisArg: 0.648 ± 0.169
0.792HisSer: 0.792 ± 0.278
1.224HisThr: 1.224 ± 0.296
1.152HisVal: 1.152 ± 0.325
0.36HisTrp: 0.36 ± 0.167
0.864HisTyr: 0.864 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
5.041IleAla: 5.041 ± 0.806
0.504IleCys: 0.504 ± 0.167
5.185IleAsp: 5.185 ± 0.529
5.905IleGlu: 5.905 ± 0.62
2.232IlePhe: 2.232 ± 0.399
3.385IleGly: 3.385 ± 0.458
1.296IleHis: 1.296 ± 0.383
4.537IleIle: 4.537 ± 0.468
7.129IleLys: 7.129 ± 0.787
4.753IleLeu: 4.753 ± 0.458
1.224IleMet: 1.224 ± 0.251
3.817IleAsn: 3.817 ± 0.579
1.656IlePro: 1.656 ± 0.316
2.376IleGln: 2.376 ± 0.349
2.737IleArg: 2.737 ± 0.52
5.113IleSer: 5.113 ± 0.673
5.185IleThr: 5.185 ± 0.533
4.177IleVal: 4.177 ± 0.556
0.648IleTrp: 0.648 ± 0.253
2.16IleTyr: 2.16 ± 0.386
0.0IleXaa: 0.0 ± 0.0
Lys
6.913LysAla: 6.913 ± 0.734
0.432LysCys: 0.432 ± 0.193
5.689LysAsp: 5.689 ± 0.718
6.553LysGlu: 6.553 ± 0.833
2.593LysPhe: 2.593 ± 0.471
6.121LysGly: 6.121 ± 0.719
1.512LysHis: 1.512 ± 0.39
5.545LysIle: 5.545 ± 0.623
6.337LysLys: 6.337 ± 0.982
7.274LysLeu: 7.274 ± 0.897
2.737LysMet: 2.737 ± 0.354
5.401LysAsn: 5.401 ± 0.595
2.593LysPro: 2.593 ± 0.41
3.961LysGln: 3.961 ± 0.502
4.825LysArg: 4.825 ± 0.69
4.249LysSer: 4.249 ± 0.587
5.617LysThr: 5.617 ± 0.665
6.625LysVal: 6.625 ± 0.799
0.864LysTrp: 0.864 ± 0.218
2.809LysTyr: 2.809 ± 0.481
0.0LysXaa: 0.0 ± 0.0
Leu
6.265LeuAla: 6.265 ± 0.843
0.432LeuCys: 0.432 ± 0.189
6.913LeuAsp: 6.913 ± 0.78
7.85LeuGlu: 7.85 ± 0.734
3.385LeuPhe: 3.385 ± 0.543
5.329LeuGly: 5.329 ± 1.12
1.152LeuHis: 1.152 ± 0.243
4.897LeuIle: 4.897 ± 0.671
8.858LeuLys: 8.858 ± 0.909
7.129LeuLeu: 7.129 ± 0.706
1.728LeuMet: 1.728 ± 0.316
4.321LeuAsn: 4.321 ± 0.473
1.872LeuPro: 1.872 ± 0.316
2.593LeuGln: 2.593 ± 0.407
3.529LeuArg: 3.529 ± 0.482
6.121LeuSer: 6.121 ± 0.762
6.193LeuThr: 6.193 ± 0.644
4.465LeuVal: 4.465 ± 0.463
0.36LeuTrp: 0.36 ± 0.157
1.944LeuTyr: 1.944 ± 0.427
0.0LeuXaa: 0.0 ± 0.0
Met
2.593MetAla: 2.593 ± 0.547
0.0MetCys: 0.0 ± 0.0
0.936MetAsp: 0.936 ± 0.271
1.368MetGlu: 1.368 ± 0.377
1.224MetPhe: 1.224 ± 0.307
1.44MetGly: 1.44 ± 0.293
0.432MetHis: 0.432 ± 0.17
1.872MetIle: 1.872 ± 0.305
1.656MetLys: 1.656 ± 0.398
1.8MetLeu: 1.8 ± 0.365
0.288MetMet: 0.288 ± 0.138
1.008MetAsn: 1.008 ± 0.256
0.72MetPro: 0.72 ± 0.25
1.944MetGln: 1.944 ± 0.366
1.008MetArg: 1.008 ± 0.273
1.944MetSer: 1.944 ± 0.412
2.016MetThr: 2.016 ± 0.341
1.152MetVal: 1.152 ± 0.342
0.144MetTrp: 0.144 ± 0.123
0.576MetTyr: 0.576 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
3.313AsnAla: 3.313 ± 0.676
0.288AsnCys: 0.288 ± 0.14
2.593AsnAsp: 2.593 ± 0.46
2.953AsnGlu: 2.953 ± 0.491
2.304AsnPhe: 2.304 ± 0.44
4.897AsnGly: 4.897 ± 0.69
1.008AsnHis: 1.008 ± 0.262
4.177AsnIle: 4.177 ± 0.634
4.753AsnLys: 4.753 ± 0.615
5.905AsnLeu: 5.905 ± 0.491
0.936AsnMet: 0.936 ± 0.298
2.665AsnAsn: 2.665 ± 0.709
1.584AsnPro: 1.584 ± 0.319
2.232AsnGln: 2.232 ± 0.464
1.656AsnArg: 1.656 ± 0.41
3.241AsnSer: 3.241 ± 0.559
2.449AsnThr: 2.449 ± 0.381
3.025AsnVal: 3.025 ± 0.591
0.72AsnTrp: 0.72 ± 0.19
2.232AsnTyr: 2.232 ± 0.445
0.0AsnXaa: 0.0 ± 0.0
Pro
1.368ProAla: 1.368 ± 0.291
0.072ProCys: 0.072 ± 0.054
1.44ProAsp: 1.44 ± 0.308
2.304ProGlu: 2.304 ± 0.414
1.08ProPhe: 1.08 ± 0.28
1.008ProGly: 1.008 ± 0.478
0.504ProHis: 0.504 ± 0.166
2.088ProIle: 2.088 ± 0.408
2.737ProLys: 2.737 ± 0.56
1.44ProLeu: 1.44 ± 0.339
0.432ProMet: 0.432 ± 0.169
0.936ProAsn: 0.936 ± 0.216
1.008ProPro: 1.008 ± 0.347
0.864ProGln: 0.864 ± 0.293
1.224ProArg: 1.224 ± 0.333
1.584ProSer: 1.584 ± 0.346
1.728ProThr: 1.728 ± 0.36
1.008ProVal: 1.008 ± 0.293
0.216ProTrp: 0.216 ± 0.135
0.72ProTyr: 0.72 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
3.889GlnAla: 3.889 ± 0.879
0.216GlnCys: 0.216 ± 0.126
1.944GlnAsp: 1.944 ± 0.35
3.241GlnGlu: 3.241 ± 0.516
1.584GlnPhe: 1.584 ± 0.359
2.593GlnGly: 2.593 ± 0.47
0.72GlnHis: 0.72 ± 0.205
3.313GlnIle: 3.313 ± 0.442
3.457GlnLys: 3.457 ± 0.556
3.313GlnLeu: 3.313 ± 0.5
0.864GlnMet: 0.864 ± 0.317
2.16GlnAsn: 2.16 ± 0.504
0.504GlnPro: 0.504 ± 0.191
2.232GlnGln: 2.232 ± 0.533
1.872GlnArg: 1.872 ± 0.38
4.537GlnSer: 4.537 ± 0.599
2.16GlnThr: 2.16 ± 0.413
1.728GlnVal: 1.728 ± 0.394
0.36GlnTrp: 0.36 ± 0.147
1.08GlnTyr: 1.08 ± 0.263
0.0GlnXaa: 0.0 ± 0.0
Arg
2.593ArgAla: 2.593 ± 0.404
0.144ArgCys: 0.144 ± 0.094
2.521ArgAsp: 2.521 ± 0.481
2.809ArgGlu: 2.809 ± 0.452
1.584ArgPhe: 1.584 ± 0.426
3.817ArgGly: 3.817 ± 0.632
0.504ArgHis: 0.504 ± 0.19
2.449ArgIle: 2.449 ± 0.46
3.961ArgLys: 3.961 ± 0.804
5.401ArgLeu: 5.401 ± 0.596
1.296ArgMet: 1.296 ± 0.286
2.737ArgAsn: 2.737 ± 0.459
0.504ArgPro: 0.504 ± 0.19
1.872ArgGln: 1.872 ± 0.309
2.376ArgArg: 2.376 ± 0.506
2.521ArgSer: 2.521 ± 0.397
1.656ArgThr: 1.656 ± 0.348
2.809ArgVal: 2.809 ± 0.414
0.576ArgTrp: 0.576 ± 0.199
1.944ArgTyr: 1.944 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
4.249SerAla: 4.249 ± 1.169
0.216SerCys: 0.216 ± 0.125
4.105SerAsp: 4.105 ± 0.537
3.817SerGlu: 3.817 ± 0.595
2.881SerPhe: 2.881 ± 0.376
4.537SerGly: 4.537 ± 0.797
1.008SerHis: 1.008 ± 0.283
4.825SerIle: 4.825 ± 0.532
5.041SerLys: 5.041 ± 0.569
5.185SerLeu: 5.185 ± 0.844
1.728SerMet: 1.728 ± 0.366
3.673SerAsn: 3.673 ± 0.577
1.296SerPro: 1.296 ± 0.266
3.601SerGln: 3.601 ± 0.643
3.241SerArg: 3.241 ± 0.448
4.753SerSer: 4.753 ± 0.785
3.313SerThr: 3.313 ± 0.475
4.609SerVal: 4.609 ± 0.525
0.648SerTrp: 0.648 ± 0.243
2.665SerTyr: 2.665 ± 0.566
0.0SerXaa: 0.0 ± 0.0
Thr
4.897ThrAla: 4.897 ± 1.081
0.144ThrCys: 0.144 ± 0.137
4.393ThrAsp: 4.393 ± 0.516
3.817ThrGlu: 3.817 ± 0.664
3.169ThrPhe: 3.169 ± 0.554
4.897ThrGly: 4.897 ± 0.7
0.792ThrHis: 0.792 ± 0.25
4.177ThrIle: 4.177 ± 0.53
4.897ThrLys: 4.897 ± 0.613
5.113ThrLeu: 5.113 ± 0.602
0.792ThrMet: 0.792 ± 0.208
2.449ThrAsn: 2.449 ± 0.408
2.016ThrPro: 2.016 ± 0.395
2.232ThrGln: 2.232 ± 0.342
2.088ThrArg: 2.088 ± 0.429
3.313ThrSer: 3.313 ± 0.45
4.609ThrThr: 4.609 ± 0.665
4.177ThrVal: 4.177 ± 0.532
0.864ThrTrp: 0.864 ± 0.233
2.449ThrTyr: 2.449 ± 0.593
0.0ThrXaa: 0.0 ± 0.0
Val
4.753ValAla: 4.753 ± 0.924
0.432ValCys: 0.432 ± 0.156
4.465ValAsp: 4.465 ± 0.559
4.969ValGlu: 4.969 ± 0.647
2.521ValPhe: 2.521 ± 0.415
3.529ValGly: 3.529 ± 0.443
0.792ValHis: 0.792 ± 0.193
3.745ValIle: 3.745 ± 0.593
5.689ValLys: 5.689 ± 0.593
4.825ValLeu: 4.825 ± 0.482
0.576ValMet: 0.576 ± 0.203
2.881ValAsn: 2.881 ± 0.462
1.8ValPro: 1.8 ± 0.356
2.449ValGln: 2.449 ± 0.4
2.737ValArg: 2.737 ± 0.324
4.321ValSer: 4.321 ± 0.536
4.537ValThr: 4.537 ± 0.54
5.041ValVal: 5.041 ± 0.644
0.432ValTrp: 0.432 ± 0.168
2.16ValTyr: 2.16 ± 0.534
0.0ValXaa: 0.0 ± 0.0
Trp
0.648TrpAla: 0.648 ± 0.255
0.072TrpCys: 0.072 ± 0.07
0.576TrpAsp: 0.576 ± 0.197
0.504TrpGlu: 0.504 ± 0.196
0.648TrpPhe: 0.648 ± 0.2
0.648TrpGly: 0.648 ± 0.27
0.216TrpHis: 0.216 ± 0.107
0.504TrpIle: 0.504 ± 0.197
0.72TrpLys: 0.72 ± 0.188
1.152TrpLeu: 1.152 ± 0.348
0.288TrpMet: 0.288 ± 0.146
0.288TrpAsn: 0.288 ± 0.143
0.072TrpPro: 0.072 ± 0.08
0.144TrpGln: 0.144 ± 0.088
0.936TrpArg: 0.936 ± 0.283
1.296TrpSer: 1.296 ± 0.271
0.504TrpThr: 0.504 ± 0.244
0.36TrpVal: 0.36 ± 0.154
0.144TrpTrp: 0.144 ± 0.113
0.504TrpTyr: 0.504 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.872TyrAla: 1.872 ± 0.402
0.36TyrCys: 0.36 ± 0.15
3.025TyrAsp: 3.025 ± 0.562
2.016TyrGlu: 2.016 ± 0.369
1.872TyrPhe: 1.872 ± 0.369
1.512TyrGly: 1.512 ± 0.301
0.72TyrHis: 0.72 ± 0.259
2.449TyrIle: 2.449 ± 0.331
2.737TyrLys: 2.737 ± 0.516
3.097TyrLeu: 3.097 ± 0.809
0.648TyrMet: 0.648 ± 0.241
2.521TyrAsn: 2.521 ± 0.381
1.008TyrPro: 1.008 ± 0.27
2.449TyrGln: 2.449 ± 0.458
1.872TyrArg: 1.872 ± 0.422
2.16TyrSer: 2.16 ± 0.343
2.232TyrThr: 2.232 ± 0.423
1.512TyrVal: 1.512 ± 0.383
0.36TyrTrp: 0.36 ± 0.157
1.584TyrTyr: 1.584 ± 0.389
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (13887 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski