Amino acid dipeptide frequency for Streptococcus phage Javan575

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
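The calculation described above can be sketched in Python as follows. This is only an illustration: the function names, the mapping of every non-standard letter to X (Xaa), and the choice to count overlapping pairs within each protein but never across protein boundaries are assumptions, not details taken from Proteome-pI.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation 1 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2  # 2.5 per mille for 20 letters
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With the 2.5‰ threshold, a value such as 3.607 (AlaAla) counts as overrepresented and 0.314 (AlaCys) as underrepresented. Whether the table's colouring uses exactly this cut-off is not stated in the text.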

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
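A minimal Python sketch of the unit conversion, with hypothetical helper names chosen for illustration:

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# Uniform expectation for 400 dipeptides, expressed in per mille.
UNIFORM_PERMILLE = 1000.0 / 400
```

For example, the tabulated AlaAla value of 3.607‰ corresponds to a fraction of about 0.003607, and the uniform expectation of 2.5‰ corresponds to 0.25%.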

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.607 ± 1.295
AlaCys: 0.314 ± 0.165
AlaAsp: 3.842 ± 0.489
AlaGlu: 3.685 ± 0.639
AlaPhe: 3.136 ± 0.356
AlaGly: 4.626 ± 0.766
AlaHis: 0.47 ± 0.202
AlaIle: 5.959 ± 0.96
AlaLys: 5.802 ± 0.748
AlaLeu: 5.175 ± 0.831
AlaMet: 1.568 ± 0.399
AlaAsn: 3.999 ± 0.763
AlaPro: 1.411 ± 0.307
AlaGln: 2.823 ± 0.778
AlaArg: 3.058 ± 0.484
AlaSer: 4.391 ± 0.757
AlaThr: 3.842 ± 0.599
AlaVal: 3.685 ± 0.457
AlaTrp: 1.176 ± 0.313
AlaTyr: 3.058 ± 0.513
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.549 ± 0.194
CysCys: 0.235 ± 0.141
CysAsp: 0.47 ± 0.201
CysGlu: 0.706 ± 0.199
CysPhe: 0.314 ± 0.164
CysGly: 0.706 ± 0.177
CysHis: 0.314 ± 0.135
CysIle: 0.392 ± 0.2
CysLys: 0.627 ± 0.28
CysLeu: 0.706 ± 0.253
CysMet: 0.0 ± 0.0
CysAsn: 0.314 ± 0.14
CysPro: 0.314 ± 0.135
CysGln: 0.862 ± 0.3
CysArg: 0.392 ± 0.208
CysSer: 0.47 ± 0.211
CysThr: 0.235 ± 0.138
CysVal: 0.627 ± 0.251
CysTrp: 0.0 ± 0.0
CysTyr: 0.627 ± 0.25
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.215 ± 0.414
AspCys: 0.627 ± 0.25
AspAsp: 3.764 ± 0.87
AspGlu: 5.175 ± 0.878
AspPhe: 3.136 ± 0.522
AspGly: 5.175 ± 0.664
AspHis: 0.862 ± 0.271
AspIle: 4.548 ± 0.51
AspLys: 4.077 ± 0.393
AspLeu: 4.861 ± 0.871
AspMet: 2.274 ± 0.449
AspAsn: 2.509 ± 0.433
AspPro: 1.49 ± 0.397
AspGln: 1.333 ± 0.348
AspArg: 2.744 ± 0.56
AspSer: 3.842 ± 0.594
AspThr: 3.058 ± 0.501
AspVal: 2.823 ± 0.459
AspTrp: 1.019 ± 0.245
AspTyr: 3.058 ± 0.766
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.626 ± 0.564
GluCys: 0.627 ± 0.268
GluAsp: 4.469 ± 0.755
GluGlu: 6.429 ± 1.053
GluPhe: 1.882 ± 0.458
GluGly: 4.312 ± 0.486
GluHis: 0.941 ± 0.248
GluIle: 4.548 ± 0.603
GluLys: 6.665 ± 1.007
GluLeu: 8.782 ± 0.825
GluMet: 2.431 ± 0.494
GluAsn: 4.234 ± 0.671
GluPro: 1.176 ± 0.396
GluGln: 4.391 ± 0.536
GluArg: 2.666 ± 0.407
GluSer: 3.528 ± 0.493
GluThr: 5.253 ± 0.682
GluVal: 4.156 ± 0.519
GluTrp: 0.706 ± 0.237
GluTyr: 1.882 ± 0.545
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.352 ± 0.458
PheCys: 0.392 ± 0.149
PheAsp: 2.901 ± 0.58
PheGlu: 2.587 ± 0.425
PhePhe: 1.49 ± 0.376
PheGly: 2.666 ± 0.39
PheHis: 0.784 ± 0.229
PheIle: 1.882 ± 0.395
PheLys: 2.979 ± 0.632
PheLeu: 3.136 ± 0.479
PheMet: 0.784 ± 0.219
PheAsn: 2.274 ± 0.336
PhePro: 0.627 ± 0.226
PheGln: 1.411 ± 0.378
PheArg: 2.117 ± 0.464
PheSer: 2.274 ± 0.485
PheThr: 2.274 ± 0.369
PheVal: 1.803 ± 0.392
PheTrp: 0.784 ± 0.255
PheTyr: 1.882 ± 0.342
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.215 ± 0.639
GlyCys: 0.392 ± 0.15
GlyAsp: 4.234 ± 0.693
GlyGlu: 3.842 ± 0.531
GlyPhe: 2.431 ± 0.381
GlyGly: 4.077 ± 0.742
GlyHis: 1.96 ± 0.397
GlyIle: 5.332 ± 0.797
GlyLys: 4.704 ± 0.562
GlyLeu: 6.194 ± 0.882
GlyMet: 1.803 ± 0.374
GlyAsn: 3.842 ± 0.514
GlyPro: 0.862 ± 0.227
GlyGln: 3.136 ± 0.532
GlyArg: 3.92 ± 0.428
GlySer: 4.234 ± 0.459
GlyThr: 3.92 ± 0.685
GlyVal: 4.234 ± 0.755
GlyTrp: 0.627 ± 0.209
GlyTyr: 2.744 ± 0.475
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.784 ± 0.211
HisCys: 0.157 ± 0.113
HisAsp: 1.098 ± 0.36
HisGlu: 0.941 ± 0.302
HisPhe: 0.862 ± 0.258
HisGly: 1.803 ± 0.31
HisHis: 0.549 ± 0.215
HisIle: 1.255 ± 0.228
HisLys: 0.862 ± 0.241
HisLeu: 1.96 ± 0.394
HisMet: 0.47 ± 0.197
HisAsn: 1.019 ± 0.284
HisPro: 1.176 ± 0.297
HisGln: 1.019 ± 0.387
HisArg: 0.862 ± 0.246
HisSer: 0.706 ± 0.208
HisThr: 1.098 ± 0.267
HisVal: 1.019 ± 0.305
HisTrp: 0.314 ± 0.136
HisTyr: 0.862 ± 0.267
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.783 ± 0.567
IleCys: 0.706 ± 0.228
IleAsp: 4.861 ± 0.428
IleGlu: 4.077 ± 0.575
IlePhe: 1.725 ± 0.456
IleGly: 4.469 ± 0.591
IleHis: 1.019 ± 0.27
IleIle: 3.371 ± 0.475
IleLys: 4.469 ± 0.643
IleLeu: 5.332 ± 0.689
IleMet: 1.019 ± 0.232
IleAsn: 3.371 ± 0.426
IlePro: 2.587 ± 0.311
IleGln: 2.195 ± 0.344
IleArg: 2.744 ± 0.515
IleSer: 5.567 ± 1.284
IleThr: 4.783 ± 1.008
IleVal: 4.626 ± 0.701
IleTrp: 1.098 ± 0.37
IleTyr: 2.509 ± 0.463
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.037 ± 0.873
LysCys: 0.47 ± 0.177
LysAsp: 3.528 ± 0.586
LysGlu: 5.881 ± 0.635
LysPhe: 2.587 ± 0.45
LysGly: 4.312 ± 0.625
LysHis: 1.803 ± 0.382
LysIle: 4.548 ± 0.487
LysLys: 4.548 ± 0.699
LysLeu: 6.273 ± 0.648
LysMet: 1.803 ± 0.42
LysAsn: 3.215 ± 0.638
LysPro: 2.352 ± 0.442
LysGln: 3.685 ± 0.647
LysArg: 3.92 ± 0.612
LysSer: 3.842 ± 0.593
LysThr: 4.156 ± 0.473
LysVal: 4.861 ± 0.625
LysTrp: 1.255 ± 0.323
LysTyr: 2.431 ± 0.648
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.273 ± 0.838
LeuCys: 0.549 ± 0.226
LeuAsp: 5.959 ± 0.693
LeuGlu: 7.841 ± 0.964
LeuPhe: 2.823 ± 0.435
LeuGly: 5.253 ± 0.469
LeuHis: 1.647 ± 0.266
LeuIle: 4.94 ± 0.55
LeuLys: 6.821 ± 0.611
LeuLeu: 7.449 ± 0.873
LeuMet: 1.96 ± 0.333
LeuAsn: 4.391 ± 0.625
LeuPro: 3.215 ± 0.485
LeuGln: 3.842 ± 0.602
LeuArg: 3.528 ± 0.479
LeuSer: 7.449 ± 0.665
LeuThr: 6.743 ± 0.708
LeuVal: 5.802 ± 0.711
LeuTrp: 0.549 ± 0.184
LeuTyr: 3.999 ± 0.748
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.647 ± 0.315
MetCys: 0.157 ± 0.105
MetAsp: 1.96 ± 0.509
MetGlu: 1.647 ± 0.393
MetPhe: 0.862 ± 0.242
MetGly: 1.49 ± 0.437
MetHis: 0.078 ± 0.084
MetIle: 1.568 ± 0.363
MetLys: 1.568 ± 0.338
MetLeu: 0.941 ± 0.267
MetMet: 0.706 ± 0.305
MetAsn: 0.862 ± 0.261
MetPro: 0.627 ± 0.18
MetGln: 0.941 ± 0.296
MetArg: 1.411 ± 0.363
MetSer: 2.117 ± 0.457
MetThr: 1.803 ± 0.403
MetVal: 1.882 ± 0.419
MetTrp: 0.157 ± 0.111
MetTyr: 0.392 ± 0.178
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.312 ± 0.615
AsnCys: 0.314 ± 0.132
AsnAsp: 2.274 ± 0.384
AsnGlu: 3.293 ± 0.659
AsnPhe: 1.882 ± 0.338
AsnGly: 5.332 ± 0.592
AsnHis: 1.49 ± 0.323
AsnIle: 2.274 ± 0.332
AsnLys: 2.823 ± 0.537
AsnLeu: 5.175 ± 0.826
AsnMet: 0.862 ± 0.264
AsnAsn: 1.882 ± 0.333
AsnPro: 2.274 ± 0.368
AsnGln: 2.431 ± 0.339
AsnArg: 2.666 ± 0.534
AsnSer: 2.979 ± 0.4
AsnThr: 2.039 ± 0.425
AsnVal: 2.195 ± 0.561
AsnTrp: 1.019 ± 0.287
AsnTyr: 1.176 ± 0.232
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.098 ± 0.33
ProCys: 0.549 ± 0.182
ProAsp: 1.568 ± 0.334
ProGlu: 1.882 ± 0.358
ProPhe: 1.255 ± 0.274
ProGly: 0.941 ± 0.358
ProHis: 0.627 ± 0.198
ProIle: 1.96 ± 0.468
ProLys: 2.587 ± 0.622
ProLeu: 3.215 ± 0.477
ProMet: 0.392 ± 0.151
ProAsn: 1.49 ± 0.352
ProPro: 1.019 ± 0.314
ProGln: 0.941 ± 0.266
ProArg: 1.568 ± 0.349
ProSer: 2.823 ± 0.501
ProThr: 2.117 ± 0.413
ProVal: 2.274 ± 0.429
ProTrp: 0.392 ± 0.158
ProTyr: 1.176 ± 0.289
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.764 ± 0.685
GlnCys: 0.392 ± 0.163
GlnAsp: 2.039 ± 0.355
GlnGlu: 3.45 ± 0.575
GlnPhe: 1.882 ± 0.3
GlnGly: 1.803 ± 0.405
GlnHis: 0.47 ± 0.181
GlnIle: 2.901 ± 0.479
GlnLys: 2.587 ± 0.598
GlnLeu: 4.626 ± 0.565
GlnMet: 1.255 ± 0.311
GlnAsn: 2.274 ± 0.424
GlnPro: 1.568 ± 0.353
GlnGln: 2.117 ± 0.507
GlnArg: 1.882 ± 0.44
GlnSer: 2.979 ± 0.503
GlnThr: 2.979 ± 0.935
GlnVal: 4.077 ± 0.689
GlnTrp: 0.706 ± 0.279
GlnTyr: 0.784 ± 0.252
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.195 ± 0.407
ArgCys: 0.627 ± 0.215
ArgAsp: 2.509 ± 0.455
ArgGlu: 3.058 ± 0.322
ArgPhe: 1.333 ± 0.379
ArgGly: 2.901 ± 0.445
ArgHis: 0.862 ± 0.27
ArgIle: 3.764 ± 0.624
ArgLys: 3.607 ± 0.675
ArgLeu: 5.175 ± 0.582
ArgMet: 0.627 ± 0.203
ArgAsn: 2.431 ± 0.362
ArgPro: 1.568 ± 0.339
ArgGln: 2.901 ± 0.404
ArgArg: 2.195 ± 0.519
ArgSer: 2.587 ± 0.396
ArgThr: 3.136 ± 0.89
ArgVal: 3.215 ± 0.632
ArgTrp: 1.098 ± 0.238
ArgTyr: 1.333 ± 0.341
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.234 ± 0.731
SerCys: 0.549 ± 0.216
SerAsp: 4.156 ± 0.672
SerGlu: 5.253 ± 0.617
SerPhe: 2.352 ± 0.577
SerGly: 4.548 ± 0.633
SerHis: 1.647 ± 0.367
SerIle: 4.94 ± 0.707
SerLys: 4.469 ± 0.776
SerLeu: 5.724 ± 0.781
SerMet: 1.176 ± 0.281
SerAsn: 2.666 ± 0.635
SerPro: 2.117 ± 0.333
SerGln: 3.136 ± 0.723
SerArg: 2.979 ± 0.461
SerSer: 5.802 ± 1.005
SerThr: 4.626 ± 0.696
SerVal: 4.234 ± 0.526
SerTrp: 1.176 ± 0.266
SerTyr: 2.352 ± 0.464
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.488 ± 0.746
ThrCys: 0.235 ± 0.164
ThrAsp: 2.901 ± 0.601
ThrGlu: 4.626 ± 0.708
ThrPhe: 2.744 ± 0.543
ThrGly: 4.704 ± 0.874
ThrHis: 0.784 ± 0.276
ThrIle: 4.548 ± 0.901
ThrLys: 4.704 ± 0.442
ThrLeu: 6.037 ± 0.475
ThrMet: 1.019 ± 0.283
ThrAsn: 2.431 ± 0.564
ThrPro: 2.039 ± 0.487
ThrGln: 2.352 ± 0.785
ThrArg: 2.509 ± 0.565
ThrSer: 4.704 ± 1.004
ThrThr: 5.567 ± 0.974
ThrVal: 5.645 ± 0.766
ThrTrp: 1.098 ± 0.248
ThrTyr: 2.195 ± 0.513
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.156 ± 0.544
ValCys: 0.549 ± 0.222
ValAsp: 3.685 ± 0.663
ValGlu: 4.94 ± 0.681
ValPhe: 2.117 ± 0.414
ValGly: 3.607 ± 0.665
ValHis: 1.098 ± 0.322
ValIle: 3.999 ± 0.529
ValLys: 4.94 ± 0.749
ValLeu: 6.116 ± 0.617
ValMet: 1.255 ± 0.327
ValAsn: 2.666 ± 0.472
ValPro: 2.195 ± 0.323
ValGln: 2.274 ± 0.372
ValArg: 3.371 ± 0.636
ValSer: 4.391 ± 0.886
ValThr: 5.096 ± 0.636
ValVal: 3.371 ± 0.444
ValTrp: 1.176 ± 0.301
ValTyr: 2.274 ± 0.428
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.019 ± 0.28
TrpCys: 0.235 ± 0.165
TrpAsp: 0.627 ± 0.21
TrpGlu: 1.176 ± 0.292
TrpPhe: 0.784 ± 0.307
TrpGly: 0.862 ± 0.173
TrpHis: 0.314 ± 0.164
TrpIle: 0.784 ± 0.226
TrpLys: 0.706 ± 0.264
TrpLeu: 0.941 ± 0.231
TrpMet: 0.706 ± 0.223
TrpAsn: 1.333 ± 0.344
TrpPro: 0.078 ± 0.076
TrpGln: 0.862 ± 0.314
TrpArg: 0.627 ± 0.216
TrpSer: 1.098 ± 0.347
TrpThr: 1.49 ± 0.353
TrpVal: 1.019 ± 0.238
TrpTrp: 0.157 ± 0.102
TrpTyr: 0.392 ± 0.241
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.274 ± 0.423
TyrCys: 0.784 ± 0.208
TyrAsp: 2.744 ± 0.654
TyrGlu: 3.215 ± 0.599
TyrPhe: 1.803 ± 0.539
TyrGly: 2.117 ± 0.444
TyrHis: 1.098 ± 0.27
TyrIle: 1.803 ± 0.422
TyrLys: 1.96 ± 0.476
TyrLeu: 3.215 ± 0.693
TyrMet: 0.627 ± 0.272
TyrAsn: 1.49 ± 0.445
TyrPro: 1.176 ± 0.264
TyrGln: 1.882 ± 0.315
TyrArg: 1.96 ± 0.443
TyrSer: 2.352 ± 0.509
TyrThr: 2.195 ± 0.438
TyrVal: 1.725 ± 0.341
TyrTrp: 0.706 ± 0.3
TyrTyr: 1.333 ± 0.41
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (12755 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
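A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)
```

Under these assumptions, a dipeptide that never occurs in any protein has a frequency of 0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.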

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski