Amino acid dipepetide frequency for Streptococcus phage Javan483

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.149AlaAla: 4.149 ± 1.389
0.498AlaCys: 0.498 ± 0.157
4.315AlaAsp: 4.315 ± 0.542
4.979AlaGlu: 4.979 ± 0.789
2.075AlaPhe: 2.075 ± 0.321
4.564AlaGly: 4.564 ± 0.999
0.747AlaHis: 0.747 ± 0.209
5.062AlaIle: 5.062 ± 0.581
6.721AlaLys: 6.721 ± 0.755
6.804AlaLeu: 6.804 ± 0.976
1.909AlaMet: 1.909 ± 0.421
3.9AlaAsn: 3.9 ± 0.538
1.577AlaPro: 1.577 ± 0.315
3.734AlaGln: 3.734 ± 0.715
2.406AlaArg: 2.406 ± 0.429
4.481AlaSer: 4.481 ± 0.871
4.149AlaThr: 4.149 ± 0.78
4.813AlaVal: 4.813 ± 0.892
0.581AlaTrp: 0.581 ± 0.205
1.909AlaTyr: 1.909 ± 0.288
0.0AlaXaa: 0.0 ± 0.0
Cys
0.249CysAla: 0.249 ± 0.128
0.0CysCys: 0.0 ± 0.0
0.332CysAsp: 0.332 ± 0.169
0.415CysGlu: 0.415 ± 0.193
0.498CysPhe: 0.498 ± 0.199
0.498CysGly: 0.498 ± 0.266
0.166CysHis: 0.166 ± 0.118
0.332CysIle: 0.332 ± 0.155
0.581CysLys: 0.581 ± 0.241
0.581CysLeu: 0.581 ± 0.169
0.166CysMet: 0.166 ± 0.112
0.249CysAsn: 0.249 ± 0.147
0.083CysPro: 0.083 ± 0.077
0.083CysGln: 0.083 ± 0.08
0.332CysArg: 0.332 ± 0.159
0.083CysSer: 0.083 ± 0.081
0.166CysThr: 0.166 ± 0.117
0.332CysVal: 0.332 ± 0.195
0.166CysTrp: 0.166 ± 0.095
0.415CysTyr: 0.415 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
2.738AspAla: 2.738 ± 0.436
0.415AspCys: 0.415 ± 0.174
4.813AspAsp: 4.813 ± 0.698
4.73AspGlu: 4.73 ± 0.805
3.07AspPhe: 3.07 ± 0.562
5.643AspGly: 5.643 ± 0.813
0.996AspHis: 0.996 ± 0.272
4.149AspIle: 4.149 ± 0.651
6.472AspLys: 6.472 ± 0.669
6.638AspLeu: 6.638 ± 0.753
2.075AspMet: 2.075 ± 0.327
3.9AspAsn: 3.9 ± 0.465
1.577AspPro: 1.577 ± 0.319
1.079AspGln: 1.079 ± 0.264
1.245AspArg: 1.245 ± 0.29
3.734AspSer: 3.734 ± 0.657
3.319AspThr: 3.319 ± 0.464
4.315AspVal: 4.315 ± 0.58
0.664AspTrp: 0.664 ± 0.251
2.904AspTyr: 2.904 ± 0.542
0.0AspXaa: 0.0 ± 0.0
Glu
3.983GluAla: 3.983 ± 0.729
0.332GluCys: 0.332 ± 0.15
2.821GluAsp: 2.821 ± 0.519
6.058GluGlu: 6.058 ± 0.852
3.236GluPhe: 3.236 ± 0.499
2.821GluGly: 2.821 ± 0.379
1.411GluHis: 1.411 ± 0.45
6.224GluIle: 6.224 ± 0.695
6.307GluLys: 6.307 ± 0.778
7.8GluLeu: 7.8 ± 0.78
1.245GluMet: 1.245 ± 0.307
3.485GluAsn: 3.485 ± 0.459
1.66GluPro: 1.66 ± 0.429
2.489GluGln: 2.489 ± 0.462
3.153GluArg: 3.153 ± 0.533
4.398GluSer: 4.398 ± 0.642
4.979GluThr: 4.979 ± 0.677
4.813GluVal: 4.813 ± 0.762
1.328GluTrp: 1.328 ± 0.322
3.153GluTyr: 3.153 ± 0.503
0.0GluXaa: 0.0 ± 0.0
Phe
2.572PheAla: 2.572 ± 0.495
0.249PheCys: 0.249 ± 0.139
3.236PheAsp: 3.236 ± 0.447
2.904PheGlu: 2.904 ± 0.516
1.328PhePhe: 1.328 ± 0.402
2.904PheGly: 2.904 ± 0.44
0.498PheHis: 0.498 ± 0.242
2.323PheIle: 2.323 ± 0.397
3.236PheLys: 3.236 ± 0.518
2.655PheLeu: 2.655 ± 0.561
1.245PheMet: 1.245 ± 0.315
2.489PheAsn: 2.489 ± 0.333
0.996PhePro: 0.996 ± 0.271
1.079PheGln: 1.079 ± 0.317
1.66PheArg: 1.66 ± 0.344
2.655PheSer: 2.655 ± 0.509
2.904PheThr: 2.904 ± 0.462
2.489PheVal: 2.489 ± 0.434
0.332PheTrp: 0.332 ± 0.164
1.743PheTyr: 1.743 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
3.402GlyAla: 3.402 ± 0.617
0.415GlyCys: 0.415 ± 0.249
4.149GlyAsp: 4.149 ± 0.554
3.734GlyGlu: 3.734 ± 0.568
2.904GlyPhe: 2.904 ± 0.612
4.896GlyGly: 4.896 ± 0.686
1.079GlyHis: 1.079 ± 0.245
4.647GlyIle: 4.647 ± 0.606
6.638GlyLys: 6.638 ± 0.698
5.643GlyLeu: 5.643 ± 0.937
1.909GlyMet: 1.909 ± 0.386
3.9GlyAsn: 3.9 ± 0.647
1.411GlyPro: 1.411 ± 0.582
1.992GlyGln: 1.992 ± 0.341
2.738GlyArg: 2.738 ± 0.493
2.157GlySer: 2.157 ± 0.412
3.236GlyThr: 3.236 ± 0.516
4.564GlyVal: 4.564 ± 0.534
1.411GlyTrp: 1.411 ± 0.458
3.319GlyTyr: 3.319 ± 0.704
0.0GlyXaa: 0.0 ± 0.0
His
0.996HisAla: 0.996 ± 0.305
0.0HisCys: 0.0 ± 0.0
0.83HisAsp: 0.83 ± 0.336
0.913HisGlu: 0.913 ± 0.213
0.913HisPhe: 0.913 ± 0.268
0.747HisGly: 0.747 ± 0.252
0.166HisHis: 0.166 ± 0.109
1.245HisIle: 1.245 ± 0.325
0.913HisLys: 0.913 ± 0.297
0.996HisLeu: 0.996 ± 0.271
0.083HisMet: 0.083 ± 0.088
0.664HisAsn: 0.664 ± 0.205
0.581HisPro: 0.581 ± 0.251
0.664HisGln: 0.664 ± 0.192
1.079HisArg: 1.079 ± 0.244
0.747HisSer: 0.747 ± 0.266
0.913HisThr: 0.913 ± 0.289
0.747HisVal: 0.747 ± 0.247
0.415HisTrp: 0.415 ± 0.206
0.415HisTyr: 0.415 ± 0.167
0.0HisXaa: 0.0 ± 0.0
Ile
5.394IleAla: 5.394 ± 0.616
0.332IleCys: 0.332 ± 0.169
6.141IleAsp: 6.141 ± 0.83
5.726IleGlu: 5.726 ± 0.716
2.075IlePhe: 2.075 ± 0.471
3.734IleGly: 3.734 ± 0.44
1.245IleHis: 1.245 ± 0.309
4.481IleIle: 4.481 ± 0.611
6.97IleLys: 6.97 ± 0.845
6.058IleLeu: 6.058 ± 0.73
1.079IleMet: 1.079 ± 0.223
4.979IleAsn: 4.979 ± 0.667
2.572IlePro: 2.572 ± 0.465
1.992IleGln: 1.992 ± 0.393
2.157IleArg: 2.157 ± 0.376
3.236IleSer: 3.236 ± 0.468
4.564IleThr: 4.564 ± 0.622
3.402IleVal: 3.402 ± 0.489
0.747IleTrp: 0.747 ± 0.179
2.157IleTyr: 2.157 ± 0.426
0.0IleXaa: 0.0 ± 0.0
Lys
6.555LysAla: 6.555 ± 0.896
0.581LysCys: 0.581 ± 0.186
5.643LysAsp: 5.643 ± 0.858
6.307LysGlu: 6.307 ± 0.677
2.904LysPhe: 2.904 ± 0.429
4.813LysGly: 4.813 ± 0.615
1.328LysHis: 1.328 ± 0.249
7.385LysIle: 7.385 ± 1.007
7.468LysLys: 7.468 ± 0.926
6.555LysLeu: 6.555 ± 0.734
2.323LysMet: 2.323 ± 0.381
6.058LysAsn: 6.058 ± 0.781
2.987LysPro: 2.987 ± 0.615
4.979LysGln: 4.979 ± 0.658
3.568LysArg: 3.568 ± 0.522
6.141LysSer: 6.141 ± 0.677
6.39LysThr: 6.39 ± 0.746
5.145LysVal: 5.145 ± 0.641
0.498LysTrp: 0.498 ± 0.171
3.07LysTyr: 3.07 ± 0.467
0.0LysXaa: 0.0 ± 0.0
Leu
6.39LeuAla: 6.39 ± 1.161
0.166LeuCys: 0.166 ± 0.117
7.136LeuAsp: 7.136 ± 0.68
6.887LeuGlu: 6.887 ± 0.852
2.655LeuPhe: 2.655 ± 0.436
4.979LeuGly: 4.979 ± 0.838
0.913LeuHis: 0.913 ± 0.265
5.394LeuIle: 5.394 ± 0.674
9.46LeuLys: 9.46 ± 0.938
7.883LeuLeu: 7.883 ± 0.848
2.157LeuMet: 2.157 ± 0.341
6.058LeuAsn: 6.058 ± 0.677
3.319LeuPro: 3.319 ± 0.548
3.983LeuGln: 3.983 ± 0.527
4.813LeuArg: 4.813 ± 0.757
5.726LeuSer: 5.726 ± 0.689
5.394LeuThr: 5.394 ± 0.724
4.813LeuVal: 4.813 ± 0.672
0.498LeuTrp: 0.498 ± 0.267
2.738LeuTyr: 2.738 ± 0.458
0.0LeuXaa: 0.0 ± 0.0
Met
2.24MetAla: 2.24 ± 0.393
0.166MetCys: 0.166 ± 0.127
1.328MetAsp: 1.328 ± 0.412
1.162MetGlu: 1.162 ± 0.284
1.245MetPhe: 1.245 ± 0.285
0.83MetGly: 0.83 ± 0.266
0.249MetHis: 0.249 ± 0.144
1.66MetIle: 1.66 ± 0.264
1.577MetLys: 1.577 ± 0.344
2.24MetLeu: 2.24 ± 0.473
0.498MetMet: 0.498 ± 0.217
1.079MetAsn: 1.079 ± 0.297
0.913MetPro: 0.913 ± 0.207
0.498MetGln: 0.498 ± 0.171
1.577MetArg: 1.577 ± 0.337
1.743MetSer: 1.743 ± 0.459
2.655MetThr: 2.655 ± 0.424
1.494MetVal: 1.494 ± 0.332
0.166MetTrp: 0.166 ± 0.097
0.83MetTyr: 0.83 ± 0.31
0.0MetXaa: 0.0 ± 0.0
Asn
4.564AsnAla: 4.564 ± 0.675
0.249AsnCys: 0.249 ± 0.143
3.236AsnAsp: 3.236 ± 0.473
4.066AsnGlu: 4.066 ± 0.662
2.24AsnPhe: 2.24 ± 0.385
5.394AsnGly: 5.394 ± 0.7
0.83AsnHis: 0.83 ± 0.311
4.066AsnIle: 4.066 ± 0.568
4.481AsnLys: 4.481 ± 0.652
5.394AsnLeu: 5.394 ± 0.54
1.826AsnMet: 1.826 ± 0.388
3.402AsnAsn: 3.402 ± 0.59
2.406AsnPro: 2.406 ± 0.361
2.655AsnGln: 2.655 ± 0.571
2.075AsnArg: 2.075 ± 0.426
3.734AsnSer: 3.734 ± 0.568
2.075AsnThr: 2.075 ± 0.411
3.236AsnVal: 3.236 ± 0.423
0.747AsnTrp: 0.747 ± 0.234
1.909AsnTyr: 1.909 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
1.826ProAla: 1.826 ± 0.4
0.083ProCys: 0.083 ± 0.08
2.075ProAsp: 2.075 ± 0.35
1.66ProGlu: 1.66 ± 0.396
1.577ProPhe: 1.577 ± 0.384
1.079ProGly: 1.079 ± 0.27
0.332ProHis: 0.332 ± 0.158
1.826ProIle: 1.826 ± 0.34
2.738ProLys: 2.738 ± 0.533
3.153ProLeu: 3.153 ± 0.602
0.249ProMet: 0.249 ± 0.13
1.494ProAsn: 1.494 ± 0.465
0.332ProPro: 0.332 ± 0.163
1.162ProGln: 1.162 ± 0.303
1.245ProArg: 1.245 ± 0.463
1.909ProSer: 1.909 ± 0.37
2.738ProThr: 2.738 ± 0.45
1.328ProVal: 1.328 ± 0.345
0.166ProTrp: 0.166 ± 0.116
1.079ProTyr: 1.079 ± 0.376
0.0ProXaa: 0.0 ± 0.0
Gln
3.651GlnAla: 3.651 ± 0.448
0.249GlnCys: 0.249 ± 0.175
1.577GlnAsp: 1.577 ± 0.342
2.987GlnGlu: 2.987 ± 0.457
1.909GlnPhe: 1.909 ± 0.348
1.992GlnGly: 1.992 ± 0.361
0.498GlnHis: 0.498 ± 0.179
2.489GlnIle: 2.489 ± 0.457
3.07GlnLys: 3.07 ± 0.435
4.149GlnLeu: 4.149 ± 0.539
1.245GlnMet: 1.245 ± 0.392
2.406GlnAsn: 2.406 ± 0.529
0.664GlnPro: 0.664 ± 0.229
1.66GlnGln: 1.66 ± 0.415
2.157GlnArg: 2.157 ± 0.421
3.734GlnSer: 3.734 ± 0.617
2.075GlnThr: 2.075 ± 0.512
1.411GlnVal: 1.411 ± 0.324
0.581GlnTrp: 0.581 ± 0.255
0.747GlnTyr: 0.747 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
2.489ArgAla: 2.489 ± 0.55
0.332ArgCys: 0.332 ± 0.17
1.743ArgAsp: 1.743 ± 0.495
3.07ArgGlu: 3.07 ± 0.438
1.909ArgPhe: 1.909 ± 0.461
2.738ArgGly: 2.738 ± 0.53
0.664ArgHis: 0.664 ± 0.267
3.485ArgIle: 3.485 ± 0.547
3.983ArgLys: 3.983 ± 0.65
3.817ArgLeu: 3.817 ± 0.569
0.664ArgMet: 0.664 ± 0.223
2.572ArgAsn: 2.572 ± 0.515
0.83ArgPro: 0.83 ± 0.243
1.411ArgGln: 1.411 ± 0.383
2.075ArgArg: 2.075 ± 0.423
1.909ArgSer: 1.909 ± 0.392
3.07ArgThr: 3.07 ± 0.419
2.075ArgVal: 2.075 ± 0.522
0.415ArgTrp: 0.415 ± 0.174
1.328ArgTyr: 1.328 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
5.311SerAla: 5.311 ± 0.961
0.249SerCys: 0.249 ± 0.117
4.232SerAsp: 4.232 ± 0.503
4.647SerGlu: 4.647 ± 0.494
2.572SerPhe: 2.572 ± 0.371
4.066SerGly: 4.066 ± 0.608
0.415SerHis: 0.415 ± 0.218
3.817SerIle: 3.817 ± 0.501
4.315SerLys: 4.315 ± 0.643
5.394SerLeu: 5.394 ± 0.649
1.826SerMet: 1.826 ± 0.306
3.07SerAsn: 3.07 ± 0.567
1.328SerPro: 1.328 ± 0.377
2.987SerGln: 2.987 ± 0.477
1.743SerArg: 1.743 ± 0.379
4.564SerSer: 4.564 ± 0.636
2.987SerThr: 2.987 ± 0.679
4.315SerVal: 4.315 ± 0.599
0.996SerTrp: 0.996 ± 0.385
1.909SerTyr: 1.909 ± 0.384
0.0SerXaa: 0.0 ± 0.0
Thr
5.311ThrAla: 5.311 ± 1.027
0.415ThrCys: 0.415 ± 0.19
3.236ThrAsp: 3.236 ± 0.539
3.402ThrGlu: 3.402 ± 0.587
2.738ThrPhe: 2.738 ± 0.497
5.311ThrGly: 5.311 ± 0.731
0.83ThrHis: 0.83 ± 0.268
3.817ThrIle: 3.817 ± 0.549
5.643ThrLys: 5.643 ± 0.626
5.311ThrLeu: 5.311 ± 0.714
1.577ThrMet: 1.577 ± 0.38
3.153ThrAsn: 3.153 ± 0.434
2.24ThrPro: 2.24 ± 0.411
2.406ThrGln: 2.406 ± 0.433
2.24ThrArg: 2.24 ± 0.387
3.9ThrSer: 3.9 ± 0.609
4.232ThrThr: 4.232 ± 0.615
4.149ThrVal: 4.149 ± 0.532
0.581ThrTrp: 0.581 ± 0.197
1.826ThrTyr: 1.826 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
4.564ValAla: 4.564 ± 0.851
0.415ValCys: 0.415 ± 0.204
4.066ValAsp: 4.066 ± 0.475
4.564ValGlu: 4.564 ± 0.591
1.992ValPhe: 1.992 ± 0.484
4.232ValGly: 4.232 ± 0.597
0.913ValHis: 0.913 ± 0.285
4.066ValIle: 4.066 ± 0.622
5.809ValLys: 5.809 ± 0.619
6.555ValLeu: 6.555 ± 0.706
1.328ValMet: 1.328 ± 0.405
3.319ValAsn: 3.319 ± 0.628
1.079ValPro: 1.079 ± 0.306
1.826ValGln: 1.826 ± 0.412
1.826ValArg: 1.826 ± 0.321
3.983ValSer: 3.983 ± 0.545
4.066ValThr: 4.066 ± 0.579
3.319ValVal: 3.319 ± 0.495
0.249ValTrp: 0.249 ± 0.117
1.66ValTyr: 1.66 ± 0.342
0.0ValXaa: 0.0 ± 0.0
Trp
0.498TrpAla: 0.498 ± 0.17
0.332TrpCys: 0.332 ± 0.155
0.913TrpAsp: 0.913 ± 0.25
0.664TrpGlu: 0.664 ± 0.232
0.581TrpPhe: 0.581 ± 0.258
0.83TrpGly: 0.83 ± 0.334
0.083TrpHis: 0.083 ± 0.092
0.581TrpIle: 0.581 ± 0.257
1.162TrpLys: 1.162 ± 0.293
1.162TrpLeu: 1.162 ± 0.329
0.083TrpMet: 0.083 ± 0.08
0.664TrpAsn: 0.664 ± 0.229
0.166TrpPro: 0.166 ± 0.11
0.664TrpGln: 0.664 ± 0.222
0.747TrpArg: 0.747 ± 0.283
0.415TrpSer: 0.415 ± 0.169
0.415TrpThr: 0.415 ± 0.207
0.581TrpVal: 0.581 ± 0.218
0.083TrpTrp: 0.083 ± 0.077
0.498TrpTyr: 0.498 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.489TyrAla: 2.489 ± 0.451
0.249TyrCys: 0.249 ± 0.144
2.655TyrAsp: 2.655 ± 0.45
2.821TyrGlu: 2.821 ± 0.531
1.079TyrPhe: 1.079 ± 0.346
2.075TyrGly: 2.075 ± 0.493
0.664TyrHis: 0.664 ± 0.23
1.992TyrIle: 1.992 ± 0.455
3.153TyrLys: 3.153 ± 0.438
2.572TyrLeu: 2.572 ± 0.433
0.581TyrMet: 0.581 ± 0.218
1.743TyrAsn: 1.743 ± 0.389
1.328TyrPro: 1.328 ± 0.375
1.909TyrGln: 1.909 ± 0.431
1.743TyrArg: 1.743 ± 0.416
1.494TyrSer: 1.494 ± 0.351
1.992TyrThr: 1.992 ± 0.323
2.489TyrVal: 2.489 ± 0.484
0.498TyrTrp: 0.498 ± 0.215
1.826TyrTyr: 1.826 ± 0.363
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12052 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski