Amino acid dipepetide frequency for Streptococcus phage Javan124

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.328AlaAla: 4.328 ± 1.307
0.499AlaCys: 0.499 ± 0.164
4.245AlaAsp: 4.245 ± 0.634
3.662AlaGlu: 3.662 ± 0.577
1.914AlaPhe: 1.914 ± 0.471
4.578AlaGly: 4.578 ± 0.781
0.666AlaHis: 0.666 ± 0.254
4.578AlaIle: 4.578 ± 0.655
5.494AlaLys: 5.494 ± 0.637
5.993AlaLeu: 5.993 ± 1.005
1.748AlaMet: 1.748 ± 0.341
2.997AlaAsn: 2.997 ± 0.399
1.498AlaPro: 1.498 ± 0.336
2.747AlaGln: 2.747 ± 0.739
3.413AlaArg: 3.413 ± 0.496
3.995AlaSer: 3.995 ± 0.601
4.578AlaThr: 4.578 ± 0.803
4.412AlaVal: 4.412 ± 0.706
0.749AlaTrp: 0.749 ± 0.263
3.163AlaTyr: 3.163 ± 0.533
0.0AlaXaa: 0.0 ± 0.0
Cys
0.416CysAla: 0.416 ± 0.18
0.25CysCys: 0.25 ± 0.133
0.499CysAsp: 0.499 ± 0.192
0.583CysGlu: 0.583 ± 0.199
0.166CysPhe: 0.166 ± 0.115
0.583CysGly: 0.583 ± 0.224
0.166CysHis: 0.166 ± 0.107
0.166CysIle: 0.166 ± 0.109
0.333CysLys: 0.333 ± 0.2
0.832CysLeu: 0.832 ± 0.26
0.083CysMet: 0.083 ± 0.082
0.333CysAsn: 0.333 ± 0.188
0.333CysPro: 0.333 ± 0.143
0.583CysGln: 0.583 ± 0.182
0.583CysArg: 0.583 ± 0.238
0.749CysSer: 0.749 ± 0.239
0.25CysThr: 0.25 ± 0.14
0.583CysVal: 0.583 ± 0.233
0.0CysTrp: 0.0 ± 0.0
0.583CysTyr: 0.583 ± 0.215
0.0CysXaa: 0.0 ± 0.0
Asp
3.329AspAla: 3.329 ± 0.558
0.749AspCys: 0.749 ± 0.264
2.58AspAsp: 2.58 ± 0.543
4.744AspGlu: 4.744 ± 0.624
3.246AspPhe: 3.246 ± 0.473
4.911AspGly: 4.911 ± 0.697
1.082AspHis: 1.082 ± 0.367
3.329AspIle: 3.329 ± 0.461
3.829AspLys: 3.829 ± 0.506
5.327AspLeu: 5.327 ± 0.821
2.164AspMet: 2.164 ± 0.472
2.497AspAsn: 2.497 ± 0.536
1.249AspPro: 1.249 ± 0.419
1.831AspGln: 1.831 ± 0.476
2.497AspArg: 2.497 ± 0.57
3.995AspSer: 3.995 ± 0.669
2.497AspThr: 2.497 ± 0.413
3.829AspVal: 3.829 ± 0.513
0.916AspTrp: 0.916 ± 0.247
2.58AspTyr: 2.58 ± 0.548
0.0AspXaa: 0.0 ± 0.0
Glu
4.828GluAla: 4.828 ± 0.747
0.666GluCys: 0.666 ± 0.213
4.412GluAsp: 4.412 ± 0.716
6.492GluGlu: 6.492 ± 1.113
2.497GluPhe: 2.497 ± 0.635
4.328GluGly: 4.328 ± 0.523
1.249GluHis: 1.249 ± 0.368
4.328GluIle: 4.328 ± 0.679
5.993GluLys: 5.993 ± 0.672
8.407GluLeu: 8.407 ± 0.904
1.998GluMet: 1.998 ± 0.484
4.495GluAsn: 4.495 ± 0.573
1.581GluPro: 1.581 ± 0.401
3.496GluGln: 3.496 ± 0.543
2.913GluArg: 2.913 ± 0.364
3.579GluSer: 3.579 ± 0.522
4.495GluThr: 4.495 ± 0.805
4.328GluVal: 4.328 ± 0.512
0.916GluTrp: 0.916 ± 0.274
2.081GluTyr: 2.081 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.747PheAla: 2.747 ± 0.507
0.416PheCys: 0.416 ± 0.198
2.414PheAsp: 2.414 ± 0.372
3.329PheGlu: 3.329 ± 0.51
1.998PhePhe: 1.998 ± 0.481
3.413PheGly: 3.413 ± 0.594
0.583PheHis: 0.583 ± 0.273
2.331PheIle: 2.331 ± 0.59
3.413PheLys: 3.413 ± 0.633
2.83PheLeu: 2.83 ± 0.538
0.916PheMet: 0.916 ± 0.311
2.164PheAsn: 2.164 ± 0.346
0.583PhePro: 0.583 ± 0.238
1.581PheGln: 1.581 ± 0.278
1.748PheArg: 1.748 ± 0.273
3.163PheSer: 3.163 ± 0.482
1.914PheThr: 1.914 ± 0.368
1.914PheVal: 1.914 ± 0.395
0.499PheTrp: 0.499 ± 0.184
2.247PheTyr: 2.247 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
3.662GlyAla: 3.662 ± 0.519
0.333GlyCys: 0.333 ± 0.154
3.829GlyAsp: 3.829 ± 0.59
3.246GlyGlu: 3.246 ± 0.433
2.497GlyPhe: 2.497 ± 0.346
4.911GlyGly: 4.911 ± 0.966
1.914GlyHis: 1.914 ± 0.446
6.076GlyIle: 6.076 ± 0.713
5.077GlyLys: 5.077 ± 0.533
5.494GlyLeu: 5.494 ± 0.606
2.247GlyMet: 2.247 ± 0.454
3.579GlyAsn: 3.579 ± 0.715
0.916GlyPro: 0.916 ± 0.268
3.246GlyGln: 3.246 ± 0.539
4.079GlyArg: 4.079 ± 0.446
4.578GlySer: 4.578 ± 1.052
3.829GlyThr: 3.829 ± 0.547
4.079GlyVal: 4.079 ± 0.54
0.666GlyTrp: 0.666 ± 0.182
2.913GlyTyr: 2.913 ± 0.506
0.0GlyXaa: 0.0 ± 0.0
His
0.666HisAla: 0.666 ± 0.209
0.083HisCys: 0.083 ± 0.087
0.749HisAsp: 0.749 ± 0.234
0.999HisGlu: 0.999 ± 0.326
1.415HisPhe: 1.415 ± 0.371
1.665HisGly: 1.665 ± 0.386
0.749HisHis: 0.749 ± 0.234
1.581HisIle: 1.581 ± 0.35
0.749HisLys: 0.749 ± 0.253
2.081HisLeu: 2.081 ± 0.284
0.25HisMet: 0.25 ± 0.184
0.999HisAsn: 0.999 ± 0.233
1.165HisPro: 1.165 ± 0.347
0.499HisGln: 0.499 ± 0.202
0.749HisArg: 0.749 ± 0.261
1.082HisSer: 1.082 ± 0.297
0.999HisThr: 0.999 ± 0.349
0.916HisVal: 0.916 ± 0.296
0.25HisTrp: 0.25 ± 0.125
0.333HisTyr: 0.333 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
4.744IleAla: 4.744 ± 0.568
0.666IleCys: 0.666 ± 0.203
5.327IleAsp: 5.327 ± 0.583
4.162IleGlu: 4.162 ± 0.627
1.665IlePhe: 1.665 ± 0.439
4.578IleGly: 4.578 ± 0.662
0.749IleHis: 0.749 ± 0.266
3.912IleIle: 3.912 ± 1.021
5.161IleLys: 5.161 ± 0.61
5.161IleLeu: 5.161 ± 0.587
0.999IleMet: 0.999 ± 0.294
2.747IleAsn: 2.747 ± 0.512
2.664IlePro: 2.664 ± 0.447
2.83IleGln: 2.83 ± 0.389
2.58IleArg: 2.58 ± 0.475
4.661IleSer: 4.661 ± 0.857
4.828IleThr: 4.828 ± 0.71
4.245IleVal: 4.245 ± 0.68
1.165IleTrp: 1.165 ± 0.353
1.914IleTyr: 1.914 ± 0.348
0.0IleXaa: 0.0 ± 0.0
Lys
5.66LysAla: 5.66 ± 0.647
0.25LysCys: 0.25 ± 0.123
3.995LysAsp: 3.995 ± 0.698
5.494LysGlu: 5.494 ± 0.591
2.414LysPhe: 2.414 ± 0.583
4.495LysGly: 4.495 ± 0.55
1.914LysHis: 1.914 ± 0.491
3.829LysIle: 3.829 ± 0.562
4.911LysLys: 4.911 ± 0.604
5.41LysLeu: 5.41 ± 0.715
1.831LysMet: 1.831 ± 0.429
2.997LysAsn: 2.997 ± 0.407
2.331LysPro: 2.331 ± 0.471
3.246LysGln: 3.246 ± 0.564
3.912LysArg: 3.912 ± 0.75
5.077LysSer: 5.077 ± 0.695
4.661LysThr: 4.661 ± 0.618
4.911LysVal: 4.911 ± 0.551
1.082LysTrp: 1.082 ± 0.345
2.081LysTyr: 2.081 ± 0.486
0.0LysXaa: 0.0 ± 0.0
Leu
5.993LeuAla: 5.993 ± 0.798
0.499LeuCys: 0.499 ± 0.161
4.994LeuAsp: 4.994 ± 0.499
7.158LeuGlu: 7.158 ± 0.778
2.83LeuPhe: 2.83 ± 0.575
5.327LeuGly: 5.327 ± 0.629
1.415LeuHis: 1.415 ± 0.349
4.495LeuIle: 4.495 ± 0.424
7.491LeuLys: 7.491 ± 0.821
7.741LeuLeu: 7.741 ± 0.614
2.081LeuMet: 2.081 ± 0.401
4.744LeuAsn: 4.744 ± 0.643
3.329LeuPro: 3.329 ± 0.574
3.496LeuGln: 3.496 ± 0.561
4.245LeuArg: 4.245 ± 0.679
6.659LeuSer: 6.659 ± 0.667
6.992LeuThr: 6.992 ± 0.784
6.159LeuVal: 6.159 ± 0.799
0.916LeuTrp: 0.916 ± 0.194
4.412LeuTyr: 4.412 ± 0.764
0.0LeuXaa: 0.0 ± 0.0
Met
1.831MetAla: 1.831 ± 0.424
0.083MetCys: 0.083 ± 0.087
1.332MetAsp: 1.332 ± 0.337
1.665MetGlu: 1.665 ± 0.411
0.666MetPhe: 0.666 ± 0.215
1.998MetGly: 1.998 ± 0.422
0.0MetHis: 0.0 ± 0.0
1.998MetIle: 1.998 ± 0.36
1.998MetLys: 1.998 ± 0.366
1.581MetLeu: 1.581 ± 0.345
0.999MetMet: 0.999 ± 0.288
0.832MetAsn: 0.832 ± 0.238
0.499MetPro: 0.499 ± 0.183
0.832MetGln: 0.832 ± 0.268
1.082MetArg: 1.082 ± 0.235
2.497MetSer: 2.497 ± 0.369
1.998MetThr: 1.998 ± 0.4
1.415MetVal: 1.415 ± 0.324
0.083MetTrp: 0.083 ± 0.078
0.666MetTyr: 0.666 ± 0.252
0.0MetXaa: 0.0 ± 0.0
Asn
3.912AsnAla: 3.912 ± 0.784
0.083AsnCys: 0.083 ± 0.082
1.914AsnAsp: 1.914 ± 0.373
3.746AsnGlu: 3.746 ± 0.552
2.414AsnPhe: 2.414 ± 0.596
4.994AsnGly: 4.994 ± 0.603
0.999AsnHis: 0.999 ± 0.288
2.747AsnIle: 2.747 ± 0.663
2.83AsnLys: 2.83 ± 0.444
3.995AsnLeu: 3.995 ± 0.6
1.249AsnMet: 1.249 ± 0.294
2.414AsnAsn: 2.414 ± 0.609
2.331AsnPro: 2.331 ± 0.356
1.914AsnGln: 1.914 ± 0.316
2.081AsnArg: 2.081 ± 0.35
2.997AsnSer: 2.997 ± 0.654
2.331AsnThr: 2.331 ± 0.576
2.747AsnVal: 2.747 ± 0.451
1.165AsnTrp: 1.165 ± 0.326
1.165AsnTyr: 1.165 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
0.999ProAla: 0.999 ± 0.352
0.333ProCys: 0.333 ± 0.157
1.914ProAsp: 1.914 ± 0.405
2.247ProGlu: 2.247 ± 0.563
0.999ProPhe: 0.999 ± 0.249
1.165ProGly: 1.165 ± 0.309
0.999ProHis: 0.999 ± 0.252
2.164ProIle: 2.164 ± 0.437
2.331ProLys: 2.331 ± 0.656
3.08ProLeu: 3.08 ± 0.488
0.416ProMet: 0.416 ± 0.164
1.748ProAsn: 1.748 ± 0.367
0.832ProPro: 0.832 ± 0.316
1.082ProGln: 1.082 ± 0.258
1.831ProArg: 1.831 ± 0.369
2.997ProSer: 2.997 ± 0.476
2.081ProThr: 2.081 ± 0.487
1.914ProVal: 1.914 ± 0.388
0.416ProTrp: 0.416 ± 0.176
1.498ProTyr: 1.498 ± 0.406
0.0ProXaa: 0.0 ± 0.0
Gln
4.079GlnAla: 4.079 ± 0.63
0.166GlnCys: 0.166 ± 0.092
2.331GlnAsp: 2.331 ± 0.45
3.246GlnGlu: 3.246 ± 0.541
1.748GlnPhe: 1.748 ± 0.33
1.914GlnGly: 1.914 ± 0.319
0.333GlnHis: 0.333 ± 0.146
2.913GlnIle: 2.913 ± 0.452
2.664GlnLys: 2.664 ± 0.512
4.495GlnLeu: 4.495 ± 0.578
1.249GlnMet: 1.249 ± 0.29
1.998GlnAsn: 1.998 ± 0.563
1.581GlnPro: 1.581 ± 0.452
2.164GlnGln: 2.164 ± 0.405
1.415GlnArg: 1.415 ± 0.296
2.83GlnSer: 2.83 ± 0.491
2.414GlnThr: 2.414 ± 0.405
3.08GlnVal: 3.08 ± 0.589
0.749GlnTrp: 0.749 ± 0.3
0.999GlnTyr: 0.999 ± 0.27
0.0GlnXaa: 0.0 ± 0.0
Arg
2.664ArgAla: 2.664 ± 0.599
0.749ArgCys: 0.749 ± 0.261
1.998ArgAsp: 1.998 ± 0.387
3.829ArgGlu: 3.829 ± 0.53
2.331ArgPhe: 2.331 ± 0.482
2.081ArgGly: 2.081 ± 0.419
0.749ArgHis: 0.749 ± 0.231
2.58ArgIle: 2.58 ± 0.426
3.579ArgLys: 3.579 ± 0.741
5.41ArgLeu: 5.41 ± 0.703
0.416ArgMet: 0.416 ± 0.15
2.58ArgAsn: 2.58 ± 0.463
1.332ArgPro: 1.332 ± 0.403
3.413ArgGln: 3.413 ± 0.425
2.247ArgArg: 2.247 ± 0.457
2.664ArgSer: 2.664 ± 0.369
2.331ArgThr: 2.331 ± 0.513
3.746ArgVal: 3.746 ± 0.496
0.749ArgTrp: 0.749 ± 0.232
1.665ArgTyr: 1.665 ± 0.416
0.0ArgXaa: 0.0 ± 0.0
Ser
4.328SerAla: 4.328 ± 0.818
0.583SerCys: 0.583 ± 0.275
4.578SerAsp: 4.578 ± 0.738
4.412SerGlu: 4.412 ± 0.519
3.329SerPhe: 3.329 ± 0.508
5.327SerGly: 5.327 ± 0.799
1.581SerHis: 1.581 ± 0.342
5.077SerIle: 5.077 ± 0.714
4.578SerLys: 4.578 ± 0.586
5.827SerLeu: 5.827 ± 0.634
1.332SerMet: 1.332 ± 0.326
3.163SerAsn: 3.163 ± 0.631
2.164SerPro: 2.164 ± 0.395
2.58SerGln: 2.58 ± 0.382
3.246SerArg: 3.246 ± 0.462
6.243SerSer: 6.243 ± 1.0
4.911SerThr: 4.911 ± 0.592
4.412SerVal: 4.412 ± 0.512
1.249SerTrp: 1.249 ± 0.241
2.414SerTyr: 2.414 ± 0.321
0.0SerXaa: 0.0 ± 0.0
Thr
4.328ThrAla: 4.328 ± 0.61
0.333ThrCys: 0.333 ± 0.194
2.58ThrAsp: 2.58 ± 0.489
4.911ThrGlu: 4.911 ± 0.618
3.329ThrPhe: 3.329 ± 0.666
4.079ThrGly: 4.079 ± 0.51
0.832ThrHis: 0.832 ± 0.21
5.494ThrIle: 5.494 ± 1.207
3.829ThrLys: 3.829 ± 0.498
5.91ThrLeu: 5.91 ± 0.731
1.082ThrMet: 1.082 ± 0.302
2.58ThrAsn: 2.58 ± 0.413
2.747ThrPro: 2.747 ± 0.472
1.581ThrGln: 1.581 ± 0.337
2.664ThrArg: 2.664 ± 0.471
5.41ThrSer: 5.41 ± 0.841
4.661ThrThr: 4.661 ± 0.528
5.077ThrVal: 5.077 ± 0.705
1.165ThrTrp: 1.165 ± 0.408
2.164ThrTyr: 2.164 ± 0.501
0.0ThrXaa: 0.0 ± 0.0
Val
3.579ValAla: 3.579 ± 0.576
0.583ValCys: 0.583 ± 0.207
3.496ValAsp: 3.496 ± 0.589
5.244ValGlu: 5.244 ± 0.838
2.414ValPhe: 2.414 ± 0.351
3.829ValGly: 3.829 ± 0.59
0.916ValHis: 0.916 ± 0.235
4.911ValIle: 4.911 ± 0.615
3.912ValLys: 3.912 ± 0.679
6.576ValLeu: 6.576 ± 0.669
1.581ValMet: 1.581 ± 0.371
2.414ValAsn: 2.414 ± 0.438
2.497ValPro: 2.497 ± 0.516
2.247ValGln: 2.247 ± 0.306
2.913ValArg: 2.913 ± 0.483
4.911ValSer: 4.911 ± 0.72
5.494ValThr: 5.494 ± 0.857
3.662ValVal: 3.662 ± 0.556
0.916ValTrp: 0.916 ± 0.233
2.664ValTyr: 2.664 ± 0.551
0.0ValXaa: 0.0 ± 0.0
Trp
0.749TrpAla: 0.749 ± 0.261
0.166TrpCys: 0.166 ± 0.108
0.499TrpAsp: 0.499 ± 0.198
1.249TrpGlu: 1.249 ± 0.327
0.832TrpPhe: 0.832 ± 0.262
0.583TrpGly: 0.583 ± 0.201
0.166TrpHis: 0.166 ± 0.117
0.499TrpIle: 0.499 ± 0.232
0.666TrpLys: 0.666 ± 0.315
1.332TrpLeu: 1.332 ± 0.269
0.583TrpMet: 0.583 ± 0.17
1.249TrpAsn: 1.249 ± 0.338
0.166TrpPro: 0.166 ± 0.107
0.749TrpGln: 0.749 ± 0.262
0.916TrpArg: 0.916 ± 0.32
0.916TrpSer: 0.916 ± 0.275
1.165TrpThr: 1.165 ± 0.436
1.082TrpVal: 1.082 ± 0.336
0.166TrpTrp: 0.166 ± 0.102
0.25TrpTyr: 0.25 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.414TyrAla: 2.414 ± 0.407
0.583TyrCys: 0.583 ± 0.222
3.496TyrAsp: 3.496 ± 0.67
2.664TyrGlu: 2.664 ± 0.374
1.498TyrPhe: 1.498 ± 0.287
2.414TyrGly: 2.414 ± 0.454
0.916TyrHis: 0.916 ± 0.337
1.748TyrIle: 1.748 ± 0.375
1.665TyrLys: 1.665 ± 0.447
3.496TyrLeu: 3.496 ± 0.637
0.749TyrMet: 0.749 ± 0.237
1.498TyrAsn: 1.498 ± 0.375
1.249TyrPro: 1.249 ± 0.29
2.247TyrGln: 2.247 ± 0.456
1.998TyrArg: 1.998 ± 0.338
2.331TyrSer: 2.331 ± 0.417
2.414TyrThr: 2.414 ± 0.465
2.247TyrVal: 2.247 ± 0.365
0.166TyrTrp: 0.166 ± 0.116
1.581TyrTyr: 1.581 ± 0.393
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (12015 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski