Amino acid dipepetide frequency for Streptococcus phage Javan583

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.084AlaAla: 3.084 ± 0.864
0.851AlaCys: 0.851 ± 0.369
5.53AlaAsp: 5.53 ± 0.765
5.956AlaGlu: 5.956 ± 0.7
2.978AlaPhe: 2.978 ± 0.529
5.849AlaGly: 5.849 ± 1.41
0.638AlaHis: 0.638 ± 0.306
5.849AlaIle: 5.849 ± 1.072
7.019AlaLys: 7.019 ± 0.73
5.211AlaLeu: 5.211 ± 0.654
2.446AlaMet: 2.446 ± 0.474
4.041AlaAsn: 4.041 ± 0.7
1.17AlaPro: 1.17 ± 0.344
3.829AlaGln: 3.829 ± 0.81
2.552AlaArg: 2.552 ± 0.581
3.935AlaSer: 3.935 ± 0.606
4.573AlaThr: 4.573 ± 0.767
4.36AlaVal: 4.36 ± 0.52
1.808AlaTrp: 1.808 ± 0.508
2.34AlaTyr: 2.34 ± 0.567
0.0AlaXaa: 0.0 ± 0.0
Cys
0.213CysAla: 0.213 ± 0.147
0.106CysCys: 0.106 ± 0.107
0.638CysAsp: 0.638 ± 0.279
0.106CysGlu: 0.106 ± 0.114
0.106CysPhe: 0.106 ± 0.114
0.638CysGly: 0.638 ± 0.284
0.319CysHis: 0.319 ± 0.208
0.0CysIle: 0.0 ± 0.0
0.319CysLys: 0.319 ± 0.175
0.425CysLeu: 0.425 ± 0.222
0.213CysMet: 0.213 ± 0.153
0.213CysAsn: 0.213 ± 0.146
0.319CysPro: 0.319 ± 0.219
0.106CysGln: 0.106 ± 0.117
0.319CysArg: 0.319 ± 0.243
0.106CysSer: 0.106 ± 0.111
0.319CysThr: 0.319 ± 0.197
0.213CysVal: 0.213 ± 0.168
0.0CysTrp: 0.0 ± 0.0
0.319CysTyr: 0.319 ± 0.204
0.0CysXaa: 0.0 ± 0.0
Asp
2.871AspAla: 2.871 ± 0.534
0.425AspCys: 0.425 ± 0.295
2.552AspAsp: 2.552 ± 0.488
5.105AspGlu: 5.105 ± 0.905
3.084AspPhe: 3.084 ± 0.57
5.105AspGly: 5.105 ± 0.669
0.851AspHis: 0.851 ± 0.32
3.829AspIle: 3.829 ± 0.55
3.829AspLys: 3.829 ± 0.665
6.168AspLeu: 6.168 ± 0.823
1.914AspMet: 1.914 ± 0.531
3.084AspAsn: 3.084 ± 0.48
1.702AspPro: 1.702 ± 0.415
1.063AspGln: 1.063 ± 0.316
1.808AspArg: 1.808 ± 0.419
3.297AspSer: 3.297 ± 0.583
3.51AspThr: 3.51 ± 0.69
5.211AspVal: 5.211 ± 0.589
0.957AspTrp: 0.957 ± 0.269
2.127AspTyr: 2.127 ± 0.545
0.0AspXaa: 0.0 ± 0.0
Glu
4.467GluAla: 4.467 ± 0.71
0.106GluCys: 0.106 ± 0.102
4.148GluAsp: 4.148 ± 0.758
4.679GluGlu: 4.679 ± 0.908
2.871GluPhe: 2.871 ± 0.525
3.51GluGly: 3.51 ± 0.571
1.063GluHis: 1.063 ± 0.378
4.998GluIle: 4.998 ± 0.942
5.743GluLys: 5.743 ± 1.167
7.232GluLeu: 7.232 ± 1.233
2.233GluMet: 2.233 ± 0.489
4.254GluAsn: 4.254 ± 0.638
1.914GluPro: 1.914 ± 0.537
3.616GluGln: 3.616 ± 0.726
4.573GluArg: 4.573 ± 0.624
3.616GluSer: 3.616 ± 0.584
4.36GluThr: 4.36 ± 0.695
5.849GluVal: 5.849 ± 0.993
1.063GluTrp: 1.063 ± 0.343
1.808GluTyr: 1.808 ± 0.313
0.0GluXaa: 0.0 ± 0.0
Phe
3.403PheAla: 3.403 ± 0.522
0.106PheCys: 0.106 ± 0.11
3.19PheAsp: 3.19 ± 0.4
3.616PheGlu: 3.616 ± 0.592
1.276PhePhe: 1.276 ± 0.268
3.51PheGly: 3.51 ± 0.704
0.638PheHis: 0.638 ± 0.253
1.595PheIle: 1.595 ± 0.4
2.446PheLys: 2.446 ± 0.434
3.616PheLeu: 3.616 ± 0.664
0.744PheMet: 0.744 ± 0.275
2.021PheAsn: 2.021 ± 0.582
0.957PhePro: 0.957 ± 0.224
1.595PheGln: 1.595 ± 0.47
1.595PheArg: 1.595 ± 0.421
1.808PheSer: 1.808 ± 0.443
2.446PheThr: 2.446 ± 0.547
2.233PheVal: 2.233 ± 0.443
0.213PheTrp: 0.213 ± 0.153
1.595PheTyr: 1.595 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
5.211GlyAla: 5.211 ± 1.066
0.213GlyCys: 0.213 ± 0.171
2.659GlyAsp: 2.659 ± 0.521
3.935GlyGlu: 3.935 ± 0.544
2.871GlyPhe: 2.871 ± 0.769
4.36GlyGly: 4.36 ± 0.876
1.383GlyHis: 1.383 ± 0.509
6.381GlyIle: 6.381 ± 0.942
5.636GlyLys: 5.636 ± 0.866
6.7GlyLeu: 6.7 ± 1.064
2.127GlyMet: 2.127 ± 0.494
3.935GlyAsn: 3.935 ± 0.761
1.17GlyPro: 1.17 ± 0.355
3.935GlyGln: 3.935 ± 0.822
2.978GlyArg: 2.978 ± 0.592
3.19GlySer: 3.19 ± 0.954
4.786GlyThr: 4.786 ± 0.745
3.51GlyVal: 3.51 ± 0.889
0.851GlyTrp: 0.851 ± 0.279
3.51GlyTyr: 3.51 ± 0.687
0.0GlyXaa: 0.0 ± 0.0
His
0.957HisAla: 0.957 ± 0.277
0.106HisCys: 0.106 ± 0.111
0.532HisAsp: 0.532 ± 0.264
0.638HisGlu: 0.638 ± 0.225
1.17HisPhe: 1.17 ± 0.374
1.063HisGly: 1.063 ± 0.407
0.106HisHis: 0.106 ± 0.107
0.957HisIle: 0.957 ± 0.327
0.638HisLys: 0.638 ± 0.306
1.17HisLeu: 1.17 ± 0.286
0.213HisMet: 0.213 ± 0.119
1.063HisAsn: 1.063 ± 0.458
0.744HisPro: 0.744 ± 0.294
0.851HisGln: 0.851 ± 0.382
0.638HisArg: 0.638 ± 0.208
1.063HisSer: 1.063 ± 0.268
0.851HisThr: 0.851 ± 0.313
0.532HisVal: 0.532 ± 0.275
0.319HisTrp: 0.319 ± 0.192
0.425HisTyr: 0.425 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
5.211IleAla: 5.211 ± 0.713
0.319IleCys: 0.319 ± 0.177
4.573IleAsp: 4.573 ± 0.694
6.275IleGlu: 6.275 ± 0.974
2.978IlePhe: 2.978 ± 0.588
4.36IleGly: 4.36 ± 0.703
0.638IleHis: 0.638 ± 0.238
3.616IleIle: 3.616 ± 0.755
4.998IleLys: 4.998 ± 0.549
4.786IleLeu: 4.786 ± 0.57
2.021IleMet: 2.021 ± 0.558
3.722IleAsn: 3.722 ± 0.77
2.765IlePro: 2.765 ± 0.697
2.552IleGln: 2.552 ± 0.523
2.871IleArg: 2.871 ± 0.544
3.829IleSer: 3.829 ± 0.809
3.403IleThr: 3.403 ± 0.66
3.935IleVal: 3.935 ± 0.629
0.532IleTrp: 0.532 ± 0.213
3.616IleTyr: 3.616 ± 0.778
0.0IleXaa: 0.0 ± 0.0
Lys
5.636LysAla: 5.636 ± 0.674
0.213LysCys: 0.213 ± 0.166
4.36LysAsp: 4.36 ± 0.674
5.424LysGlu: 5.424 ± 0.97
2.127LysPhe: 2.127 ± 0.569
4.998LysGly: 4.998 ± 0.594
0.851LysHis: 0.851 ± 0.318
5.105LysIle: 5.105 ± 0.746
6.168LysLys: 6.168 ± 1.093
5.424LysLeu: 5.424 ± 0.676
2.659LysMet: 2.659 ± 0.457
4.148LysAsn: 4.148 ± 0.869
2.021LysPro: 2.021 ± 0.454
3.51LysGln: 3.51 ± 0.606
5.105LysArg: 5.105 ± 0.694
4.041LysSer: 4.041 ± 0.622
4.892LysThr: 4.892 ± 0.739
5.636LysVal: 5.636 ± 0.725
1.063LysTrp: 1.063 ± 0.301
2.765LysTyr: 2.765 ± 0.693
0.0LysXaa: 0.0 ± 0.0
Leu
8.721LeuAla: 8.721 ± 1.085
0.106LeuCys: 0.106 ± 0.076
5.211LeuAsp: 5.211 ± 0.826
6.487LeuGlu: 6.487 ± 1.097
3.19LeuPhe: 3.19 ± 0.445
5.636LeuGly: 5.636 ± 1.106
1.489LeuHis: 1.489 ± 0.366
3.51LeuIle: 3.51 ± 0.841
7.444LeuLys: 7.444 ± 0.961
6.7LeuLeu: 6.7 ± 0.853
2.127LeuMet: 2.127 ± 0.417
5.849LeuAsn: 5.849 ± 0.871
2.233LeuPro: 2.233 ± 0.535
3.403LeuGln: 3.403 ± 0.546
4.041LeuArg: 4.041 ± 0.895
4.998LeuSer: 4.998 ± 0.925
6.275LeuThr: 6.275 ± 0.78
4.573LeuVal: 4.573 ± 0.61
0.532LeuTrp: 0.532 ± 0.217
2.978LeuTyr: 2.978 ± 0.66
0.0LeuXaa: 0.0 ± 0.0
Met
2.021MetAla: 2.021 ± 0.518
0.213MetCys: 0.213 ± 0.157
2.233MetAsp: 2.233 ± 0.535
1.914MetGlu: 1.914 ± 0.612
0.957MetPhe: 0.957 ± 0.267
1.808MetGly: 1.808 ± 0.674
0.319MetHis: 0.319 ± 0.181
2.021MetIle: 2.021 ± 0.472
1.276MetLys: 1.276 ± 0.377
2.659MetLeu: 2.659 ± 0.493
0.425MetMet: 0.425 ± 0.267
0.425MetAsn: 0.425 ± 0.233
0.851MetPro: 0.851 ± 0.302
0.851MetGln: 0.851 ± 0.354
1.808MetArg: 1.808 ± 0.475
1.914MetSer: 1.914 ± 0.377
2.021MetThr: 2.021 ± 0.419
1.383MetVal: 1.383 ± 0.392
0.532MetTrp: 0.532 ± 0.228
1.276MetTyr: 1.276 ± 0.308
0.0MetXaa: 0.0 ± 0.0
Asn
3.084AsnAla: 3.084 ± 0.655
0.425AsnCys: 0.425 ± 0.222
2.552AsnAsp: 2.552 ± 0.626
2.871AsnGlu: 2.871 ± 0.454
2.021AsnPhe: 2.021 ± 0.493
6.275AsnGly: 6.275 ± 1.091
0.638AsnHis: 0.638 ± 0.26
4.573AsnIle: 4.573 ± 0.734
3.935AsnLys: 3.935 ± 0.786
4.679AsnLeu: 4.679 ± 0.976
0.638AsnMet: 0.638 ± 0.254
1.808AsnAsn: 1.808 ± 0.433
2.233AsnPro: 2.233 ± 0.652
2.552AsnGln: 2.552 ± 0.566
3.297AsnArg: 3.297 ± 0.775
3.935AsnSer: 3.935 ± 0.657
2.446AsnThr: 2.446 ± 0.555
2.127AsnVal: 2.127 ± 0.497
0.851AsnTrp: 0.851 ± 0.349
2.021AsnTyr: 2.021 ± 0.493
0.0AsnXaa: 0.0 ± 0.0
Pro
2.021ProAla: 2.021 ± 0.495
0.0ProCys: 0.0 ± 0.0
2.34ProAsp: 2.34 ± 0.668
2.127ProGlu: 2.127 ± 0.42
1.595ProPhe: 1.595 ± 0.458
1.17ProGly: 1.17 ± 0.286
0.532ProHis: 0.532 ± 0.28
2.34ProIle: 2.34 ± 0.569
3.19ProLys: 3.19 ± 0.758
2.233ProLeu: 2.233 ± 0.484
0.638ProMet: 0.638 ± 0.199
1.489ProAsn: 1.489 ± 0.413
0.425ProPro: 0.425 ± 0.218
1.489ProGln: 1.489 ± 0.359
0.744ProArg: 0.744 ± 0.326
2.021ProSer: 2.021 ± 0.516
1.595ProThr: 1.595 ± 0.462
2.446ProVal: 2.446 ± 0.499
0.0ProTrp: 0.0 ± 0.0
0.425ProTyr: 0.425 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
4.148GlnAla: 4.148 ± 0.795
0.106GlnCys: 0.106 ± 0.094
1.914GlnAsp: 1.914 ± 0.37
3.829GlnGlu: 3.829 ± 0.56
1.595GlnPhe: 1.595 ± 0.44
1.276GlnGly: 1.276 ± 0.371
0.319GlnHis: 0.319 ± 0.177
2.871GlnIle: 2.871 ± 0.696
3.403GlnLys: 3.403 ± 0.764
3.616GlnLeu: 3.616 ± 0.914
1.17GlnMet: 1.17 ± 0.465
1.808GlnAsn: 1.808 ± 0.411
0.744GlnPro: 0.744 ± 0.298
2.552GlnGln: 2.552 ± 0.566
1.914GlnArg: 1.914 ± 0.375
4.041GlnSer: 4.041 ± 0.754
3.19GlnThr: 3.19 ± 0.899
3.403GlnVal: 3.403 ± 0.676
0.425GlnTrp: 0.425 ± 0.156
1.489GlnTyr: 1.489 ± 0.43
0.0GlnXaa: 0.0 ± 0.0
Arg
2.233ArgAla: 2.233 ± 0.529
0.319ArgCys: 0.319 ± 0.232
2.34ArgAsp: 2.34 ± 0.58
2.127ArgGlu: 2.127 ± 0.389
1.702ArgPhe: 1.702 ± 0.378
1.808ArgGly: 1.808 ± 0.435
1.276ArgHis: 1.276 ± 0.373
3.935ArgIle: 3.935 ± 0.691
4.573ArgLys: 4.573 ± 0.737
3.829ArgLeu: 3.829 ± 0.673
1.17ArgMet: 1.17 ± 0.381
3.616ArgAsn: 3.616 ± 0.661
1.914ArgPro: 1.914 ± 0.577
1.914ArgGln: 1.914 ± 0.422
2.233ArgArg: 2.233 ± 0.47
3.19ArgSer: 3.19 ± 0.506
3.616ArgThr: 3.616 ± 0.506
3.616ArgVal: 3.616 ± 0.635
0.532ArgTrp: 0.532 ± 0.251
2.552ArgTyr: 2.552 ± 0.503
0.0ArgXaa: 0.0 ± 0.0
Ser
5.424SerAla: 5.424 ± 1.409
0.213SerCys: 0.213 ± 0.147
2.871SerAsp: 2.871 ± 0.605
4.467SerGlu: 4.467 ± 0.786
2.446SerPhe: 2.446 ± 0.559
5.636SerGly: 5.636 ± 1.085
0.425SerHis: 0.425 ± 0.27
3.935SerIle: 3.935 ± 0.673
2.871SerLys: 2.871 ± 0.544
6.275SerLeu: 6.275 ± 1.125
1.914SerMet: 1.914 ± 0.433
3.19SerAsn: 3.19 ± 0.734
1.489SerPro: 1.489 ± 0.352
2.021SerGln: 2.021 ± 0.557
2.871SerArg: 2.871 ± 0.539
4.36SerSer: 4.36 ± 0.739
3.829SerThr: 3.829 ± 0.785
3.829SerVal: 3.829 ± 0.636
0.638SerTrp: 0.638 ± 0.192
2.446SerTyr: 2.446 ± 0.432
0.0SerXaa: 0.0 ± 0.0
Thr
5.743ThrAla: 5.743 ± 0.921
0.425ThrCys: 0.425 ± 0.236
3.403ThrAsp: 3.403 ± 0.645
3.935ThrGlu: 3.935 ± 0.763
2.233ThrPhe: 2.233 ± 0.506
5.317ThrGly: 5.317 ± 0.727
0.957ThrHis: 0.957 ± 0.31
4.786ThrIle: 4.786 ± 0.984
5.211ThrLys: 5.211 ± 0.75
4.786ThrLeu: 4.786 ± 0.651
1.063ThrMet: 1.063 ± 0.314
2.34ThrAsn: 2.34 ± 0.43
2.021ThrPro: 2.021 ± 0.505
2.871ThrGln: 2.871 ± 0.602
2.233ThrArg: 2.233 ± 0.487
4.148ThrSer: 4.148 ± 0.725
3.935ThrThr: 3.935 ± 0.613
4.998ThrVal: 4.998 ± 0.687
0.319ThrTrp: 0.319 ± 0.174
2.127ThrTyr: 2.127 ± 0.45
0.0ThrXaa: 0.0 ± 0.0
Val
4.786ValAla: 4.786 ± 0.656
0.106ValCys: 0.106 ± 0.102
4.786ValAsp: 4.786 ± 0.579
5.105ValGlu: 5.105 ± 1.083
1.808ValPhe: 1.808 ± 0.423
3.19ValGly: 3.19 ± 0.704
0.425ValHis: 0.425 ± 0.228
3.935ValIle: 3.935 ± 0.723
4.573ValLys: 4.573 ± 0.781
5.424ValLeu: 5.424 ± 0.842
1.489ValMet: 1.489 ± 0.381
3.19ValAsn: 3.19 ± 0.594
2.871ValPro: 2.871 ± 0.688
2.765ValGln: 2.765 ± 0.568
3.51ValArg: 3.51 ± 0.546
4.786ValSer: 4.786 ± 0.847
4.36ValThr: 4.36 ± 0.78
3.722ValVal: 3.722 ± 0.74
0.532ValTrp: 0.532 ± 0.258
2.659ValTyr: 2.659 ± 0.551
0.0ValXaa: 0.0 ± 0.0
Trp
1.17TrpAla: 1.17 ± 0.429
0.213TrpCys: 0.213 ± 0.148
0.213TrpAsp: 0.213 ± 0.188
0.957TrpGlu: 0.957 ± 0.336
0.532TrpPhe: 0.532 ± 0.237
1.489TrpGly: 1.489 ± 0.458
0.425TrpHis: 0.425 ± 0.2
1.17TrpIle: 1.17 ± 0.388
0.425TrpLys: 0.425 ± 0.217
0.638TrpLeu: 0.638 ± 0.264
0.319TrpMet: 0.319 ± 0.155
0.851TrpAsn: 0.851 ± 0.262
0.106TrpPro: 0.106 ± 0.117
0.532TrpGln: 0.532 ± 0.235
1.063TrpArg: 1.063 ± 0.319
0.638TrpSer: 0.638 ± 0.174
0.532TrpThr: 0.532 ± 0.203
0.425TrpVal: 0.425 ± 0.217
0.0TrpTrp: 0.0 ± 0.0
0.532TrpTyr: 0.532 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.616TyrAla: 3.616 ± 0.557
0.425TyrCys: 0.425 ± 0.256
2.552TyrAsp: 2.552 ± 0.602
2.552TyrGlu: 2.552 ± 0.701
1.063TyrPhe: 1.063 ± 0.333
2.34TyrGly: 2.34 ± 0.486
0.744TyrHis: 0.744 ± 0.244
1.595TyrIle: 1.595 ± 0.423
2.446TyrLys: 2.446 ± 0.477
3.935TyrLeu: 3.935 ± 0.638
1.276TyrMet: 1.276 ± 0.347
1.914TyrAsn: 1.914 ± 0.536
1.17TyrPro: 1.17 ± 0.328
1.808TyrGln: 1.808 ± 0.401
2.34TyrArg: 2.34 ± 0.544
2.233TyrSer: 2.233 ± 0.465
1.914TyrThr: 1.914 ± 0.475
2.021TyrVal: 2.021 ± 0.363
1.063TyrTrp: 1.063 ± 0.306
1.489TyrTyr: 1.489 ± 0.384
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (9404 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski