Amino acid dipepetide frequency for Streptococcus phage Javan386

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.317AlaAla: 4.317 ± 1.207
0.345AlaCys: 0.345 ± 0.16
3.54AlaAsp: 3.54 ± 0.507
5.958AlaGlu: 5.958 ± 0.755
3.195AlaPhe: 3.195 ± 0.847
4.749AlaGly: 4.749 ± 1.336
0.95AlaHis: 0.95 ± 0.354
5.785AlaIle: 5.785 ± 1.071
6.303AlaLys: 6.303 ± 0.714
6.39AlaLeu: 6.39 ± 1.134
2.849AlaMet: 2.849 ± 0.557
3.799AlaAsn: 3.799 ± 0.547
1.468AlaPro: 1.468 ± 0.39
2.936AlaGln: 2.936 ± 0.627
2.936AlaArg: 2.936 ± 0.585
4.49AlaSer: 4.49 ± 0.968
4.231AlaThr: 4.231 ± 0.923
4.749AlaVal: 4.749 ± 1.086
0.518AlaTrp: 0.518 ± 0.203
1.813AlaTyr: 1.813 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
0.173CysAla: 0.173 ± 0.108
0.086CysCys: 0.086 ± 0.093
0.345CysAsp: 0.345 ± 0.138
0.345CysGlu: 0.345 ± 0.17
0.173CysPhe: 0.173 ± 0.132
0.604CysGly: 0.604 ± 0.261
0.0CysHis: 0.0 ± 0.0
0.173CysIle: 0.173 ± 0.144
0.432CysLys: 0.432 ± 0.169
0.345CysLeu: 0.345 ± 0.184
0.173CysMet: 0.173 ± 0.112
0.259CysAsn: 0.259 ± 0.132
0.259CysPro: 0.259 ± 0.211
0.086CysGln: 0.086 ± 0.087
0.0CysArg: 0.0 ± 0.0
0.173CysSer: 0.173 ± 0.101
0.173CysThr: 0.173 ± 0.117
0.259CysVal: 0.259 ± 0.144
0.173CysTrp: 0.173 ± 0.118
0.259CysTyr: 0.259 ± 0.144
0.0CysXaa: 0.0 ± 0.0
Asp
4.231AspAla: 4.231 ± 0.646
0.259AspCys: 0.259 ± 0.158
4.922AspAsp: 4.922 ± 0.948
4.145AspGlu: 4.145 ± 0.68
3.368AspPhe: 3.368 ± 0.508
6.735AspGly: 6.735 ± 0.814
0.691AspHis: 0.691 ± 0.254
4.404AspIle: 4.404 ± 0.717
5.354AspLys: 5.354 ± 0.729
4.922AspLeu: 4.922 ± 0.707
1.986AspMet: 1.986 ± 0.402
5.181AspAsn: 5.181 ± 0.704
1.123AspPro: 1.123 ± 0.277
1.813AspGln: 1.813 ± 0.355
2.159AspArg: 2.159 ± 0.528
3.109AspSer: 3.109 ± 0.524
3.713AspThr: 3.713 ± 0.655
3.713AspVal: 3.713 ± 0.429
0.691AspTrp: 0.691 ± 0.219
2.936AspTyr: 2.936 ± 0.562
0.0AspXaa: 0.0 ± 0.0
Glu
2.763GluAla: 2.763 ± 0.427
0.259GluCys: 0.259 ± 0.146
3.54GluAsp: 3.54 ± 0.795
4.836GluGlu: 4.836 ± 0.953
3.54GluPhe: 3.54 ± 0.664
1.727GluGly: 1.727 ± 0.305
0.95GluHis: 0.95 ± 0.284
5.958GluIle: 5.958 ± 0.974
5.613GluLys: 5.613 ± 0.756
6.303GluLeu: 6.303 ± 1.089
2.504GluMet: 2.504 ± 0.513
4.404GluAsn: 4.404 ± 0.571
1.986GluPro: 1.986 ± 0.447
3.886GluGln: 3.886 ± 0.679
3.109GluArg: 3.109 ± 0.72
2.331GluSer: 2.331 ± 0.393
3.54GluThr: 3.54 ± 0.581
4.231GluVal: 4.231 ± 0.639
0.691GluTrp: 0.691 ± 0.253
1.986GluTyr: 1.986 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
2.331PheAla: 2.331 ± 0.508
0.173PheCys: 0.173 ± 0.118
4.663PheAsp: 4.663 ± 0.643
4.404PheGlu: 4.404 ± 0.845
1.123PhePhe: 1.123 ± 0.326
2.849PheGly: 2.849 ± 0.555
0.604PheHis: 0.604 ± 0.217
2.245PheIle: 2.245 ± 0.361
3.799PheLys: 3.799 ± 0.533
3.368PheLeu: 3.368 ± 0.438
1.382PheMet: 1.382 ± 0.312
3.972PheAsn: 3.972 ± 0.444
0.345PhePro: 0.345 ± 0.162
1.727PheGln: 1.727 ± 0.316
1.554PheArg: 1.554 ± 0.368
3.368PheSer: 3.368 ± 0.597
2.331PheThr: 2.331 ± 0.392
2.245PheVal: 2.245 ± 0.37
0.345PheTrp: 0.345 ± 0.16
1.468PheTyr: 1.468 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
4.836GlyAla: 4.836 ± 1.169
0.086GlyCys: 0.086 ± 0.084
3.799GlyAsp: 3.799 ± 0.618
2.849GlyGlu: 2.849 ± 0.522
3.368GlyPhe: 3.368 ± 0.652
4.231GlyGly: 4.231 ± 0.889
0.95GlyHis: 0.95 ± 0.224
7.081GlyIle: 7.081 ± 1.342
4.663GlyLys: 4.663 ± 0.652
5.181GlyLeu: 5.181 ± 0.92
2.072GlyMet: 2.072 ± 0.494
3.281GlyAsn: 3.281 ± 0.549
0.086GlyPro: 0.086 ± 0.081
3.627GlyGln: 3.627 ± 0.495
1.986GlyArg: 1.986 ± 0.368
3.972GlySer: 3.972 ± 0.795
4.922GlyThr: 4.922 ± 0.814
5.095GlyVal: 5.095 ± 0.684
0.863GlyTrp: 0.863 ± 0.283
2.418GlyTyr: 2.418 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
0.95HisAla: 0.95 ± 0.222
0.345HisCys: 0.345 ± 0.181
0.777HisAsp: 0.777 ± 0.286
0.863HisGlu: 0.863 ± 0.259
0.691HisPhe: 0.691 ± 0.291
0.345HisGly: 0.345 ± 0.17
0.173HisHis: 0.173 ± 0.104
0.777HisIle: 0.777 ± 0.263
0.604HisLys: 0.604 ± 0.255
0.604HisLeu: 0.604 ± 0.255
0.432HisMet: 0.432 ± 0.201
0.95HisAsn: 0.95 ± 0.37
0.95HisPro: 0.95 ± 0.287
0.518HisGln: 0.518 ± 0.179
0.777HisArg: 0.777 ± 0.243
0.604HisSer: 0.604 ± 0.239
0.95HisThr: 0.95 ± 0.254
0.863HisVal: 0.863 ± 0.193
0.259HisTrp: 0.259 ± 0.138
0.777HisTyr: 0.777 ± 0.294
0.0HisXaa: 0.0 ± 0.0
Ile
6.649IleAla: 6.649 ± 0.895
0.604IleCys: 0.604 ± 0.216
6.303IleAsp: 6.303 ± 0.732
5.872IleGlu: 5.872 ± 0.732
2.59IlePhe: 2.59 ± 0.388
5.181IleGly: 5.181 ± 1.146
1.123IleHis: 1.123 ± 0.3
4.663IleIle: 4.663 ± 0.587
7.253IleLys: 7.253 ± 1.085
4.317IleLeu: 4.317 ± 0.482
0.604IleMet: 0.604 ± 0.217
4.922IleAsn: 4.922 ± 0.711
2.849IlePro: 2.849 ± 0.502
1.468IleGln: 1.468 ± 0.322
2.418IleArg: 2.418 ± 0.472
7.426IleSer: 7.426 ± 0.912
5.526IleThr: 5.526 ± 0.723
4.576IleVal: 4.576 ± 0.598
0.604IleTrp: 0.604 ± 0.244
2.59IleTyr: 2.59 ± 0.497
0.0IleXaa: 0.0 ± 0.0
Lys
6.908LysAla: 6.908 ± 0.901
0.259LysCys: 0.259 ± 0.192
3.886LysAsp: 3.886 ± 0.678
6.131LysGlu: 6.131 ± 1.039
2.936LysPhe: 2.936 ± 0.407
4.836LysGly: 4.836 ± 0.587
1.123LysHis: 1.123 ± 0.355
6.303LysIle: 6.303 ± 0.847
7.512LysLys: 7.512 ± 1.483
6.131LysLeu: 6.131 ± 0.781
1.382LysMet: 1.382 ± 0.366
6.303LysAsn: 6.303 ± 0.868
2.936LysPro: 2.936 ± 0.586
3.368LysGln: 3.368 ± 0.461
2.849LysArg: 2.849 ± 0.628
5.785LysSer: 5.785 ± 0.715
4.058LysThr: 4.058 ± 0.62
5.958LysVal: 5.958 ± 0.635
0.95LysTrp: 0.95 ± 0.304
3.454LysTyr: 3.454 ± 0.51
0.0LysXaa: 0.0 ± 0.0
Leu
5.613LeuAla: 5.613 ± 0.785
0.345LeuCys: 0.345 ± 0.174
6.217LeuAsp: 6.217 ± 0.728
4.058LeuGlu: 4.058 ± 0.744
2.936LeuPhe: 2.936 ± 0.459
4.231LeuGly: 4.231 ± 0.709
0.95LeuHis: 0.95 ± 0.413
5.008LeuIle: 5.008 ± 0.638
6.39LeuLys: 6.39 ± 1.003
4.404LeuLeu: 4.404 ± 0.636
1.727LeuMet: 1.727 ± 0.397
5.354LeuAsn: 5.354 ± 0.628
2.59LeuPro: 2.59 ± 0.471
2.763LeuGln: 2.763 ± 0.572
2.418LeuArg: 2.418 ± 0.517
6.044LeuSer: 6.044 ± 0.749
5.181LeuThr: 5.181 ± 0.617
3.627LeuVal: 3.627 ± 0.6
0.777LeuTrp: 0.777 ± 0.42
2.763LeuTyr: 2.763 ± 0.623
0.0LeuXaa: 0.0 ± 0.0
Met
2.159MetAla: 2.159 ± 0.645
0.259MetCys: 0.259 ± 0.131
1.295MetAsp: 1.295 ± 0.362
1.468MetGlu: 1.468 ± 0.417
0.777MetPhe: 0.777 ± 0.2
1.468MetGly: 1.468 ± 0.323
0.691MetHis: 0.691 ± 0.254
1.986MetIle: 1.986 ± 0.439
1.641MetLys: 1.641 ± 0.39
1.727MetLeu: 1.727 ± 0.365
1.554MetMet: 1.554 ± 0.401
0.95MetAsn: 0.95 ± 0.254
1.036MetPro: 1.036 ± 0.293
1.813MetGln: 1.813 ± 0.317
0.95MetArg: 0.95 ± 0.252
2.331MetSer: 2.331 ± 0.362
2.159MetThr: 2.159 ± 0.393
1.382MetVal: 1.382 ± 0.286
0.518MetTrp: 0.518 ± 0.224
0.345MetTyr: 0.345 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
5.008AsnAla: 5.008 ± 0.851
0.432AsnCys: 0.432 ± 0.165
4.576AsnAsp: 4.576 ± 0.688
4.317AsnGlu: 4.317 ± 0.742
3.281AsnPhe: 3.281 ± 0.475
4.663AsnGly: 4.663 ± 0.771
1.382AsnHis: 1.382 ± 0.399
5.526AsnIle: 5.526 ± 0.59
4.663AsnLys: 4.663 ± 0.676
3.886AsnLeu: 3.886 ± 0.609
1.641AsnMet: 1.641 ± 0.411
4.49AsnAsn: 4.49 ± 0.99
2.849AsnPro: 2.849 ± 0.53
2.849AsnGln: 2.849 ± 0.825
1.295AsnArg: 1.295 ± 0.362
4.49AsnSer: 4.49 ± 0.671
3.022AsnThr: 3.022 ± 0.454
3.713AsnVal: 3.713 ± 0.469
0.863AsnTrp: 0.863 ± 0.299
2.072AsnTyr: 2.072 ± 0.384
0.0AsnXaa: 0.0 ± 0.0
Pro
2.59ProAla: 2.59 ± 0.576
0.086ProCys: 0.086 ± 0.087
1.813ProAsp: 1.813 ± 0.381
0.863ProGlu: 0.863 ± 0.365
1.813ProPhe: 1.813 ± 0.458
1.123ProGly: 1.123 ± 0.414
0.086ProHis: 0.086 ± 0.07
1.727ProIle: 1.727 ± 0.445
1.813ProLys: 1.813 ± 0.37
1.9ProLeu: 1.9 ± 0.389
0.432ProMet: 0.432 ± 0.198
1.641ProAsn: 1.641 ± 0.401
0.777ProPro: 0.777 ± 0.168
1.554ProGln: 1.554 ± 0.446
0.777ProArg: 0.777 ± 0.223
1.986ProSer: 1.986 ± 0.365
2.418ProThr: 2.418 ± 0.739
2.245ProVal: 2.245 ± 0.34
0.173ProTrp: 0.173 ± 0.126
1.295ProTyr: 1.295 ± 0.368
0.0ProXaa: 0.0 ± 0.0
Gln
2.763GlnAla: 2.763 ± 0.444
0.086GlnCys: 0.086 ± 0.09
1.468GlnAsp: 1.468 ± 0.325
2.418GlnGlu: 2.418 ± 0.534
1.9GlnPhe: 1.9 ± 0.335
3.109GlnGly: 3.109 ± 0.802
0.259GlnHis: 0.259 ± 0.135
3.713GlnIle: 3.713 ± 0.576
3.713GlnLys: 3.713 ± 0.433
3.454GlnLeu: 3.454 ± 0.398
1.382GlnMet: 1.382 ± 0.32
2.072GlnAsn: 2.072 ± 0.561
1.209GlnPro: 1.209 ± 0.314
2.936GlnGln: 2.936 ± 0.826
1.554GlnArg: 1.554 ± 0.336
4.058GlnSer: 4.058 ± 0.512
1.986GlnThr: 1.986 ± 0.445
2.763GlnVal: 2.763 ± 0.365
0.345GlnTrp: 0.345 ± 0.166
1.295GlnTyr: 1.295 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
1.813ArgAla: 1.813 ± 0.389
0.0ArgCys: 0.0 ± 0.0
2.072ArgAsp: 2.072 ± 0.406
1.9ArgGlu: 1.9 ± 0.465
1.382ArgPhe: 1.382 ± 0.415
1.9ArgGly: 1.9 ± 0.438
0.259ArgHis: 0.259 ± 0.14
2.159ArgIle: 2.159 ± 0.388
3.281ArgLys: 3.281 ± 0.531
3.368ArgLeu: 3.368 ± 0.632
1.209ArgMet: 1.209 ± 0.301
2.418ArgAsn: 2.418 ± 0.416
0.518ArgPro: 0.518 ± 0.267
1.295ArgGln: 1.295 ± 0.273
1.554ArgArg: 1.554 ± 0.421
1.727ArgSer: 1.727 ± 0.403
2.072ArgThr: 2.072 ± 0.369
2.331ArgVal: 2.331 ± 0.362
0.518ArgTrp: 0.518 ± 0.212
1.813ArgTyr: 1.813 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
6.39SerAla: 6.39 ± 1.528
0.173SerCys: 0.173 ± 0.122
4.404SerAsp: 4.404 ± 0.664
4.231SerGlu: 4.231 ± 0.704
2.418SerPhe: 2.418 ± 0.566
4.404SerGly: 4.404 ± 0.819
0.604SerHis: 0.604 ± 0.27
5.267SerIle: 5.267 ± 0.578
5.699SerLys: 5.699 ± 0.701
5.181SerLeu: 5.181 ± 0.62
1.382SerMet: 1.382 ± 0.371
4.836SerAsn: 4.836 ± 0.684
1.641SerPro: 1.641 ± 0.299
2.331SerGln: 2.331 ± 0.439
1.986SerArg: 1.986 ± 0.344
4.836SerSer: 4.836 ± 0.825
5.613SerThr: 5.613 ± 0.895
3.886SerVal: 3.886 ± 0.549
1.036SerTrp: 1.036 ± 0.275
3.368SerTyr: 3.368 ± 0.676
0.0SerXaa: 0.0 ± 0.0
Thr
4.317ThrAla: 4.317 ± 0.948
0.259ThrCys: 0.259 ± 0.136
3.54ThrAsp: 3.54 ± 0.484
3.454ThrGlu: 3.454 ± 0.607
3.022ThrPhe: 3.022 ± 0.521
4.576ThrGly: 4.576 ± 0.836
0.777ThrHis: 0.777 ± 0.282
6.735ThrIle: 6.735 ± 0.806
5.267ThrLys: 5.267 ± 0.619
3.799ThrLeu: 3.799 ± 0.504
1.295ThrMet: 1.295 ± 0.351
4.317ThrAsn: 4.317 ± 0.923
1.468ThrPro: 1.468 ± 0.359
2.504ThrGln: 2.504 ± 0.423
1.295ThrArg: 1.295 ± 0.268
4.317ThrSer: 4.317 ± 0.755
3.281ThrThr: 3.281 ± 0.515
5.095ThrVal: 5.095 ± 0.763
0.777ThrTrp: 0.777 ± 0.254
2.245ThrTyr: 2.245 ± 0.376
0.0ThrXaa: 0.0 ± 0.0
Val
5.008ValAla: 5.008 ± 0.983
0.173ValCys: 0.173 ± 0.099
4.749ValAsp: 4.749 ± 0.541
3.022ValGlu: 3.022 ± 0.455
3.022ValPhe: 3.022 ± 0.417
5.267ValGly: 5.267 ± 1.06
0.95ValHis: 0.95 ± 0.27
5.181ValIle: 5.181 ± 0.851
4.836ValLys: 4.836 ± 0.708
3.627ValLeu: 3.627 ± 0.54
1.468ValMet: 1.468 ± 0.324
3.713ValAsn: 3.713 ± 0.443
2.245ValPro: 2.245 ± 0.401
2.245ValGln: 2.245 ± 0.457
1.986ValArg: 1.986 ± 0.39
4.576ValSer: 4.576 ± 0.621
3.972ValThr: 3.972 ± 0.578
2.677ValVal: 2.677 ± 0.395
0.777ValTrp: 0.777 ± 0.273
2.504ValTyr: 2.504 ± 0.554
0.0ValXaa: 0.0 ± 0.0
Trp
0.432TrpAla: 0.432 ± 0.196
0.0TrpCys: 0.0 ± 0.0
0.432TrpAsp: 0.432 ± 0.186
0.604TrpGlu: 0.604 ± 0.188
0.432TrpPhe: 0.432 ± 0.2
0.604TrpGly: 0.604 ± 0.211
0.345TrpHis: 0.345 ± 0.152
0.691TrpIle: 0.691 ± 0.236
1.468TrpLys: 1.468 ± 0.328
1.209TrpLeu: 1.209 ± 0.263
0.432TrpMet: 0.432 ± 0.179
0.345TrpAsn: 0.345 ± 0.166
0.086TrpPro: 0.086 ± 0.092
0.604TrpGln: 0.604 ± 0.24
0.432TrpArg: 0.432 ± 0.184
1.036TrpSer: 1.036 ± 0.257
0.777TrpThr: 0.777 ± 0.295
0.863TrpVal: 0.863 ± 0.253
0.259TrpTrp: 0.259 ± 0.151
0.432TrpTyr: 0.432 ± 0.215
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.072TyrAla: 2.072 ± 0.385
0.259TyrCys: 0.259 ± 0.212
3.109TyrAsp: 3.109 ± 0.562
2.418TyrGlu: 2.418 ± 0.527
2.331TyrPhe: 2.331 ± 0.564
2.763TyrGly: 2.763 ± 0.656
0.432TyrHis: 0.432 ± 0.266
2.159TyrIle: 2.159 ± 0.472
2.763TyrLys: 2.763 ± 0.456
3.281TyrLeu: 3.281 ± 0.625
0.518TyrMet: 0.518 ± 0.195
1.813TyrAsn: 1.813 ± 0.443
0.691TyrPro: 0.691 ± 0.219
2.159TyrGln: 2.159 ± 0.533
1.554TyrArg: 1.554 ± 0.463
3.022TyrSer: 3.022 ± 0.644
2.418TyrThr: 2.418 ± 0.338
1.727TyrVal: 1.727 ± 0.415
0.345TyrTrp: 0.345 ± 0.17
1.641TyrTyr: 1.641 ± 0.353
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11582 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski