Amino acid dipeptide frequency for Gordonia phage KatherineG

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
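The counting described above could be sketched roughly as follows. This is only an illustration, not the Proteome-pI implementation: the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and it assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumptions (not taken from the source): pairs are counted within
    each protein only, and non-standard letters are mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides, in per mille (2.5).
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

toy_proteins = ["MAAGL", "MAL"]  # hypothetical sequences, not phage data
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}represented)")
```

With only a handful of residues, every observed pair is far above 2.5‰; the comparison becomes meaningful only for a whole proteome.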

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
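As a quick worked conversion, the sketch below turns one per-mille value into a fraction and a percentage and relates it to the uniform expectation of 1/400. The AlaAla value is taken from the table; the helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = 10.093        # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400  # per-mille frequency if all 400 dipeptides were equally likely

fraction = per_mille_to_fraction(ala_ala)
print(f"fraction: {fraction:.6f}")
print(f"percent: {fraction * 100:.4f}%")
print(f"fold over uniform: {ala_ala / uniform:.2f}x")
```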

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.093 ± 1.036
AlaCys: 0.748 ± 0.206
AlaAsp: 5.483 ± 0.682
AlaGlu: 7.227 ± 0.721
AlaPhe: 3.925 ± 0.547
AlaGly: 7.538 ± 0.869
AlaHis: 1.433 ± 0.331
AlaIle: 4.112 ± 0.386
AlaLys: 4.486 ± 0.588
AlaLeu: 6.355 ± 0.773
AlaMet: 2.43 ± 0.31
AlaAsn: 1.994 ± 0.4
AlaPro: 4.984 ± 0.834
AlaGln: 3.613 ± 0.433
AlaArg: 5.233 ± 0.608
AlaSer: 5.171 ± 0.568
AlaThr: 5.856 ± 0.704
AlaVal: 6.978 ± 0.604
AlaTrp: 1.869 ± 0.304
AlaTyr: 2.118 ± 0.328
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.748 ± 0.214
CysCys: 0.187 ± 0.139
CysAsp: 0.685 ± 0.224
CysGlu: 0.249 ± 0.126
CysPhe: 0.312 ± 0.158
CysGly: 0.81 ± 0.274
CysHis: 0.374 ± 0.163
CysIle: 0.249 ± 0.128
CysLys: 0.498 ± 0.163
CysLeu: 0.872 ± 0.207
CysMet: 0.312 ± 0.155
CysAsn: 0.498 ± 0.145
CysPro: 0.561 ± 0.218
CysGln: 0.249 ± 0.16
CysArg: 0.436 ± 0.161
CysSer: 0.81 ± 0.239
CysThr: 0.249 ± 0.128
CysVal: 0.498 ± 0.167
CysTrp: 0.249 ± 0.116
CysTyr: 0.312 ± 0.147
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.046 ± 0.642
AspCys: 0.623 ± 0.265
AspAsp: 5.296 ± 0.78
AspGlu: 4.361 ± 0.554
AspPhe: 2.43 ± 0.346
AspGly: 5.856 ± 0.629
AspHis: 1.682 ± 0.374
AspIle: 3.053 ± 0.511
AspLys: 2.118 ± 0.428
AspLeu: 6.106 ± 0.715
AspMet: 1.059 ± 0.239
AspAsn: 2.118 ± 0.357
AspPro: 5.046 ± 0.594
AspGln: 2.118 ± 0.345
AspArg: 3.115 ± 0.538
AspSer: 3.177 ± 0.428
AspThr: 4.05 ± 0.522
AspVal: 3.987 ± 0.463
AspTrp: 1.308 ± 0.29
AspTyr: 2.804 ± 0.437
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.722 ± 0.817
GluCys: 0.125 ± 0.097
GluAsp: 4.984 ± 0.572
GluGlu: 3.925 ± 0.674
GluPhe: 2.367 ± 0.395
GluGly: 5.42 ± 0.569
GluHis: 1.744 ± 0.36
GluIle: 4.112 ± 0.529
GluLys: 3.053 ± 0.461
GluLeu: 6.729 ± 0.656
GluMet: 1.744 ± 0.345
GluAsn: 1.931 ± 0.314
GluPro: 2.243 ± 0.434
GluGln: 2.181 ± 0.418
GluArg: 4.236 ± 0.497
GluSer: 2.99 ± 0.38
GluThr: 4.174 ± 0.57
GluVal: 5.545 ± 0.635
GluTrp: 1.558 ± 0.283
GluTyr: 1.744 ± 0.301
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.302 ± 0.499
PheCys: 0.249 ± 0.148
PheAsp: 3.053 ± 0.388
PheGlu: 3.053 ± 0.54
PhePhe: 1.121 ± 0.263
PheGly: 3.302 ± 0.439
PheHis: 0.997 ± 0.25
PheIle: 1.246 ± 0.253
PheLys: 1.433 ± 0.277
PheLeu: 1.931 ± 0.333
PheMet: 0.997 ± 0.215
PheAsn: 1.744 ± 0.298
PhePro: 1.807 ± 0.327
PheGln: 0.81 ± 0.229
PheArg: 1.433 ± 0.252
PheSer: 2.617 ± 0.452
PheThr: 2.056 ± 0.313
PheVal: 2.367 ± 0.37
PheTrp: 0.374 ± 0.145
PheTyr: 0.748 ± 0.267
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.919 ± 0.62
GlyCys: 1.059 ± 0.249
GlyAsp: 5.732 ± 0.635
GlyGlu: 5.669 ± 0.614
GlyPhe: 2.99 ± 0.517
GlyGly: 7.725 ± 1.398
GlyHis: 2.243 ± 0.389
GlyIle: 4.05 ± 0.587
GlyLys: 3.987 ± 0.47
GlyLeu: 6.23 ± 0.771
GlyMet: 1.807 ± 0.366
GlyAsn: 2.741 ± 0.327
GlyPro: 3.302 ± 0.489
GlyGln: 2.305 ± 0.379
GlyArg: 4.673 ± 0.614
GlySer: 4.86 ± 0.578
GlyThr: 5.545 ± 0.672
GlyVal: 4.984 ± 0.618
GlyTrp: 2.056 ± 0.374
GlyTyr: 3.302 ± 0.489
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.994 ± 0.412
HisCys: 0.249 ± 0.119
HisAsp: 1.184 ± 0.248
HisGlu: 1.495 ± 0.385
HisPhe: 0.685 ± 0.215
HisGly: 1.682 ± 0.374
HisHis: 0.374 ± 0.174
HisIle: 1.246 ± 0.28
HisLys: 0.685 ± 0.234
HisLeu: 1.994 ± 0.403
HisMet: 0.312 ± 0.132
HisAsn: 1.059 ± 0.274
HisPro: 0.997 ± 0.239
HisGln: 0.561 ± 0.191
HisArg: 2.118 ± 0.418
HisSer: 0.935 ± 0.3
HisThr: 1.121 ± 0.241
HisVal: 1.184 ± 0.26
HisTrp: 0.312 ± 0.139
HisTyr: 0.935 ± 0.269
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.299 ± 0.483
IleCys: 0.498 ± 0.153
IleAsp: 3.613 ± 0.463
IleGlu: 4.423 ± 0.625
IlePhe: 1.433 ± 0.229
IleGly: 3.8 ± 0.626
IleHis: 1.184 ± 0.289
IleIle: 2.43 ± 0.38
IleLys: 1.62 ± 0.315
IleLeu: 3.8 ± 0.415
IleMet: 0.872 ± 0.234
IleAsn: 1.994 ± 0.425
IlePro: 3.302 ± 0.384
IleGln: 1.495 ± 0.296
IleArg: 2.99 ± 0.343
IleSer: 3.115 ± 0.386
IleThr: 3.613 ± 0.43
IleVal: 3.551 ± 0.452
IleTrp: 0.748 ± 0.19
IleTyr: 1.059 ± 0.209
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.046 ± 0.62
LysCys: 0.249 ± 0.141
LysAsp: 2.305 ± 0.322
LysGlu: 3.302 ± 0.491
LysPhe: 1.246 ± 0.246
LysGly: 3.302 ± 0.418
LysHis: 0.623 ± 0.19
LysIle: 2.243 ± 0.381
LysLys: 3.302 ± 0.614
LysLeu: 4.86 ± 0.465
LysMet: 0.935 ± 0.25
LysAsn: 1.121 ± 0.252
LysPro: 3.053 ± 0.545
LysGln: 1.931 ± 0.317
LysArg: 2.741 ± 0.497
LysSer: 2.367 ± 0.415
LysThr: 2.305 ± 0.408
LysVal: 3.738 ± 0.523
LysTrp: 0.872 ± 0.223
LysTyr: 1.059 ± 0.261
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.414 ± 0.766
LeuCys: 0.561 ± 0.172
LeuAsp: 4.984 ± 0.672
LeuGlu: 6.915 ± 0.567
LeuPhe: 2.679 ± 0.485
LeuGly: 6.542 ± 0.677
LeuHis: 1.62 ± 0.325
LeuIle: 4.361 ± 0.534
LeuLys: 3.053 ± 0.605
LeuLeu: 5.42 ± 0.718
LeuMet: 2.367 ± 0.478
LeuAsn: 2.617 ± 0.308
LeuPro: 4.361 ± 0.475
LeuGln: 2.056 ± 0.3
LeuArg: 5.109 ± 0.563
LeuSer: 4.984 ± 0.577
LeuThr: 4.922 ± 0.686
LeuVal: 6.355 ± 0.641
LeuTrp: 1.682 ± 0.281
LeuTyr: 2.492 ± 0.415
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.305 ± 0.342
MetCys: 0.187 ± 0.099
MetAsp: 0.935 ± 0.208
MetGlu: 1.246 ± 0.207
MetPhe: 1.246 ± 0.314
MetGly: 1.371 ± 0.293
MetHis: 0.374 ± 0.152
MetIle: 1.371 ± 0.318
MetLys: 1.059 ± 0.24
MetLeu: 1.807 ± 0.338
MetMet: 0.748 ± 0.194
MetAsn: 0.748 ± 0.27
MetPro: 1.682 ± 0.342
MetGln: 0.685 ± 0.22
MetArg: 1.869 ± 0.327
MetSer: 2.367 ± 0.401
MetThr: 2.928 ± 0.361
MetVal: 1.184 ± 0.288
MetTrp: 0.125 ± 0.089
MetTyr: 0.623 ± 0.236
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.115 ± 0.56
AsnCys: 0.561 ± 0.193
AsnAsp: 1.682 ± 0.281
AsnGlu: 1.807 ± 0.373
AsnPhe: 0.935 ± 0.221
AsnGly: 3.613 ± 0.521
AsnHis: 0.623 ± 0.177
AsnIle: 1.495 ± 0.417
AsnLys: 1.682 ± 0.327
AsnLeu: 3.053 ± 0.411
AsnMet: 0.685 ± 0.168
AsnAsn: 0.81 ± 0.202
AsnPro: 2.679 ± 0.332
AsnGln: 1.433 ± 0.257
AsnArg: 1.807 ± 0.363
AsnSer: 1.869 ± 0.364
AsnThr: 2.741 ± 0.362
AsnVal: 1.744 ± 0.291
AsnTrp: 0.561 ± 0.176
AsnTyr: 1.433 ± 0.298
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.61 ± 0.804
ProCys: 0.374 ± 0.153
ProAsp: 3.8 ± 0.447
ProGlu: 4.361 ± 0.517
ProPhe: 1.869 ± 0.34
ProGly: 4.922 ± 0.558
ProHis: 0.935 ± 0.22
ProIle: 2.804 ± 0.425
ProLys: 2.99 ± 0.442
ProLeu: 2.617 ± 0.453
ProMet: 1.433 ± 0.279
ProAsn: 2.118 ± 0.407
ProPro: 2.243 ± 0.453
ProGln: 1.495 ± 0.326
ProArg: 3.115 ± 0.422
ProSer: 2.928 ± 0.314
ProThr: 4.423 ± 0.452
ProVal: 3.738 ± 0.441
ProTrp: 0.81 ± 0.224
ProTyr: 1.682 ± 0.381
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.551 ± 0.456
GlnCys: 0.312 ± 0.145
GlnAsp: 1.558 ± 0.358
GlnGlu: 2.056 ± 0.353
GlnPhe: 1.184 ± 0.275
GlnGly: 1.994 ± 0.369
GlnHis: 0.623 ± 0.207
GlnIle: 1.869 ± 0.272
GlnLys: 1.246 ± 0.308
GlnLeu: 3.613 ± 0.437
GlnMet: 1.059 ± 0.251
GlnAsn: 1.121 ± 0.267
GlnPro: 1.246 ± 0.357
GlnGln: 0.997 ± 0.24
GlnArg: 2.181 ± 0.387
GlnSer: 1.558 ± 0.254
GlnThr: 1.994 ± 0.371
GlnVal: 2.43 ± 0.436
GlnTrp: 0.748 ± 0.184
GlnTyr: 1.246 ± 0.2
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.797 ± 0.52
ArgCys: 0.748 ± 0.274
ArgAsp: 3.613 ± 0.483
ArgGlu: 3.863 ± 0.623
ArgPhe: 2.243 ± 0.372
ArgGly: 3.489 ± 0.503
ArgHis: 1.62 ± 0.355
ArgIle: 3.987 ± 0.479
ArgLys: 2.928 ± 0.461
ArgLeu: 5.046 ± 0.567
ArgMet: 1.869 ± 0.285
ArgAsn: 2.243 ± 0.374
ArgPro: 2.43 ± 0.388
ArgGln: 1.682 ± 0.323
ArgArg: 5.171 ± 0.609
ArgSer: 3.613 ± 0.559
ArgThr: 2.928 ± 0.423
ArgVal: 4.174 ± 0.402
ArgTrp: 1.059 ± 0.255
ArgTyr: 2.492 ± 0.576
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.42 ± 0.597
SerCys: 0.374 ± 0.18
SerAsp: 2.866 ± 0.454
SerGlu: 3.489 ± 0.419
SerPhe: 2.243 ± 0.37
SerGly: 5.607 ± 0.714
SerHis: 0.935 ± 0.234
SerIle: 2.492 ± 0.334
SerLys: 2.679 ± 0.413
SerLeu: 5.669 ± 0.812
SerMet: 1.994 ± 0.358
SerAsn: 2.679 ± 0.502
SerPro: 2.181 ± 0.297
SerGln: 2.305 ± 0.331
SerArg: 3.427 ± 0.386
SerSer: 3.551 ± 0.542
SerThr: 3.925 ± 0.626
SerVal: 4.174 ± 0.561
SerTrp: 1.433 ± 0.285
SerTyr: 1.433 ± 0.295
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.732 ± 0.621
ThrCys: 0.872 ± 0.257
ThrAsp: 4.486 ± 0.438
ThrGlu: 3.8 ± 0.57
ThrPhe: 1.869 ± 0.358
ThrGly: 5.483 ± 0.592
ThrHis: 1.184 ± 0.293
ThrIle: 3.24 ± 0.451
ThrLys: 4.112 ± 0.603
ThrLeu: 5.233 ± 0.461
ThrMet: 1.308 ± 0.302
ThrAsn: 1.744 ± 0.309
ThrPro: 4.548 ± 0.466
ThrGln: 2.305 ± 0.41
ThrArg: 2.679 ± 0.417
ThrSer: 4.112 ± 0.548
ThrThr: 3.925 ± 0.515
ThrVal: 5.109 ± 0.535
ThrTrp: 1.558 ± 0.259
ThrTyr: 2.492 ± 0.403
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.545 ± 0.6
ValCys: 0.81 ± 0.226
ValAsp: 5.296 ± 0.535
ValGlu: 4.797 ± 0.475
ValPhe: 1.869 ± 0.4
ValGly: 4.86 ± 0.492
ValHis: 1.308 ± 0.275
ValIle: 3.489 ± 0.616
ValLys: 3.676 ± 0.438
ValLeu: 5.296 ± 0.706
ValMet: 1.246 ± 0.22
ValAsn: 2.866 ± 0.447
ValPro: 4.361 ± 0.637
ValGln: 2.305 ± 0.372
ValArg: 4.486 ± 0.497
ValSer: 4.361 ± 0.546
ValThr: 4.984 ± 0.552
ValVal: 4.548 ± 0.627
ValTrp: 1.246 ± 0.3
ValTyr: 2.305 ± 0.392
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.62 ± 0.327
TrpCys: 0.374 ± 0.166
TrpAsp: 1.495 ± 0.285
TrpGlu: 1.558 ± 0.318
TrpPhe: 0.81 ± 0.215
TrpGly: 1.184 ± 0.286
TrpHis: 0.498 ± 0.16
TrpIle: 0.935 ± 0.242
TrpLys: 0.685 ± 0.197
TrpLeu: 0.997 ± 0.203
TrpMet: 0.374 ± 0.149
TrpAsn: 0.872 ± 0.253
TrpPro: 0.872 ± 0.254
TrpGln: 0.997 ± 0.234
TrpArg: 1.059 ± 0.239
TrpSer: 1.371 ± 0.291
TrpThr: 1.495 ± 0.229
TrpVal: 1.308 ± 0.291
TrpTrp: 0.623 ± 0.213
TrpTyr: 0.436 ± 0.155
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.367 ± 0.412
TyrCys: 0.0 ± 0.0
TyrAsp: 2.305 ± 0.392
TyrGlu: 1.931 ± 0.323
TyrPhe: 1.184 ± 0.377
TyrGly: 2.679 ± 0.353
TyrHis: 0.748 ± 0.215
TyrIle: 1.059 ± 0.271
TyrLys: 1.371 ± 0.336
TyrLeu: 2.866 ± 0.476
TyrMet: 1.246 ± 0.328
TyrAsn: 1.308 ± 0.304
TyrPro: 1.371 ± 0.276
TyrGln: 1.059 ± 0.245
TyrArg: 2.118 ± 0.45
TyrSer: 2.118 ± 0.419
TyrThr: 2.554 ± 0.454
TyrVal: 2.056 ± 0.349
TyrTrp: 0.374 ± 0.181
TyrTyr: 0.81 ± 0.186
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (16052 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
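The exact bootstrap procedure is not described on this page. The sketch below shows one common way a protein-level bootstrap could be done, assuming that whole proteins are resampled with replacement 100 times and that the standard deviation of the resulting frequencies is reported as the error. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the
    standard deviation of the dipeptide frequency (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAGLAA", "MKLAV", "MAAT", "MGGSAA"]  # hypothetical sequences
point = dipeptide_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AlaAla: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.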

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski