Amino acid dipepetide frequency for Arthrobacter phage Maja

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.682AlaAla: 19.682 ± 1.893
0.929AlaCys: 0.929 ± 0.337
8.363AlaAsp: 8.363 ± 0.928
7.096AlaGlu: 7.096 ± 0.979
4.562AlaPhe: 4.562 ± 0.83
12.671AlaGly: 12.671 ± 1.657
1.098AlaHis: 1.098 ± 0.3
4.815AlaIle: 4.815 ± 0.764
6.082AlaLys: 6.082 ± 0.816
11.319AlaLeu: 11.319 ± 1.276
2.703AlaMet: 2.703 ± 0.471
2.45AlaAsn: 2.45 ± 0.435
7.434AlaPro: 7.434 ± 0.781
6.167AlaGln: 6.167 ± 1.039
7.349AlaArg: 7.349 ± 1.16
5.406AlaSer: 5.406 ± 0.931
6.167AlaThr: 6.167 ± 0.61
8.87AlaVal: 8.87 ± 0.904
2.619AlaTrp: 2.619 ± 0.473
1.943AlaTyr: 1.943 ± 0.528
0.0AlaXaa: 0.0 ± 0.0
Cys
0.338CysAla: 0.338 ± 0.166
0.0CysCys: 0.0 ± 0.0
0.422CysAsp: 0.422 ± 0.224
0.422CysGlu: 0.422 ± 0.22
0.169CysPhe: 0.169 ± 0.13
0.845CysGly: 0.845 ± 0.356
0.169CysHis: 0.169 ± 0.114
0.422CysIle: 0.422 ± 0.206
0.422CysLys: 0.422 ± 0.241
0.422CysLeu: 0.422 ± 0.198
0.169CysMet: 0.169 ± 0.11
0.422CysAsn: 0.422 ± 0.182
0.169CysPro: 0.169 ± 0.111
0.338CysGln: 0.338 ± 0.165
0.845CysArg: 0.845 ± 0.268
0.591CysSer: 0.591 ± 0.233
0.422CysThr: 0.422 ± 0.239
0.422CysVal: 0.422 ± 0.195
0.169CysTrp: 0.169 ± 0.121
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.518AspAla: 7.518 ± 0.934
0.422AspCys: 0.422 ± 0.219
3.463AspAsp: 3.463 ± 0.688
3.294AspGlu: 3.294 ± 0.667
2.281AspPhe: 2.281 ± 0.474
5.66AspGly: 5.66 ± 0.719
1.521AspHis: 1.521 ± 0.354
3.041AspIle: 3.041 ± 0.474
2.196AspLys: 2.196 ± 0.414
5.744AspLeu: 5.744 ± 0.716
1.352AspMet: 1.352 ± 0.424
1.689AspAsn: 1.689 ± 0.258
3.126AspPro: 3.126 ± 0.657
2.703AspGln: 2.703 ± 0.52
2.872AspArg: 2.872 ± 0.52
2.027AspSer: 2.027 ± 0.447
3.21AspThr: 3.21 ± 0.435
3.801AspVal: 3.801 ± 0.606
1.436AspTrp: 1.436 ± 0.41
1.605AspTyr: 1.605 ± 0.306
0.0AspXaa: 0.0 ± 0.0
Glu
8.025GluAla: 8.025 ± 0.826
0.507GluCys: 0.507 ± 0.213
4.055GluAsp: 4.055 ± 0.779
3.548GluGlu: 3.548 ± 0.763
1.521GluPhe: 1.521 ± 0.406
3.886GluGly: 3.886 ± 0.667
0.929GluHis: 0.929 ± 0.271
2.027GluIle: 2.027 ± 0.36
2.196GluLys: 2.196 ± 0.43
4.984GluLeu: 4.984 ± 0.874
0.929GluMet: 0.929 ± 0.3
1.689GluAsn: 1.689 ± 0.303
2.534GluPro: 2.534 ± 0.535
2.957GluGln: 2.957 ± 0.561
4.393GluArg: 4.393 ± 0.844
2.788GluSer: 2.788 ± 0.755
3.294GluThr: 3.294 ± 0.642
4.139GluVal: 4.139 ± 0.54
1.183GluTrp: 1.183 ± 0.328
1.267GluTyr: 1.267 ± 0.369
0.0GluXaa: 0.0 ± 0.0
Phe
3.886PheAla: 3.886 ± 0.619
0.253PheCys: 0.253 ± 0.133
2.196PheAsp: 2.196 ± 0.453
1.774PheGlu: 1.774 ± 0.482
0.929PhePhe: 0.929 ± 0.278
2.45PheGly: 2.45 ± 0.564
0.169PheHis: 0.169 ± 0.114
2.027PheIle: 2.027 ± 0.545
1.014PheLys: 1.014 ± 0.291
1.858PheLeu: 1.858 ± 0.382
0.422PheMet: 0.422 ± 0.244
0.76PheAsn: 0.76 ± 0.248
2.027PhePro: 2.027 ± 0.312
1.098PheGln: 1.098 ± 0.388
1.943PheArg: 1.943 ± 0.33
1.858PheSer: 1.858 ± 0.47
1.605PheThr: 1.605 ± 0.289
2.703PheVal: 2.703 ± 0.641
0.507PheTrp: 0.507 ± 0.23
0.338PheTyr: 0.338 ± 0.153
0.0PheXaa: 0.0 ± 0.0
Gly
9.292GlyAla: 9.292 ± 1.676
0.253GlyCys: 0.253 ± 0.148
4.308GlyAsp: 4.308 ± 0.555
4.393GlyGlu: 4.393 ± 0.66
3.548GlyPhe: 3.548 ± 0.849
7.096GlyGly: 7.096 ± 0.912
1.689GlyHis: 1.689 ± 0.392
4.308GlyIle: 4.308 ± 0.649
3.801GlyLys: 3.801 ± 0.46
7.434GlyLeu: 7.434 ± 0.914
3.21GlyMet: 3.21 ± 0.464
2.196GlyAsn: 2.196 ± 0.39
5.913GlyPro: 5.913 ± 1.87
2.957GlyGln: 2.957 ± 0.44
5.491GlyArg: 5.491 ± 0.989
3.632GlySer: 3.632 ± 0.729
6.336GlyThr: 6.336 ± 0.77
5.998GlyVal: 5.998 ± 0.729
2.196GlyTrp: 2.196 ± 0.417
2.788GlyTyr: 2.788 ± 0.499
0.0GlyXaa: 0.0 ± 0.0
His
1.267HisAla: 1.267 ± 0.322
0.084HisCys: 0.084 ± 0.083
1.098HisAsp: 1.098 ± 0.266
1.014HisGlu: 1.014 ± 0.299
0.338HisPhe: 0.338 ± 0.149
1.521HisGly: 1.521 ± 0.316
0.422HisHis: 0.422 ± 0.246
0.845HisIle: 0.845 ± 0.258
0.845HisLys: 0.845 ± 0.227
1.521HisLeu: 1.521 ± 0.38
0.169HisMet: 0.169 ± 0.109
0.422HisAsn: 0.422 ± 0.228
1.183HisPro: 1.183 ± 0.382
1.014HisGln: 1.014 ± 0.357
1.352HisArg: 1.352 ± 0.383
1.098HisSer: 1.098 ± 0.359
0.507HisThr: 0.507 ± 0.212
1.098HisVal: 1.098 ± 0.284
0.422HisTrp: 0.422 ± 0.221
0.169HisTyr: 0.169 ± 0.126
0.0HisXaa: 0.0 ± 0.0
Ile
6.504IleAla: 6.504 ± 0.838
0.169IleCys: 0.169 ± 0.11
2.703IleAsp: 2.703 ± 0.496
3.548IleGlu: 3.548 ± 0.565
1.183IlePhe: 1.183 ± 0.29
3.632IleGly: 3.632 ± 0.743
0.591IleHis: 0.591 ± 0.248
1.858IleIle: 1.858 ± 0.35
2.45IleLys: 2.45 ± 0.677
3.294IleLeu: 3.294 ± 0.504
0.676IleMet: 0.676 ± 0.218
1.352IleAsn: 1.352 ± 0.364
3.041IlePro: 3.041 ± 0.398
2.619IleGln: 2.619 ± 0.504
2.957IleArg: 2.957 ± 0.372
2.45IleSer: 2.45 ± 0.404
3.548IleThr: 3.548 ± 0.619
2.45IleVal: 2.45 ± 0.572
0.507IleTrp: 0.507 ± 0.183
0.591IleTyr: 0.591 ± 0.191
0.0IleXaa: 0.0 ± 0.0
Lys
7.18LysAla: 7.18 ± 0.693
0.253LysCys: 0.253 ± 0.125
2.281LysAsp: 2.281 ± 0.306
1.774LysGlu: 1.774 ± 0.388
0.929LysPhe: 0.929 ± 0.249
3.463LysGly: 3.463 ± 0.552
0.76LysHis: 0.76 ± 0.222
1.605LysIle: 1.605 ± 0.42
1.605LysLys: 1.605 ± 0.436
2.703LysLeu: 2.703 ± 0.478
0.929LysMet: 0.929 ± 0.319
1.521LysAsn: 1.521 ± 0.367
3.294LysPro: 3.294 ± 0.554
2.112LysGln: 2.112 ± 0.58
2.281LysArg: 2.281 ± 0.437
1.943LysSer: 1.943 ± 0.388
2.957LysThr: 2.957 ± 0.47
2.872LysVal: 2.872 ± 0.592
0.591LysTrp: 0.591 ± 0.173
0.929LysTyr: 0.929 ± 0.316
0.0LysXaa: 0.0 ± 0.0
Leu
11.995LeuAla: 11.995 ± 1.014
0.507LeuCys: 0.507 ± 0.185
4.815LeuAsp: 4.815 ± 0.7
5.237LeuGlu: 5.237 ± 0.62
2.281LeuPhe: 2.281 ± 0.419
7.349LeuGly: 7.349 ± 0.911
1.267LeuHis: 1.267 ± 0.292
4.139LeuIle: 4.139 ± 0.807
3.041LeuLys: 3.041 ± 0.395
5.913LeuLeu: 5.913 ± 0.813
2.281LeuMet: 2.281 ± 0.454
2.872LeuAsn: 2.872 ± 0.577
4.984LeuPro: 4.984 ± 0.648
2.365LeuGln: 2.365 ± 0.488
4.899LeuArg: 4.899 ± 0.879
5.068LeuSer: 5.068 ± 0.701
4.815LeuThr: 4.815 ± 0.639
6.251LeuVal: 6.251 ± 0.756
1.436LeuTrp: 1.436 ± 0.384
1.943LeuTyr: 1.943 ± 0.399
0.0LeuXaa: 0.0 ± 0.0
Met
4.055MetAla: 4.055 ± 0.538
0.169MetCys: 0.169 ± 0.116
1.267MetAsp: 1.267 ± 0.321
1.183MetGlu: 1.183 ± 0.331
0.676MetPhe: 0.676 ± 0.244
1.774MetGly: 1.774 ± 0.416
0.422MetHis: 0.422 ± 0.175
1.267MetIle: 1.267 ± 0.271
0.591MetLys: 0.591 ± 0.212
1.605MetLeu: 1.605 ± 0.332
0.338MetMet: 0.338 ± 0.267
0.845MetAsn: 0.845 ± 0.279
0.676MetPro: 0.676 ± 0.333
0.76MetGln: 0.76 ± 0.196
0.76MetArg: 0.76 ± 0.257
1.689MetSer: 1.689 ± 0.257
2.45MetThr: 2.45 ± 0.449
2.112MetVal: 2.112 ± 0.383
0.253MetTrp: 0.253 ± 0.144
0.422MetTyr: 0.422 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
2.957AsnAla: 2.957 ± 0.505
0.084AsnCys: 0.084 ± 0.09
2.196AsnAsp: 2.196 ± 0.392
1.267AsnGlu: 1.267 ± 0.221
0.422AsnPhe: 0.422 ± 0.173
3.886AsnGly: 3.886 ± 0.572
0.76AsnHis: 0.76 ± 0.234
1.521AsnIle: 1.521 ± 0.397
0.929AsnLys: 0.929 ± 0.269
3.21AsnLeu: 3.21 ± 0.424
0.591AsnMet: 0.591 ± 0.224
0.76AsnAsn: 0.76 ± 0.302
2.788AsnPro: 2.788 ± 0.462
0.591AsnGln: 0.591 ± 0.244
1.605AsnArg: 1.605 ± 0.285
1.267AsnSer: 1.267 ± 0.26
1.858AsnThr: 1.858 ± 0.405
2.196AsnVal: 2.196 ± 0.396
0.253AsnTrp: 0.253 ± 0.129
0.507AsnTyr: 0.507 ± 0.208
0.0AsnXaa: 0.0 ± 0.0
Pro
7.18ProAla: 7.18 ± 0.962
0.422ProCys: 0.422 ± 0.25
3.463ProAsp: 3.463 ± 0.573
3.886ProGlu: 3.886 ± 0.654
1.521ProPhe: 1.521 ± 0.304
4.899ProGly: 4.899 ± 0.755
0.591ProHis: 0.591 ± 0.225
3.294ProIle: 3.294 ± 0.453
2.281ProLys: 2.281 ± 0.401
4.308ProLeu: 4.308 ± 0.523
1.521ProMet: 1.521 ± 0.383
1.858ProAsn: 1.858 ± 0.417
2.872ProPro: 2.872 ± 0.679
2.196ProGln: 2.196 ± 0.623
3.717ProArg: 3.717 ± 0.586
2.534ProSer: 2.534 ± 0.548
3.379ProThr: 3.379 ± 0.5
5.913ProVal: 5.913 ± 0.945
0.76ProTrp: 0.76 ± 0.248
0.845ProTyr: 0.845 ± 0.212
0.0ProXaa: 0.0 ± 0.0
Gln
5.237GlnAla: 5.237 ± 0.707
0.422GlnCys: 0.422 ± 0.181
1.689GlnAsp: 1.689 ± 0.392
2.534GlnGlu: 2.534 ± 0.43
1.605GlnPhe: 1.605 ± 0.317
3.126GlnGly: 3.126 ± 0.664
0.338GlnHis: 0.338 ± 0.159
1.521GlnIle: 1.521 ± 0.33
1.267GlnLys: 1.267 ± 0.254
3.632GlnLeu: 3.632 ± 0.58
0.845GlnMet: 0.845 ± 0.316
0.929GlnAsn: 0.929 ± 0.283
2.027GlnPro: 2.027 ± 0.515
1.436GlnGln: 1.436 ± 0.326
3.126GlnArg: 3.126 ± 0.473
2.957GlnSer: 2.957 ± 0.621
2.872GlnThr: 2.872 ± 0.461
3.041GlnVal: 3.041 ± 0.526
1.436GlnTrp: 1.436 ± 0.351
0.845GlnTyr: 0.845 ± 0.249
0.0GlnXaa: 0.0 ± 0.0
Arg
6.758ArgAla: 6.758 ± 1.003
1.014ArgCys: 1.014 ± 0.357
3.97ArgAsp: 3.97 ± 0.609
3.886ArgGlu: 3.886 ± 0.747
1.605ArgPhe: 1.605 ± 0.351
4.055ArgGly: 4.055 ± 0.564
1.605ArgHis: 1.605 ± 0.405
2.872ArgIle: 2.872 ± 0.51
2.703ArgLys: 2.703 ± 0.465
4.646ArgLeu: 4.646 ± 0.832
1.521ArgMet: 1.521 ± 0.366
1.858ArgAsn: 1.858 ± 0.438
3.548ArgPro: 3.548 ± 0.747
2.534ArgGln: 2.534 ± 0.646
4.815ArgArg: 4.815 ± 0.781
4.055ArgSer: 4.055 ± 0.735
3.379ArgThr: 3.379 ± 0.559
4.984ArgVal: 4.984 ± 0.567
1.774ArgTrp: 1.774 ± 0.349
1.943ArgTyr: 1.943 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
6.842SerAla: 6.842 ± 0.814
0.422SerCys: 0.422 ± 0.179
2.365SerAsp: 2.365 ± 0.415
2.534SerGlu: 2.534 ± 0.49
1.098SerPhe: 1.098 ± 0.32
5.744SerGly: 5.744 ± 0.654
0.845SerHis: 0.845 ± 0.302
2.196SerIle: 2.196 ± 0.424
2.619SerLys: 2.619 ± 0.4
4.562SerLeu: 4.562 ± 0.548
1.521SerMet: 1.521 ± 0.357
1.774SerAsn: 1.774 ± 0.375
3.041SerPro: 3.041 ± 0.631
1.605SerGln: 1.605 ± 0.357
3.632SerArg: 3.632 ± 0.737
3.463SerSer: 3.463 ± 0.668
3.717SerThr: 3.717 ± 0.568
3.126SerVal: 3.126 ± 0.541
1.183SerTrp: 1.183 ± 0.244
1.267SerTyr: 1.267 ± 0.37
0.0SerXaa: 0.0 ± 0.0
Thr
7.687ThrAla: 7.687 ± 0.885
0.253ThrCys: 0.253 ± 0.137
3.632ThrAsp: 3.632 ± 0.675
2.872ThrGlu: 2.872 ± 0.554
1.521ThrPhe: 1.521 ± 0.296
7.18ThrGly: 7.18 ± 0.799
1.183ThrHis: 1.183 ± 0.359
2.957ThrIle: 2.957 ± 0.537
3.21ThrLys: 3.21 ± 0.538
5.406ThrLeu: 5.406 ± 0.578
0.845ThrMet: 0.845 ± 0.297
2.112ThrAsn: 2.112 ± 0.497
3.379ThrPro: 3.379 ± 0.539
2.196ThrGln: 2.196 ± 0.403
3.463ThrArg: 3.463 ± 0.496
3.463ThrSer: 3.463 ± 0.648
4.055ThrThr: 4.055 ± 0.885
4.308ThrVal: 4.308 ± 0.515
0.76ThrTrp: 0.76 ± 0.279
1.858ThrTyr: 1.858 ± 0.534
0.0ThrXaa: 0.0 ± 0.0
Val
8.278ValAla: 8.278 ± 0.836
0.338ValCys: 0.338 ± 0.148
3.294ValAsp: 3.294 ± 0.496
4.899ValGlu: 4.899 ± 0.848
2.534ValPhe: 2.534 ± 0.896
4.646ValGly: 4.646 ± 1.234
1.267ValHis: 1.267 ± 0.403
4.308ValIle: 4.308 ± 0.723
3.294ValLys: 3.294 ± 0.503
5.998ValLeu: 5.998 ± 0.809
1.943ValMet: 1.943 ± 0.296
2.957ValAsn: 2.957 ± 0.706
3.041ValPro: 3.041 ± 0.456
3.379ValGln: 3.379 ± 0.608
4.055ValArg: 4.055 ± 0.669
5.153ValSer: 5.153 ± 0.481
4.984ValThr: 4.984 ± 0.723
5.998ValVal: 5.998 ± 0.862
1.267ValTrp: 1.267 ± 0.395
1.521ValTyr: 1.521 ± 0.437
0.0ValXaa: 0.0 ± 0.0
Trp
1.605TrpAla: 1.605 ± 0.376
0.338TrpCys: 0.338 ± 0.195
1.605TrpAsp: 1.605 ± 0.377
0.76TrpGlu: 0.76 ± 0.205
0.591TrpPhe: 0.591 ± 0.187
0.845TrpGly: 0.845 ± 0.245
0.422TrpHis: 0.422 ± 0.208
0.676TrpIle: 0.676 ± 0.224
0.845TrpLys: 0.845 ± 0.323
2.788TrpLeu: 2.788 ± 0.468
0.591TrpMet: 0.591 ± 0.194
0.676TrpAsn: 0.676 ± 0.255
1.098TrpPro: 1.098 ± 0.305
1.014TrpGln: 1.014 ± 0.294
1.605TrpArg: 1.605 ± 0.434
0.676TrpSer: 0.676 ± 0.258
1.267TrpThr: 1.267 ± 0.352
1.352TrpVal: 1.352 ± 0.317
0.338TrpTrp: 0.338 ± 0.129
0.253TrpTyr: 0.253 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.436TyrAla: 1.436 ± 0.379
0.338TyrCys: 0.338 ± 0.152
1.774TyrAsp: 1.774 ± 0.365
0.929TyrGlu: 0.929 ± 0.198
0.507TyrPhe: 0.507 ± 0.166
2.112TyrGly: 2.112 ± 0.506
0.338TyrHis: 0.338 ± 0.185
0.422TyrIle: 0.422 ± 0.289
0.591TyrLys: 0.591 ± 0.224
2.112TyrLeu: 2.112 ± 0.525
0.676TyrMet: 0.676 ± 0.226
0.507TyrAsn: 0.507 ± 0.252
1.267TyrPro: 1.267 ± 0.334
0.845TyrGln: 0.845 ± 0.211
2.45TyrArg: 2.45 ± 0.538
1.436TyrSer: 1.436 ± 0.427
1.521TyrThr: 1.521 ± 0.344
1.521TyrVal: 1.521 ± 0.351
0.253TyrTrp: 0.253 ± 0.14
0.422TyrTyr: 0.422 ± 0.22
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11839 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski