Amino acid dipepetide frequency for Microbacterium phage Kieran

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.22AlaAla: 14.22 ± 1.291
0.154AlaCys: 0.154 ± 0.115
6.687AlaAsp: 6.687 ± 0.798
7.533AlaGlu: 7.533 ± 0.788
3.689AlaPhe: 3.689 ± 0.552
12.375AlaGly: 12.375 ± 0.96
1.614AlaHis: 1.614 ± 0.353
4.689AlaIle: 4.689 ± 0.715
6.149AlaLys: 6.149 ± 0.769
11.53AlaLeu: 11.53 ± 0.9
2.152AlaMet: 2.152 ± 0.379
3.843AlaAsn: 3.843 ± 0.646
6.303AlaPro: 6.303 ± 0.528
4.919AlaGln: 4.919 ± 0.628
7.456AlaArg: 7.456 ± 0.753
5.15AlaSer: 5.15 ± 0.707
5.995AlaThr: 5.995 ± 0.798
8.378AlaVal: 8.378 ± 0.913
2.537AlaTrp: 2.537 ± 0.519
2.69AlaTyr: 2.69 ± 0.507
0.0AlaXaa: 0.0 ± 0.0
Cys
0.231CysAla: 0.231 ± 0.125
0.077CysCys: 0.077 ± 0.071
0.231CysAsp: 0.231 ± 0.149
0.077CysGlu: 0.077 ± 0.067
0.307CysPhe: 0.307 ± 0.164
0.615CysGly: 0.615 ± 0.237
0.0CysHis: 0.0 ± 0.0
0.077CysIle: 0.077 ± 0.072
0.307CysLys: 0.307 ± 0.166
0.077CysLeu: 0.077 ± 0.078
0.077CysMet: 0.077 ± 0.078
0.231CysAsn: 0.231 ± 0.138
0.615CysPro: 0.615 ± 0.216
0.077CysGln: 0.077 ± 0.066
0.307CysArg: 0.307 ± 0.143
0.231CysSer: 0.231 ± 0.118
0.154CysThr: 0.154 ± 0.105
0.231CysVal: 0.231 ± 0.133
0.077CysTrp: 0.077 ± 0.08
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.38AspAla: 6.38 ± 0.848
0.384AspCys: 0.384 ± 0.198
3.228AspAsp: 3.228 ± 0.5
3.305AspGlu: 3.305 ± 0.53
1.845AspPhe: 1.845 ± 0.342
6.995AspGly: 6.995 ± 0.858
1.384AspHis: 1.384 ± 0.354
1.768AspIle: 1.768 ± 0.318
1.691AspLys: 1.691 ± 0.526
6.687AspLeu: 6.687 ± 0.933
1.076AspMet: 1.076 ± 0.316
0.922AspAsn: 0.922 ± 0.258
4.074AspPro: 4.074 ± 0.579
2.075AspGln: 2.075 ± 0.359
3.228AspArg: 3.228 ± 0.577
3.075AspSer: 3.075 ± 0.429
3.536AspThr: 3.536 ± 0.54
5.304AspVal: 5.304 ± 0.554
1.614AspTrp: 1.614 ± 0.322
2.613AspTyr: 2.613 ± 0.411
0.0AspXaa: 0.0 ± 0.0
Glu
9.147GluAla: 9.147 ± 0.819
0.077GluCys: 0.077 ± 0.072
3.997GluAsp: 3.997 ± 0.483
4.766GluGlu: 4.766 ± 0.735
2.767GluPhe: 2.767 ± 0.325
4.228GluGly: 4.228 ± 0.455
1.307GluHis: 1.307 ± 0.29
0.999GluIle: 0.999 ± 0.285
1.998GluLys: 1.998 ± 0.37
7.686GluLeu: 7.686 ± 0.7
1.153GluMet: 1.153 ± 0.278
1.614GluAsn: 1.614 ± 0.44
3.075GluPro: 3.075 ± 0.444
2.152GluGln: 2.152 ± 0.367
2.998GluArg: 2.998 ± 0.598
2.383GluSer: 2.383 ± 0.617
3.382GluThr: 3.382 ± 0.443
5.457GluVal: 5.457 ± 0.693
1.537GluTrp: 1.537 ± 0.349
1.46GluTyr: 1.46 ± 0.32
0.0GluXaa: 0.0 ± 0.0
Phe
2.69PheAla: 2.69 ± 0.502
0.307PheCys: 0.307 ± 0.144
1.614PheAsp: 1.614 ± 0.334
2.46PheGlu: 2.46 ± 0.396
0.538PhePhe: 0.538 ± 0.236
2.46PheGly: 2.46 ± 0.416
0.231PheHis: 0.231 ± 0.122
1.384PheIle: 1.384 ± 0.361
1.768PheLys: 1.768 ± 0.357
2.46PheLeu: 2.46 ± 0.442
0.769PheMet: 0.769 ± 0.225
1.537PheAsn: 1.537 ± 0.297
1.076PhePro: 1.076 ± 0.273
1.307PheGln: 1.307 ± 0.268
3.228PheArg: 3.228 ± 0.366
1.768PheSer: 1.768 ± 0.288
2.613PheThr: 2.613 ± 0.403
2.537PheVal: 2.537 ± 0.429
0.692PheTrp: 0.692 ± 0.264
0.692PheTyr: 0.692 ± 0.188
0.0PheXaa: 0.0 ± 0.0
Gly
9.147GlyAla: 9.147 ± 0.991
0.384GlyCys: 0.384 ± 0.158
6.457GlyAsp: 6.457 ± 0.61
4.074GlyGlu: 4.074 ± 0.563
2.844GlyPhe: 2.844 ± 0.531
6.764GlyGly: 6.764 ± 1.004
1.614GlyHis: 1.614 ± 0.338
3.382GlyIle: 3.382 ± 0.543
3.843GlyLys: 3.843 ± 0.577
7.148GlyLeu: 7.148 ± 0.647
2.613GlyMet: 2.613 ± 0.546
2.767GlyAsn: 2.767 ± 0.62
3.766GlyPro: 3.766 ± 0.661
4.228GlyGln: 4.228 ± 0.693
4.996GlyArg: 4.996 ± 0.495
4.919GlySer: 4.919 ± 0.691
6.61GlyThr: 6.61 ± 0.948
8.148GlyVal: 8.148 ± 0.995
2.075GlyTrp: 2.075 ± 0.43
3.075GlyTyr: 3.075 ± 0.485
0.0GlyXaa: 0.0 ± 0.0
His
2.537HisAla: 2.537 ± 0.452
0.077HisCys: 0.077 ± 0.08
0.846HisAsp: 0.846 ± 0.235
0.999HisGlu: 0.999 ± 0.253
1.076HisPhe: 1.076 ± 0.318
1.307HisGly: 1.307 ± 0.339
0.384HisHis: 0.384 ± 0.15
0.307HisIle: 0.307 ± 0.156
0.769HisLys: 0.769 ± 0.221
2.383HisLeu: 2.383 ± 0.428
0.231HisMet: 0.231 ± 0.119
0.307HisAsn: 0.307 ± 0.131
1.153HisPro: 1.153 ± 0.285
0.538HisGln: 0.538 ± 0.165
0.769HisArg: 0.769 ± 0.253
0.461HisSer: 0.461 ± 0.202
0.769HisThr: 0.769 ± 0.226
1.46HisVal: 1.46 ± 0.337
0.154HisTrp: 0.154 ± 0.091
0.922HisTyr: 0.922 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
4.612IleAla: 4.612 ± 0.552
0.0IleCys: 0.0 ± 0.0
2.69IleAsp: 2.69 ± 0.411
2.69IleGlu: 2.69 ± 0.385
0.615IlePhe: 0.615 ± 0.221
3.228IleGly: 3.228 ± 0.534
1.076IleHis: 1.076 ± 0.346
1.691IleIle: 1.691 ± 0.353
2.075IleLys: 2.075 ± 0.396
2.998IleLeu: 2.998 ± 0.555
0.384IleMet: 0.384 ± 0.165
1.076IleAsn: 1.076 ± 0.402
1.384IlePro: 1.384 ± 0.36
0.999IleGln: 0.999 ± 0.357
2.46IleArg: 2.46 ± 0.426
2.075IleSer: 2.075 ± 0.418
2.383IleThr: 2.383 ± 0.672
3.228IleVal: 3.228 ± 0.593
0.615IleTrp: 0.615 ± 0.265
0.922IleTyr: 0.922 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
5.842LysAla: 5.842 ± 0.904
0.615LysCys: 0.615 ± 0.265
2.075LysAsp: 2.075 ± 0.485
2.306LysGlu: 2.306 ± 0.447
1.153LysPhe: 1.153 ± 0.366
3.305LysGly: 3.305 ± 0.45
0.538LysHis: 0.538 ± 0.187
1.153LysIle: 1.153 ± 0.31
1.46LysLys: 1.46 ± 0.39
3.843LysLeu: 3.843 ± 0.622
0.999LysMet: 0.999 ± 0.252
0.846LysAsn: 0.846 ± 0.214
2.767LysPro: 2.767 ± 0.446
1.768LysGln: 1.768 ± 0.349
3.689LysArg: 3.689 ± 0.506
2.306LysSer: 2.306 ± 0.441
2.69LysThr: 2.69 ± 0.481
2.998LysVal: 2.998 ± 0.424
0.384LysTrp: 0.384 ± 0.154
0.615LysTyr: 0.615 ± 0.221
0.0LysXaa: 0.0 ± 0.0
Leu
10.069LeuAla: 10.069 ± 1.06
0.307LeuCys: 0.307 ± 0.147
5.38LeuAsp: 5.38 ± 0.706
5.688LeuGlu: 5.688 ± 0.668
2.537LeuPhe: 2.537 ± 0.479
7.917LeuGly: 7.917 ± 0.674
1.691LeuHis: 1.691 ± 0.385
3.151LeuIle: 3.151 ± 0.461
3.689LeuLys: 3.689 ± 0.612
6.687LeuLeu: 6.687 ± 0.714
2.46LeuMet: 2.46 ± 0.379
3.151LeuAsn: 3.151 ± 0.656
4.458LeuPro: 4.458 ± 0.571
3.613LeuGln: 3.613 ± 0.821
5.765LeuArg: 5.765 ± 0.581
4.689LeuSer: 4.689 ± 0.511
5.611LeuThr: 5.611 ± 0.984
8.609LeuVal: 8.609 ± 0.835
1.537LeuTrp: 1.537 ± 0.304
1.768LeuTyr: 1.768 ± 0.309
0.0LeuXaa: 0.0 ± 0.0
Met
3.459MetAla: 3.459 ± 0.512
0.0MetCys: 0.0 ± 0.0
1.537MetAsp: 1.537 ± 0.324
1.307MetGlu: 1.307 ± 0.303
0.461MetPhe: 0.461 ± 0.175
0.999MetGly: 0.999 ± 0.259
0.077MetHis: 0.077 ± 0.083
0.615MetIle: 0.615 ± 0.192
1.23MetLys: 1.23 ± 0.313
1.537MetLeu: 1.537 ± 0.309
0.231MetMet: 0.231 ± 0.117
0.922MetAsn: 0.922 ± 0.264
0.846MetPro: 0.846 ± 0.227
0.461MetGln: 0.461 ± 0.201
0.922MetArg: 0.922 ± 0.253
1.384MetSer: 1.384 ± 0.344
1.922MetThr: 1.922 ± 0.324
1.153MetVal: 1.153 ± 0.283
0.461MetTrp: 0.461 ± 0.216
0.384MetTyr: 0.384 ± 0.139
0.0MetXaa: 0.0 ± 0.0
Asn
2.844AsnAla: 2.844 ± 0.44
0.231AsnCys: 0.231 ± 0.123
1.691AsnAsp: 1.691 ± 0.281
0.922AsnGlu: 0.922 ± 0.268
0.922AsnPhe: 0.922 ± 0.256
4.074AsnGly: 4.074 ± 0.779
0.615AsnHis: 0.615 ± 0.211
0.769AsnIle: 0.769 ± 0.25
1.153AsnLys: 1.153 ± 0.343
2.306AsnLeu: 2.306 ± 0.377
0.538AsnMet: 0.538 ± 0.217
0.769AsnAsn: 0.769 ± 0.256
1.998AsnPro: 1.998 ± 0.32
1.614AsnGln: 1.614 ± 0.368
2.152AsnArg: 2.152 ± 0.626
1.845AsnSer: 1.845 ± 0.392
1.537AsnThr: 1.537 ± 0.347
2.152AsnVal: 2.152 ± 0.368
0.692AsnTrp: 0.692 ± 0.227
0.769AsnTyr: 0.769 ± 0.245
0.0AsnXaa: 0.0 ± 0.0
Pro
7.917ProAla: 7.917 ± 0.709
0.307ProCys: 0.307 ± 0.153
3.382ProAsp: 3.382 ± 0.591
4.304ProGlu: 4.304 ± 0.606
1.845ProPhe: 1.845 ± 0.337
5.457ProGly: 5.457 ± 0.629
1.076ProHis: 1.076 ± 0.34
2.306ProIle: 2.306 ± 0.494
2.075ProLys: 2.075 ± 0.509
3.997ProLeu: 3.997 ± 0.514
0.846ProMet: 0.846 ± 0.233
1.768ProAsn: 1.768 ± 0.389
1.537ProPro: 1.537 ± 0.371
1.845ProGln: 1.845 ± 0.45
2.152ProArg: 2.152 ± 0.397
3.459ProSer: 3.459 ± 0.603
2.383ProThr: 2.383 ± 0.376
3.613ProVal: 3.613 ± 0.542
1.23ProTrp: 1.23 ± 0.329
0.615ProTyr: 0.615 ± 0.191
0.0ProXaa: 0.0 ± 0.0
Gln
4.381GlnAla: 4.381 ± 0.535
0.154GlnCys: 0.154 ± 0.142
1.998GlnAsp: 1.998 ± 0.385
2.306GlnGlu: 2.306 ± 0.438
1.307GlnPhe: 1.307 ± 0.393
2.537GlnGly: 2.537 ± 0.332
0.384GlnHis: 0.384 ± 0.192
0.846GlnIle: 0.846 ± 0.204
1.23GlnLys: 1.23 ± 0.279
3.766GlnLeu: 3.766 ± 0.626
0.538GlnMet: 0.538 ± 0.174
1.23GlnAsn: 1.23 ± 0.293
1.845GlnPro: 1.845 ± 0.472
1.46GlnGln: 1.46 ± 0.296
1.998GlnArg: 1.998 ± 0.389
1.614GlnSer: 1.614 ± 0.311
2.767GlnThr: 2.767 ± 0.502
3.92GlnVal: 3.92 ± 0.527
0.692GlnTrp: 0.692 ± 0.267
0.999GlnTyr: 0.999 ± 0.227
0.0GlnXaa: 0.0 ± 0.0
Arg
6.457ArgAla: 6.457 ± 0.863
0.231ArgCys: 0.231 ± 0.185
4.151ArgAsp: 4.151 ± 0.572
4.535ArgGlu: 4.535 ± 0.759
2.46ArgPhe: 2.46 ± 0.433
5.304ArgGly: 5.304 ± 0.671
0.922ArgHis: 0.922 ± 0.284
2.46ArgIle: 2.46 ± 0.49
2.383ArgLys: 2.383 ± 0.339
6.38ArgLeu: 6.38 ± 0.759
0.999ArgMet: 0.999 ± 0.293
1.691ArgAsn: 1.691 ± 0.306
3.075ArgPro: 3.075 ± 0.523
1.998ArgGln: 1.998 ± 0.32
5.534ArgArg: 5.534 ± 0.851
3.459ArgSer: 3.459 ± 0.465
3.92ArgThr: 3.92 ± 0.494
5.38ArgVal: 5.38 ± 0.764
1.384ArgTrp: 1.384 ± 0.402
1.153ArgTyr: 1.153 ± 0.284
0.0ArgXaa: 0.0 ± 0.0
Ser
6.38SerAla: 6.38 ± 0.596
0.231SerCys: 0.231 ± 0.127
3.382SerAsp: 3.382 ± 0.375
2.537SerGlu: 2.537 ± 0.398
2.229SerPhe: 2.229 ± 0.424
6.072SerGly: 6.072 ± 0.783
0.615SerHis: 0.615 ± 0.208
2.537SerIle: 2.537 ± 0.474
2.152SerLys: 2.152 ± 0.35
4.612SerLeu: 4.612 ± 0.662
1.384SerMet: 1.384 ± 0.297
1.691SerAsn: 1.691 ± 0.36
3.075SerPro: 3.075 ± 0.462
1.537SerGln: 1.537 ± 0.357
3.151SerArg: 3.151 ± 0.345
2.844SerSer: 2.844 ± 0.506
3.689SerThr: 3.689 ± 0.526
4.458SerVal: 4.458 ± 0.64
1.23SerTrp: 1.23 ± 0.283
0.846SerTyr: 0.846 ± 0.171
0.0SerXaa: 0.0 ± 0.0
Thr
7.456ThrAla: 7.456 ± 0.9
0.077ThrCys: 0.077 ± 0.081
3.613ThrAsp: 3.613 ± 0.502
3.536ThrGlu: 3.536 ± 0.491
1.998ThrPhe: 1.998 ± 0.388
5.611ThrGly: 5.611 ± 0.796
0.922ThrHis: 0.922 ± 0.313
3.766ThrIle: 3.766 ± 0.64
2.844ThrLys: 2.844 ± 0.485
4.996ThrLeu: 4.996 ± 0.682
0.615ThrMet: 0.615 ± 0.251
1.23ThrAsn: 1.23 ± 0.315
4.151ThrPro: 4.151 ± 0.503
0.769ThrGln: 0.769 ± 0.227
3.536ThrArg: 3.536 ± 0.545
4.151ThrSer: 4.151 ± 0.538
3.843ThrThr: 3.843 ± 0.662
6.149ThrVal: 6.149 ± 0.946
1.46ThrTrp: 1.46 ± 0.337
1.768ThrTyr: 1.768 ± 0.455
0.0ThrXaa: 0.0 ± 0.0
Val
9.992ValAla: 9.992 ± 0.71
0.154ValCys: 0.154 ± 0.103
5.15ValAsp: 5.15 ± 0.693
5.765ValGlu: 5.765 ± 0.576
2.69ValPhe: 2.69 ± 0.5
5.765ValGly: 5.765 ± 0.804
1.922ValHis: 1.922 ± 0.348
3.382ValIle: 3.382 ± 0.402
3.151ValLys: 3.151 ± 0.563
6.072ValLeu: 6.072 ± 0.738
1.23ValMet: 1.23 ± 0.247
2.383ValAsn: 2.383 ± 0.397
4.996ValPro: 4.996 ± 0.67
2.767ValGln: 2.767 ± 0.401
6.072ValArg: 6.072 ± 0.682
5.073ValSer: 5.073 ± 0.588
5.688ValThr: 5.688 ± 0.742
8.301ValVal: 8.301 ± 0.856
1.922ValTrp: 1.922 ± 0.454
2.306ValTyr: 2.306 ± 0.49
0.0ValXaa: 0.0 ± 0.0
Trp
2.152TrpAla: 2.152 ± 0.427
0.077TrpCys: 0.077 ± 0.081
0.846TrpAsp: 0.846 ± 0.282
1.307TrpGlu: 1.307 ± 0.347
0.384TrpPhe: 0.384 ± 0.207
1.691TrpGly: 1.691 ± 0.305
0.461TrpHis: 0.461 ± 0.172
0.999TrpIle: 0.999 ± 0.332
0.769TrpLys: 0.769 ± 0.202
2.306TrpLeu: 2.306 ± 0.372
0.692TrpMet: 0.692 ± 0.214
0.999TrpAsn: 0.999 ± 0.289
0.461TrpPro: 0.461 ± 0.193
1.153TrpGln: 1.153 ± 0.417
1.076TrpArg: 1.076 ± 0.409
1.845TrpSer: 1.845 ± 0.37
1.46TrpThr: 1.46 ± 0.382
1.614TrpVal: 1.614 ± 0.294
0.154TrpTrp: 0.154 ± 0.112
0.538TrpTyr: 0.538 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.152TyrAla: 2.152 ± 0.5
0.154TyrCys: 0.154 ± 0.11
2.229TyrAsp: 2.229 ± 0.372
1.768TyrGlu: 1.768 ± 0.308
0.538TyrPhe: 0.538 ± 0.19
2.152TyrGly: 2.152 ± 0.358
0.615TyrHis: 0.615 ± 0.189
0.922TyrIle: 0.922 ± 0.292
0.615TyrLys: 0.615 ± 0.199
1.307TyrLeu: 1.307 ± 0.328
0.846TyrMet: 0.846 ± 0.201
0.615TyrAsn: 0.615 ± 0.206
1.46TyrPro: 1.46 ± 0.293
0.692TyrGln: 0.692 ± 0.24
2.383TyrArg: 2.383 ± 0.504
1.922TyrSer: 1.922 ± 0.466
1.537TyrThr: 1.537 ± 0.332
1.691TyrVal: 1.691 ± 0.318
0.538TyrTrp: 0.538 ± 0.213
0.307TyrTyr: 0.307 ± 0.154
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13011 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski