Amino acid dipepetide frequency for Microbacterium phage Ariadne

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.628AlaAla: 17.628 ± 1.526
0.867AlaCys: 0.867 ± 0.235
8.034AlaAsp: 8.034 ± 0.953
7.282AlaGlu: 7.282 ± 0.807
3.526AlaPhe: 3.526 ± 0.467
9.479AlaGly: 9.479 ± 0.853
2.485AlaHis: 2.485 ± 0.371
5.375AlaIle: 5.375 ± 0.613
3.93AlaLys: 3.93 ± 0.453
10.75AlaLeu: 10.75 ± 1.213
3.237AlaMet: 3.237 ± 0.44
2.948AlaAsn: 2.948 ± 0.537
5.78AlaPro: 5.78 ± 0.635
4.277AlaGln: 4.277 ± 0.433
8.149AlaArg: 8.149 ± 0.74
6.415AlaSer: 6.415 ± 0.503
7.398AlaThr: 7.398 ± 0.64
6.531AlaVal: 6.531 ± 0.78
2.601AlaTrp: 2.601 ± 0.449
2.312AlaTyr: 2.312 ± 0.3
0.0AlaXaa: 0.0 ± 0.0
Cys
0.694CysAla: 0.694 ± 0.226
0.289CysCys: 0.289 ± 0.134
0.983CysAsp: 0.983 ± 0.238
0.809CysGlu: 0.809 ± 0.25
0.173CysPhe: 0.173 ± 0.106
1.387CysGly: 1.387 ± 0.343
0.231CysHis: 0.231 ± 0.12
0.289CysIle: 0.289 ± 0.117
0.058CysLys: 0.058 ± 0.051
0.52CysLeu: 0.52 ± 0.164
0.116CysMet: 0.116 ± 0.084
0.289CysAsn: 0.289 ± 0.114
1.156CysPro: 1.156 ± 0.228
0.173CysGln: 0.173 ± 0.088
0.52CysArg: 0.52 ± 0.182
0.347CysSer: 0.347 ± 0.155
0.462CysThr: 0.462 ± 0.175
0.867CysVal: 0.867 ± 0.213
0.116CysTrp: 0.116 ± 0.076
0.231CysTyr: 0.231 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
8.092AspAla: 8.092 ± 0.827
0.694AspCys: 0.694 ± 0.216
5.317AspAsp: 5.317 ± 0.81
4.971AspGlu: 4.971 ± 0.796
1.618AspPhe: 1.618 ± 0.321
5.664AspGly: 5.664 ± 0.635
1.445AspHis: 1.445 ± 0.293
2.37AspIle: 2.37 ± 0.353
1.098AspLys: 1.098 ± 0.233
5.722AspLeu: 5.722 ± 0.642
1.561AspMet: 1.561 ± 0.304
2.312AspAsn: 2.312 ± 0.319
3.757AspPro: 3.757 ± 0.52
2.312AspGln: 2.312 ± 0.385
4.797AspArg: 4.797 ± 0.643
3.121AspSer: 3.121 ± 0.426
3.237AspThr: 3.237 ± 0.457
4.393AspVal: 4.393 ± 0.562
1.618AspTrp: 1.618 ± 0.328
1.618AspTyr: 1.618 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
8.207GluAla: 8.207 ± 0.792
0.405GluCys: 0.405 ± 0.149
4.161GluAsp: 4.161 ± 0.564
4.797GluGlu: 4.797 ± 0.625
1.445GluPhe: 1.445 ± 0.306
4.971GluGly: 4.971 ± 0.539
1.965GluHis: 1.965 ± 0.346
1.272GluIle: 1.272 ± 0.292
1.734GluLys: 1.734 ± 0.305
3.237GluLeu: 3.237 ± 0.404
1.734GluMet: 1.734 ± 0.238
1.792GluAsn: 1.792 ± 0.357
5.317GluPro: 5.317 ± 0.896
2.37GluGln: 2.37 ± 0.321
5.144GluArg: 5.144 ± 0.547
3.179GluSer: 3.179 ± 0.419
4.161GluThr: 4.161 ± 0.56
5.491GluVal: 5.491 ± 0.608
1.734GluTrp: 1.734 ± 0.325
1.792GluTyr: 1.792 ± 0.423
0.0GluXaa: 0.0 ± 0.0
Phe
2.254PheAla: 2.254 ± 0.387
0.289PheCys: 0.289 ± 0.158
2.196PheAsp: 2.196 ± 0.395
1.965PheGlu: 1.965 ± 0.303
0.578PhePhe: 0.578 ± 0.177
2.37PheGly: 2.37 ± 0.367
0.405PheHis: 0.405 ± 0.146
1.214PheIle: 1.214 ± 0.254
0.636PheLys: 0.636 ± 0.206
1.965PheLeu: 1.965 ± 0.345
0.462PheMet: 0.462 ± 0.152
0.52PheAsn: 0.52 ± 0.173
0.983PhePro: 0.983 ± 0.223
0.347PheGln: 0.347 ± 0.189
1.676PheArg: 1.676 ± 0.236
1.156PheSer: 1.156 ± 0.32
2.427PheThr: 2.427 ± 0.309
1.849PheVal: 1.849 ± 0.376
0.462PheTrp: 0.462 ± 0.193
0.52PheTyr: 0.52 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
8.034GlyAla: 8.034 ± 1.035
0.694GlyCys: 0.694 ± 0.227
5.144GlyAsp: 5.144 ± 0.507
5.433GlyGlu: 5.433 ± 0.579
2.716GlyPhe: 2.716 ± 0.415
8.554GlyGly: 8.554 ± 0.852
1.907GlyHis: 1.907 ± 0.373
4.161GlyIle: 4.161 ± 0.526
2.601GlyLys: 2.601 ± 0.404
7.167GlyLeu: 7.167 ± 0.695
2.659GlyMet: 2.659 ± 0.415
2.196GlyAsn: 2.196 ± 0.471
3.641GlyPro: 3.641 ± 0.397
3.526GlyGln: 3.526 ± 0.496
4.971GlyArg: 4.971 ± 0.641
5.895GlySer: 5.895 ± 0.791
6.415GlyThr: 6.415 ± 0.632
6.415GlyVal: 6.415 ± 0.627
2.312GlyTrp: 2.312 ± 0.335
3.237GlyTyr: 3.237 ± 0.36
0.0GlyXaa: 0.0 ± 0.0
His
1.907HisAla: 1.907 ± 0.305
0.231HisCys: 0.231 ± 0.116
1.561HisAsp: 1.561 ± 0.282
1.387HisGlu: 1.387 ± 0.293
0.52HisPhe: 0.52 ± 0.178
1.329HisGly: 1.329 ± 0.27
0.462HisHis: 0.462 ± 0.195
0.578HisIle: 0.578 ± 0.186
0.462HisLys: 0.462 ± 0.176
2.138HisLeu: 2.138 ± 0.311
0.289HisMet: 0.289 ± 0.121
0.173HisAsn: 0.173 ± 0.098
2.254HisPro: 2.254 ± 0.404
0.231HisGln: 0.231 ± 0.149
1.561HisArg: 1.561 ± 0.306
1.098HisSer: 1.098 ± 0.226
1.214HisThr: 1.214 ± 0.259
1.329HisVal: 1.329 ± 0.351
0.347HisTrp: 0.347 ± 0.145
0.462HisTyr: 0.462 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
4.508IleAla: 4.508 ± 0.485
0.289IleCys: 0.289 ± 0.161
3.757IleAsp: 3.757 ± 0.45
4.046IleGlu: 4.046 ± 0.454
0.694IlePhe: 0.694 ± 0.199
3.526IleGly: 3.526 ± 0.422
0.636IleHis: 0.636 ± 0.192
2.312IleIle: 2.312 ± 0.441
0.983IleLys: 0.983 ± 0.217
2.601IleLeu: 2.601 ± 0.422
0.867IleMet: 0.867 ± 0.216
1.214IleAsn: 1.214 ± 0.282
3.583IlePro: 3.583 ± 0.627
1.272IleGln: 1.272 ± 0.296
2.196IleArg: 2.196 ± 0.394
2.138IleSer: 2.138 ± 0.439
4.913IleThr: 4.913 ± 0.674
3.815IleVal: 3.815 ± 0.473
0.694IleTrp: 0.694 ± 0.18
0.809IleTyr: 0.809 ± 0.215
0.0IleXaa: 0.0 ± 0.0
Lys
3.468LysAla: 3.468 ± 0.515
0.289LysCys: 0.289 ± 0.117
1.214LysAsp: 1.214 ± 0.248
0.347LysGlu: 0.347 ± 0.124
0.867LysPhe: 0.867 ± 0.248
2.485LysGly: 2.485 ± 0.363
0.751LysHis: 0.751 ± 0.212
0.694LysIle: 0.694 ± 0.241
0.52LysLys: 0.52 ± 0.193
0.867LysLeu: 0.867 ± 0.275
0.347LysMet: 0.347 ± 0.142
0.289LysAsn: 0.289 ± 0.132
1.849LysPro: 1.849 ± 0.391
0.809LysGln: 0.809 ± 0.225
2.601LysArg: 2.601 ± 0.442
1.734LysSer: 1.734 ± 0.28
0.925LysThr: 0.925 ± 0.248
3.121LysVal: 3.121 ± 0.419
0.347LysTrp: 0.347 ± 0.129
0.694LysTyr: 0.694 ± 0.161
0.0LysXaa: 0.0 ± 0.0
Leu
9.016LeuAla: 9.016 ± 0.718
0.347LeuCys: 0.347 ± 0.135
4.855LeuAsp: 4.855 ± 0.685
4.393LeuGlu: 4.393 ± 0.514
1.676LeuPhe: 1.676 ± 0.381
7.456LeuGly: 7.456 ± 0.903
1.156LeuHis: 1.156 ± 0.254
4.508LeuIle: 4.508 ± 0.707
0.925LeuLys: 0.925 ± 0.208
5.837LeuLeu: 5.837 ± 0.596
1.503LeuMet: 1.503 ± 0.298
1.503LeuAsn: 1.503 ± 0.282
4.566LeuPro: 4.566 ± 0.45
2.196LeuGln: 2.196 ± 0.321
5.317LeuArg: 5.317 ± 0.648
5.144LeuSer: 5.144 ± 0.677
6.993LeuThr: 6.993 ± 0.599
6.126LeuVal: 6.126 ± 0.59
1.098LeuTrp: 1.098 ± 0.222
1.387LeuTyr: 1.387 ± 0.262
0.0LeuXaa: 0.0 ± 0.0
Met
3.294MetAla: 3.294 ± 0.471
0.173MetCys: 0.173 ± 0.093
1.618MetAsp: 1.618 ± 0.318
0.578MetGlu: 0.578 ± 0.169
0.462MetPhe: 0.462 ± 0.156
1.907MetGly: 1.907 ± 0.368
0.405MetHis: 0.405 ± 0.143
1.561MetIle: 1.561 ± 0.386
0.462MetLys: 0.462 ± 0.161
1.503MetLeu: 1.503 ± 0.272
0.462MetMet: 0.462 ± 0.135
0.694MetAsn: 0.694 ± 0.153
1.214MetPro: 1.214 ± 0.33
0.809MetGln: 0.809 ± 0.218
1.387MetArg: 1.387 ± 0.226
3.294MetSer: 3.294 ± 0.42
2.659MetThr: 2.659 ± 0.407
1.445MetVal: 1.445 ± 0.311
0.289MetTrp: 0.289 ± 0.119
0.751MetTyr: 0.751 ± 0.171
0.0MetXaa: 0.0 ± 0.0
Asn
3.699AsnAla: 3.699 ± 0.503
0.347AsnCys: 0.347 ± 0.121
1.156AsnAsp: 1.156 ± 0.251
1.156AsnGlu: 1.156 ± 0.283
0.347AsnPhe: 0.347 ± 0.173
3.988AsnGly: 3.988 ± 0.651
0.347AsnHis: 0.347 ± 0.156
1.098AsnIle: 1.098 ± 0.274
0.289AsnLys: 0.289 ± 0.126
2.254AsnLeu: 2.254 ± 0.322
0.405AsnMet: 0.405 ± 0.144
0.289AsnAsn: 0.289 ± 0.134
2.427AsnPro: 2.427 ± 0.596
0.751AsnGln: 0.751 ± 0.187
2.138AsnArg: 2.138 ± 0.378
1.734AsnSer: 1.734 ± 0.324
1.734AsnThr: 1.734 ± 0.272
1.734AsnVal: 1.734 ± 0.397
0.347AsnTrp: 0.347 ± 0.128
0.578AsnTyr: 0.578 ± 0.147
0.0AsnXaa: 0.0 ± 0.0
Pro
7.282ProAla: 7.282 ± 0.761
0.867ProCys: 0.867 ± 0.214
4.682ProAsp: 4.682 ± 0.706
5.26ProGlu: 5.26 ± 0.636
1.907ProPhe: 1.907 ± 0.309
5.722ProGly: 5.722 ± 0.563
0.983ProHis: 0.983 ± 0.217
2.196ProIle: 2.196 ± 0.303
1.618ProLys: 1.618 ± 0.287
3.641ProLeu: 3.641 ± 0.47
1.387ProMet: 1.387 ± 0.263
2.601ProAsn: 2.601 ± 0.433
4.046ProPro: 4.046 ± 0.49
2.196ProGln: 2.196 ± 0.556
3.063ProArg: 3.063 ± 0.482
3.179ProSer: 3.179 ± 0.426
4.971ProThr: 4.971 ± 0.679
5.028ProVal: 5.028 ± 0.578
0.867ProTrp: 0.867 ± 0.226
1.445ProTyr: 1.445 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
3.988GlnAla: 3.988 ± 0.518
0.462GlnCys: 0.462 ± 0.175
1.618GlnAsp: 1.618 ± 0.304
1.618GlnGlu: 1.618 ± 0.309
1.214GlnPhe: 1.214 ± 0.28
2.196GlnGly: 2.196 ± 0.37
0.983GlnHis: 0.983 ± 0.285
1.849GlnIle: 1.849 ± 0.351
0.751GlnLys: 0.751 ± 0.21
1.098GlnLeu: 1.098 ± 0.357
1.387GlnMet: 1.387 ± 0.302
1.387GlnAsn: 1.387 ± 0.384
2.601GlnPro: 2.601 ± 0.416
1.445GlnGln: 1.445 ± 0.31
2.774GlnArg: 2.774 ± 0.459
1.907GlnSer: 1.907 ± 0.443
2.196GlnThr: 2.196 ± 0.445
2.312GlnVal: 2.312 ± 0.441
0.809GlnTrp: 0.809 ± 0.233
0.751GlnTyr: 0.751 ± 0.212
0.0GlnXaa: 0.0 ± 0.0
Arg
7.918ArgAla: 7.918 ± 0.678
0.925ArgCys: 0.925 ± 0.231
4.393ArgAsp: 4.393 ± 0.634
4.566ArgGlu: 4.566 ± 0.551
1.561ArgPhe: 1.561 ± 0.314
5.895ArgGly: 5.895 ± 0.579
1.387ArgHis: 1.387 ± 0.337
3.005ArgIle: 3.005 ± 0.518
1.792ArgLys: 1.792 ± 0.322
6.704ArgLeu: 6.704 ± 0.605
2.196ArgMet: 2.196 ± 0.309
1.561ArgAsn: 1.561 ± 0.299
3.583ArgPro: 3.583 ± 0.569
2.716ArgGln: 2.716 ± 0.442
7.225ArgArg: 7.225 ± 0.875
2.832ArgSer: 2.832 ± 0.39
2.89ArgThr: 2.89 ± 0.457
5.202ArgVal: 5.202 ± 0.467
1.792ArgTrp: 1.792 ± 0.315
2.196ArgTyr: 2.196 ± 0.368
0.0ArgXaa: 0.0 ± 0.0
Ser
7.225SerAla: 7.225 ± 0.908
0.405SerCys: 0.405 ± 0.145
3.93SerAsp: 3.93 ± 0.503
3.872SerGlu: 3.872 ± 0.577
0.983SerPhe: 0.983 ± 0.245
5.086SerGly: 5.086 ± 0.666
0.867SerHis: 0.867 ± 0.246
2.89SerIle: 2.89 ± 0.4
1.098SerLys: 1.098 ± 0.275
4.566SerLeu: 4.566 ± 0.57
2.023SerMet: 2.023 ± 0.352
1.561SerAsn: 1.561 ± 0.301
3.468SerPro: 3.468 ± 0.404
2.023SerGln: 2.023 ± 0.306
3.063SerArg: 3.063 ± 0.357
2.543SerSer: 2.543 ± 0.449
4.855SerThr: 4.855 ± 0.519
3.468SerVal: 3.468 ± 0.39
1.156SerTrp: 1.156 ± 0.274
1.445SerTyr: 1.445 ± 0.27
0.0SerXaa: 0.0 ± 0.0
Thr
8.67ThrAla: 8.67 ± 0.889
0.751ThrCys: 0.751 ± 0.217
4.45ThrAsp: 4.45 ± 0.608
3.93ThrGlu: 3.93 ± 0.573
1.272ThrPhe: 1.272 ± 0.235
6.415ThrGly: 6.415 ± 0.768
1.156ThrHis: 1.156 ± 0.265
3.641ThrIle: 3.641 ± 0.551
1.792ThrLys: 1.792 ± 0.439
6.589ThrLeu: 6.589 ± 0.629
1.503ThrMet: 1.503 ± 0.321
2.081ThrAsn: 2.081 ± 0.376
5.491ThrPro: 5.491 ± 0.517
1.849ThrGln: 1.849 ± 0.342
4.855ThrArg: 4.855 ± 0.57
3.699ThrSer: 3.699 ± 0.525
4.393ThrThr: 4.393 ± 0.559
5.375ThrVal: 5.375 ± 0.523
0.983ThrTrp: 0.983 ± 0.248
1.503ThrTyr: 1.503 ± 0.327
0.0ThrXaa: 0.0 ± 0.0
Val
8.381ValAla: 8.381 ± 0.746
0.809ValCys: 0.809 ± 0.229
3.699ValAsp: 3.699 ± 0.447
5.144ValGlu: 5.144 ± 0.544
1.561ValPhe: 1.561 ± 0.262
5.722ValGly: 5.722 ± 0.632
1.329ValHis: 1.329 ± 0.275
4.219ValIle: 4.219 ± 0.555
2.312ValLys: 2.312 ± 0.333
5.202ValLeu: 5.202 ± 0.468
1.734ValMet: 1.734 ± 0.345
2.138ValAsn: 2.138 ± 0.343
5.144ValPro: 5.144 ± 0.611
2.023ValGln: 2.023 ± 0.344
5.317ValArg: 5.317 ± 0.614
4.45ValSer: 4.45 ± 0.579
6.069ValThr: 6.069 ± 0.455
5.433ValVal: 5.433 ± 0.702
1.734ValTrp: 1.734 ± 0.297
1.272ValTyr: 1.272 ± 0.226
0.0ValXaa: 0.0 ± 0.0
Trp
2.138TrpAla: 2.138 ± 0.338
0.347TrpCys: 0.347 ± 0.152
1.098TrpAsp: 1.098 ± 0.257
1.618TrpGlu: 1.618 ± 0.28
0.751TrpPhe: 0.751 ± 0.212
1.387TrpGly: 1.387 ± 0.303
0.462TrpHis: 0.462 ± 0.149
0.867TrpIle: 0.867 ± 0.255
0.694TrpLys: 0.694 ± 0.217
1.734TrpLeu: 1.734 ± 0.369
0.289TrpMet: 0.289 ± 0.122
0.867TrpAsn: 0.867 ± 0.229
0.867TrpPro: 0.867 ± 0.273
1.04TrpGln: 1.04 ± 0.188
1.387TrpArg: 1.387 ± 0.258
1.272TrpSer: 1.272 ± 0.31
0.925TrpThr: 0.925 ± 0.225
1.445TrpVal: 1.445 ± 0.289
0.867TrpTrp: 0.867 ± 0.273
0.694TrpTyr: 0.694 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.601TyrAla: 2.601 ± 0.381
0.231TyrCys: 0.231 ± 0.139
1.849TyrAsp: 1.849 ± 0.329
1.734TyrGlu: 1.734 ± 0.313
0.405TyrPhe: 0.405 ± 0.17
2.196TyrGly: 2.196 ± 0.361
0.173TyrHis: 0.173 ± 0.137
0.751TyrIle: 0.751 ± 0.208
0.462TyrLys: 0.462 ± 0.162
1.907TyrLeu: 1.907 ± 0.364
0.751TyrMet: 0.751 ± 0.223
0.405TyrAsn: 0.405 ± 0.161
1.214TyrPro: 1.214 ± 0.259
0.983TyrGln: 0.983 ± 0.246
2.196TyrArg: 2.196 ± 0.362
1.387TyrSer: 1.387 ± 0.276
1.561TyrThr: 1.561 ± 0.3
2.312TyrVal: 2.312 ± 0.39
0.578TyrTrp: 0.578 ± 0.158
0.636TyrTyr: 0.636 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 100 proteins (17303 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski