Amino acid dipepetide frequency for Panteoa phage Kyle

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.182AlaAla: 10.182 ± 0.822
0.829AlaCys: 0.829 ± 0.204
5.436AlaAsp: 5.436 ± 0.532
7.371AlaGlu: 7.371 ± 0.871
3.778AlaPhe: 3.778 ± 0.398
6.45AlaGly: 6.45 ± 0.608
1.705AlaHis: 1.705 ± 0.336
5.344AlaIle: 5.344 ± 0.479
6.588AlaLys: 6.588 ± 0.942
7.141AlaLeu: 7.141 ± 0.639
1.797AlaMet: 1.797 ± 0.262
4.146AlaAsn: 4.146 ± 0.535
2.718AlaPro: 2.718 ± 0.294
2.902AlaGln: 2.902 ± 0.318
4.146AlaArg: 4.146 ± 0.449
5.436AlaSer: 5.436 ± 0.542
4.883AlaThr: 4.883 ± 0.545
5.436AlaVal: 5.436 ± 0.481
0.921AlaTrp: 0.921 ± 0.224
2.211AlaTyr: 2.211 ± 0.313
0.0AlaXaa: 0.0 ± 0.0
Cys
0.875CysAla: 0.875 ± 0.178
0.046CysCys: 0.046 ± 0.044
0.553CysAsp: 0.553 ± 0.169
0.645CysGlu: 0.645 ± 0.18
0.461CysPhe: 0.461 ± 0.123
0.599CysGly: 0.599 ± 0.166
0.184CysHis: 0.184 ± 0.095
0.553CysIle: 0.553 ± 0.152
0.507CysLys: 0.507 ± 0.154
0.322CysLeu: 0.322 ± 0.117
0.184CysMet: 0.184 ± 0.104
0.415CysAsn: 0.415 ± 0.117
0.645CysPro: 0.645 ± 0.148
0.415CysGln: 0.415 ± 0.124
0.461CysArg: 0.461 ± 0.174
0.645CysSer: 0.645 ± 0.169
0.553CysThr: 0.553 ± 0.144
0.461CysVal: 0.461 ± 0.123
0.092CysTrp: 0.092 ± 0.074
0.507CysTyr: 0.507 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
5.206AspAla: 5.206 ± 0.59
0.599AspCys: 0.599 ± 0.156
3.455AspAsp: 3.455 ± 0.359
4.377AspGlu: 4.377 ± 0.433
2.995AspPhe: 2.995 ± 0.333
5.022AspGly: 5.022 ± 0.473
1.106AspHis: 1.106 ± 0.202
3.271AspIle: 3.271 ± 0.365
2.995AspLys: 2.995 ± 0.367
4.054AspLeu: 4.054 ± 0.389
1.889AspMet: 1.889 ± 0.261
2.81AspAsn: 2.81 ± 0.296
1.981AspPro: 1.981 ± 0.263
2.488AspGln: 2.488 ± 0.334
2.626AspArg: 2.626 ± 0.302
3.501AspSer: 3.501 ± 0.358
3.179AspThr: 3.179 ± 0.309
4.238AspVal: 4.238 ± 0.512
0.829AspTrp: 0.829 ± 0.231
1.935AspTyr: 1.935 ± 0.298
0.0AspXaa: 0.0 ± 0.0
Glu
6.173GluAla: 6.173 ± 0.787
0.507GluCys: 0.507 ± 0.163
3.824GluAsp: 3.824 ± 0.415
5.713GluGlu: 5.713 ± 0.801
2.902GluPhe: 2.902 ± 0.398
3.778GluGly: 3.778 ± 0.427
1.566GluHis: 1.566 ± 0.233
4.192GluIle: 4.192 ± 0.433
3.547GluLys: 3.547 ± 0.437
6.496GluLeu: 6.496 ± 0.557
1.981GluMet: 1.981 ± 0.272
2.534GluAsn: 2.534 ± 0.365
1.843GluPro: 1.843 ± 0.263
3.271GluGln: 3.271 ± 0.397
4.791GluArg: 4.791 ± 0.803
3.501GluSer: 3.501 ± 0.384
3.317GluThr: 3.317 ± 0.347
4.653GluVal: 4.653 ± 0.503
0.967GluTrp: 0.967 ± 0.204
2.442GluTyr: 2.442 ± 0.33
0.0GluXaa: 0.0 ± 0.0
Phe
3.363PheAla: 3.363 ± 0.339
0.184PheCys: 0.184 ± 0.094
3.916PheAsp: 3.916 ± 0.414
2.902PheGlu: 2.902 ± 0.31
1.29PhePhe: 1.29 ± 0.208
3.593PheGly: 3.593 ± 0.434
0.461PheHis: 0.461 ± 0.155
2.396PheIle: 2.396 ± 0.369
2.442PheLys: 2.442 ± 0.333
2.211PheLeu: 2.211 ± 0.386
1.014PheMet: 1.014 ± 0.262
2.81PheAsn: 2.81 ± 0.332
1.152PhePro: 1.152 ± 0.212
1.152PheGln: 1.152 ± 0.267
2.35PheArg: 2.35 ± 0.281
3.087PheSer: 3.087 ± 0.439
2.027PheThr: 2.027 ± 0.359
3.363PheVal: 3.363 ± 0.4
1.152PheTrp: 1.152 ± 0.254
1.428PheTyr: 1.428 ± 0.202
0.0PheXaa: 0.0 ± 0.0
Gly
5.897GlyAla: 5.897 ± 0.624
0.645GlyCys: 0.645 ± 0.198
3.732GlyAsp: 3.732 ± 0.372
4.653GlyGlu: 4.653 ± 0.498
3.455GlyPhe: 3.455 ± 0.428
4.699GlyGly: 4.699 ± 0.579
1.52GlyHis: 1.52 ± 0.228
4.285GlyIle: 4.285 ± 0.485
5.39GlyLys: 5.39 ± 0.554
5.943GlyLeu: 5.943 ± 0.632
2.304GlyMet: 2.304 ± 0.322
3.041GlyAsn: 3.041 ± 0.413
1.797GlyPro: 1.797 ± 0.334
3.041GlyGln: 3.041 ± 0.538
3.686GlyArg: 3.686 ± 0.35
4.192GlySer: 4.192 ± 0.429
4.1GlyThr: 4.1 ± 0.481
5.206GlyVal: 5.206 ± 0.469
1.612GlyTrp: 1.612 ± 0.255
2.396GlyTyr: 2.396 ± 0.362
0.0GlyXaa: 0.0 ± 0.0
His
1.06HisAla: 1.06 ± 0.222
0.322HisCys: 0.322 ± 0.117
1.106HisAsp: 1.106 ± 0.202
1.29HisGlu: 1.29 ± 0.251
0.783HisPhe: 0.783 ± 0.175
1.428HisGly: 1.428 ± 0.261
0.599HisHis: 0.599 ± 0.188
0.967HisIle: 0.967 ± 0.17
1.014HisLys: 1.014 ± 0.19
1.797HisLeu: 1.797 ± 0.281
0.553HisMet: 0.553 ± 0.176
0.415HisAsn: 0.415 ± 0.123
0.967HisPro: 0.967 ± 0.218
0.599HisGln: 0.599 ± 0.15
1.106HisArg: 1.106 ± 0.199
1.566HisSer: 1.566 ± 0.33
0.921HisThr: 0.921 ± 0.196
0.967HisVal: 0.967 ± 0.218
0.23HisTrp: 0.23 ± 0.104
0.599HisTyr: 0.599 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
5.022IleAla: 5.022 ± 0.562
0.553IleCys: 0.553 ± 0.141
4.192IleAsp: 4.192 ± 0.426
4.146IleGlu: 4.146 ± 0.414
1.428IlePhe: 1.428 ± 0.228
4.561IleGly: 4.561 ± 0.42
0.967IleHis: 0.967 ± 0.239
3.87IleIle: 3.87 ± 0.46
4.331IleLys: 4.331 ± 0.481
3.962IleLeu: 3.962 ± 0.359
1.797IleMet: 1.797 ± 0.281
3.547IleAsn: 3.547 ± 0.348
2.626IlePro: 2.626 ± 0.343
2.211IleGln: 2.211 ± 0.304
2.257IleArg: 2.257 ± 0.307
3.363IleSer: 3.363 ± 0.479
3.778IleThr: 3.778 ± 0.395
4.469IleVal: 4.469 ± 0.457
0.369IleTrp: 0.369 ± 0.115
1.566IleTyr: 1.566 ± 0.32
0.0IleXaa: 0.0 ± 0.0
Lys
6.864LysAla: 6.864 ± 0.962
0.691LysCys: 0.691 ± 0.164
4.146LysAsp: 4.146 ± 0.534
4.515LysGlu: 4.515 ± 0.595
2.35LysPhe: 2.35 ± 0.272
4.331LysGly: 4.331 ± 0.47
1.29LysHis: 1.29 ± 0.249
3.317LysIle: 3.317 ± 0.383
3.732LysLys: 3.732 ± 0.529
4.976LysLeu: 4.976 ± 0.477
2.073LysMet: 2.073 ± 0.319
2.488LysAsn: 2.488 ± 0.234
2.672LysPro: 2.672 ± 0.36
2.396LysGln: 2.396 ± 0.35
4.1LysArg: 4.1 ± 0.512
3.916LysSer: 3.916 ± 0.58
2.764LysThr: 2.764 ± 0.353
4.699LysVal: 4.699 ± 0.5
0.875LysTrp: 0.875 ± 0.195
1.889LysTyr: 1.889 ± 0.291
0.0LysXaa: 0.0 ± 0.0
Leu
6.864LeuAla: 6.864 ± 0.655
0.829LeuCys: 0.829 ± 0.213
3.962LeuAsp: 3.962 ± 0.366
4.745LeuGlu: 4.745 ± 0.451
3.271LeuPhe: 3.271 ± 0.379
4.146LeuGly: 4.146 ± 0.488
0.967LeuHis: 0.967 ± 0.177
4.791LeuIle: 4.791 ± 0.451
5.252LeuLys: 5.252 ± 0.43
6.266LeuLeu: 6.266 ± 0.695
2.257LeuMet: 2.257 ± 0.267
3.916LeuAsn: 3.916 ± 0.5
3.409LeuPro: 3.409 ± 0.427
3.041LeuGln: 3.041 ± 0.419
4.1LeuArg: 4.1 ± 0.413
5.344LeuSer: 5.344 ± 0.626
5.482LeuThr: 5.482 ± 0.588
4.607LeuVal: 4.607 ± 0.524
0.967LeuTrp: 0.967 ± 0.236
2.165LeuTyr: 2.165 ± 0.317
0.0LeuXaa: 0.0 ± 0.0
Met
2.396MetAla: 2.396 ± 0.265
0.138MetCys: 0.138 ± 0.077
1.382MetAsp: 1.382 ± 0.243
1.29MetGlu: 1.29 ± 0.223
0.783MetPhe: 0.783 ± 0.191
1.382MetGly: 1.382 ± 0.238
0.461MetHis: 0.461 ± 0.168
1.336MetIle: 1.336 ± 0.267
2.442MetLys: 2.442 ± 0.369
1.797MetLeu: 1.797 ± 0.272
0.829MetMet: 0.829 ± 0.165
1.797MetAsn: 1.797 ± 0.276
1.244MetPro: 1.244 ± 0.273
1.198MetGln: 1.198 ± 0.256
1.566MetArg: 1.566 ± 0.32
2.626MetSer: 2.626 ± 0.401
2.257MetThr: 2.257 ± 0.349
2.165MetVal: 2.165 ± 0.336
0.322MetTrp: 0.322 ± 0.105
1.06MetTyr: 1.06 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
4.008AsnAla: 4.008 ± 0.462
0.322AsnCys: 0.322 ± 0.113
2.488AsnAsp: 2.488 ± 0.21
2.672AsnGlu: 2.672 ± 0.305
2.211AsnPhe: 2.211 ± 0.313
4.561AsnGly: 4.561 ± 0.496
1.29AsnHis: 1.29 ± 0.26
2.626AsnIle: 2.626 ± 0.382
2.902AsnLys: 2.902 ± 0.386
3.271AsnLeu: 3.271 ± 0.446
1.014AsnMet: 1.014 ± 0.211
2.442AsnAsn: 2.442 ± 0.353
3.225AsnPro: 3.225 ± 0.37
1.705AsnGln: 1.705 ± 0.268
2.764AsnArg: 2.764 ± 0.335
2.534AsnSer: 2.534 ± 0.309
3.317AsnThr: 3.317 ± 0.458
3.409AsnVal: 3.409 ± 0.422
0.783AsnTrp: 0.783 ± 0.173
1.612AsnTyr: 1.612 ± 0.273
0.0AsnXaa: 0.0 ± 0.0
Pro
2.58ProAla: 2.58 ± 0.323
0.369ProCys: 0.369 ± 0.118
2.626ProAsp: 2.626 ± 0.358
3.133ProGlu: 3.133 ± 0.497
1.566ProPhe: 1.566 ± 0.357
2.948ProGly: 2.948 ± 0.382
0.553ProHis: 0.553 ± 0.13
2.073ProIle: 2.073 ± 0.312
2.35ProLys: 2.35 ± 0.309
3.041ProLeu: 3.041 ± 0.315
1.29ProMet: 1.29 ± 0.235
2.165ProAsn: 2.165 ± 0.334
1.336ProPro: 1.336 ± 0.306
1.659ProGln: 1.659 ± 0.213
1.244ProArg: 1.244 ± 0.239
2.672ProSer: 2.672 ± 0.344
2.165ProThr: 2.165 ± 0.318
2.948ProVal: 2.948 ± 0.494
0.599ProTrp: 0.599 ± 0.162
1.52ProTyr: 1.52 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
3.409GlnAla: 3.409 ± 0.473
0.276GlnCys: 0.276 ± 0.117
1.981GlnAsp: 1.981 ± 0.297
2.257GlnGlu: 2.257 ± 0.377
1.889GlnPhe: 1.889 ± 0.288
2.672GlnGly: 2.672 ± 0.393
0.737GlnHis: 0.737 ± 0.158
2.718GlnIle: 2.718 ± 0.317
2.165GlnLys: 2.165 ± 0.312
2.995GlnLeu: 2.995 ± 0.319
1.198GlnMet: 1.198 ± 0.247
2.027GlnAsn: 2.027 ± 0.276
1.659GlnPro: 1.659 ± 0.275
2.073GlnGln: 2.073 ± 0.314
2.35GlnArg: 2.35 ± 0.337
2.396GlnSer: 2.396 ± 0.321
2.442GlnThr: 2.442 ± 0.338
2.626GlnVal: 2.626 ± 0.349
0.507GlnTrp: 0.507 ± 0.156
1.244GlnTyr: 1.244 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
4.1ArgAla: 4.1 ± 0.442
0.645ArgCys: 0.645 ± 0.16
2.488ArgAsp: 2.488 ± 0.386
3.593ArgGlu: 3.593 ± 0.592
2.764ArgPhe: 2.764 ± 0.472
3.363ArgGly: 3.363 ± 0.339
0.783ArgHis: 0.783 ± 0.201
3.271ArgIle: 3.271 ± 0.403
4.469ArgLys: 4.469 ± 0.518
4.607ArgLeu: 4.607 ± 0.513
1.797ArgMet: 1.797 ± 0.261
3.133ArgAsn: 3.133 ± 0.317
1.889ArgPro: 1.889 ± 0.294
1.889ArgGln: 1.889 ± 0.281
2.902ArgArg: 2.902 ± 0.41
2.304ArgSer: 2.304 ± 0.281
2.396ArgThr: 2.396 ± 0.383
3.363ArgVal: 3.363 ± 0.369
0.691ArgTrp: 0.691 ± 0.19
1.705ArgTyr: 1.705 ± 0.256
0.0ArgXaa: 0.0 ± 0.0
Ser
6.127SerAla: 6.127 ± 0.675
0.369SerCys: 0.369 ± 0.132
3.409SerAsp: 3.409 ± 0.384
3.778SerGlu: 3.778 ± 0.531
2.948SerPhe: 2.948 ± 0.368
5.528SerGly: 5.528 ± 0.591
1.152SerHis: 1.152 ± 0.292
3.916SerIle: 3.916 ± 0.535
3.732SerLys: 3.732 ± 0.433
4.377SerLeu: 4.377 ± 0.459
1.52SerMet: 1.52 ± 0.291
3.087SerAsn: 3.087 ± 0.364
1.981SerPro: 1.981 ± 0.284
2.073SerGln: 2.073 ± 0.322
2.534SerArg: 2.534 ± 0.396
4.008SerSer: 4.008 ± 0.533
3.87SerThr: 3.87 ± 0.53
4.699SerVal: 4.699 ± 0.445
0.599SerTrp: 0.599 ± 0.199
1.889SerTyr: 1.889 ± 0.252
0.0SerXaa: 0.0 ± 0.0
Thr
5.482ThrAla: 5.482 ± 0.631
0.507ThrCys: 0.507 ± 0.138
2.81ThrAsp: 2.81 ± 0.396
2.534ThrGlu: 2.534 ± 0.316
2.58ThrPhe: 2.58 ± 0.304
4.653ThrGly: 4.653 ± 0.371
1.014ThrHis: 1.014 ± 0.204
3.547ThrIle: 3.547 ± 0.366
3.593ThrLys: 3.593 ± 0.494
4.699ThrLeu: 4.699 ± 0.528
0.967ThrMet: 0.967 ± 0.197
2.856ThrAsn: 2.856 ± 0.535
3.179ThrPro: 3.179 ± 0.359
2.304ThrGln: 2.304 ± 0.287
2.58ThrArg: 2.58 ± 0.314
4.561ThrSer: 4.561 ± 0.523
3.547ThrThr: 3.547 ± 0.587
4.607ThrVal: 4.607 ± 0.579
0.691ThrTrp: 0.691 ± 0.173
1.52ThrTyr: 1.52 ± 0.264
0.0ThrXaa: 0.0 ± 0.0
Val
6.358ValAla: 6.358 ± 0.607
0.553ValCys: 0.553 ± 0.161
4.561ValAsp: 4.561 ± 0.5
5.252ValGlu: 5.252 ± 0.434
3.041ValPhe: 3.041 ± 0.366
4.238ValGly: 4.238 ± 0.447
1.244ValHis: 1.244 ± 0.227
4.331ValIle: 4.331 ± 0.422
4.469ValLys: 4.469 ± 0.446
5.16ValLeu: 5.16 ± 0.498
2.396ValMet: 2.396 ± 0.313
2.856ValAsn: 2.856 ± 0.363
2.856ValPro: 2.856 ± 0.376
2.764ValGln: 2.764 ± 0.394
4.1ValArg: 4.1 ± 0.4
3.363ValSer: 3.363 ± 0.445
4.837ValThr: 4.837 ± 0.473
5.022ValVal: 5.022 ± 0.563
0.875ValTrp: 0.875 ± 0.187
2.119ValTyr: 2.119 ± 0.404
0.0ValXaa: 0.0 ± 0.0
Trp
1.014TrpAla: 1.014 ± 0.234
0.276TrpCys: 0.276 ± 0.125
0.691TrpAsp: 0.691 ± 0.201
1.244TrpGlu: 1.244 ± 0.285
0.415TrpPhe: 0.415 ± 0.145
0.691TrpGly: 0.691 ± 0.186
0.276TrpHis: 0.276 ± 0.099
0.921TrpIle: 0.921 ± 0.205
0.415TrpLys: 0.415 ± 0.132
1.336TrpLeu: 1.336 ± 0.192
0.737TrpMet: 0.737 ± 0.203
0.691TrpAsn: 0.691 ± 0.181
0.322TrpPro: 0.322 ± 0.131
0.461TrpGln: 0.461 ± 0.162
0.921TrpArg: 0.921 ± 0.222
0.599TrpSer: 0.599 ± 0.17
0.691TrpThr: 0.691 ± 0.197
1.29TrpVal: 1.29 ± 0.257
0.092TrpTrp: 0.092 ± 0.062
0.461TrpTyr: 0.461 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.442TyrAla: 2.442 ± 0.326
0.461TyrCys: 0.461 ± 0.137
1.797TyrAsp: 1.797 ± 0.268
1.889TyrGlu: 1.889 ± 0.281
1.382TyrPhe: 1.382 ± 0.266
2.948TyrGly: 2.948 ± 0.343
0.415TyrHis: 0.415 ± 0.139
1.29TyrIle: 1.29 ± 0.252
1.751TyrLys: 1.751 ± 0.276
1.935TyrLeu: 1.935 ± 0.272
0.829TyrMet: 0.829 ± 0.199
1.981TyrAsn: 1.981 ± 0.291
1.336TyrPro: 1.336 ± 0.223
1.981TyrGln: 1.981 ± 0.311
1.566TyrArg: 1.566 ± 0.236
1.981TyrSer: 1.981 ± 0.347
1.705TyrThr: 1.705 ± 0.293
2.165TyrVal: 2.165 ± 0.362
0.369TyrTrp: 0.369 ± 0.123
1.014TyrTyr: 1.014 ± 0.198
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (21707 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski