Amino acid dipepetide frequency for Aphanizomenon phage vB_AphaS-CL131

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.814AlaAla: 6.814 ± 1.131
0.582AlaCys: 0.582 ± 0.144
3.582AlaAsp: 3.582 ± 0.376
4.513AlaGlu: 4.513 ± 0.446
3.087AlaPhe: 3.087 ± 0.354
4.746AlaGly: 4.746 ± 0.625
0.757AlaHis: 0.757 ± 0.148
7.687AlaIle: 7.687 ± 0.552
5.62AlaLys: 5.62 ± 0.637
6.639AlaLeu: 6.639 ± 0.682
1.631AlaMet: 1.631 ± 0.24
4.018AlaAsn: 4.018 ± 0.405
1.864AlaPro: 1.864 ± 0.271
2.737AlaGln: 2.737 ± 0.398
3.32AlaArg: 3.32 ± 0.402
5.736AlaSer: 5.736 ± 0.522
5.62AlaThr: 5.62 ± 0.815
3.989AlaVal: 3.989 ± 0.401
1.107AlaTrp: 1.107 ± 0.191
2.533AlaTyr: 2.533 ± 0.22
0.0AlaXaa: 0.0 ± 0.0
Cys
0.524CysAla: 0.524 ± 0.161
0.087CysCys: 0.087 ± 0.052
0.903CysAsp: 0.903 ± 0.197
0.67CysGlu: 0.67 ± 0.148
0.495CysPhe: 0.495 ± 0.135
1.165CysGly: 1.165 ± 0.241
0.233CysHis: 0.233 ± 0.082
1.019CysIle: 1.019 ± 0.189
1.048CysLys: 1.048 ± 0.226
0.961CysLeu: 0.961 ± 0.199
0.087CysMet: 0.087 ± 0.054
0.466CysAsn: 0.466 ± 0.106
0.466CysPro: 0.466 ± 0.13
0.699CysGln: 0.699 ± 0.144
0.466CysArg: 0.466 ± 0.116
0.524CysSer: 0.524 ± 0.128
0.495CysThr: 0.495 ± 0.109
0.553CysVal: 0.553 ± 0.134
0.058CysTrp: 0.058 ± 0.041
0.379CysTyr: 0.379 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
4.455AspAla: 4.455 ± 0.31
0.757AspCys: 0.757 ± 0.168
3.145AspAsp: 3.145 ± 0.399
3.29AspGlu: 3.29 ± 0.353
2.679AspPhe: 2.679 ± 0.311
3.611AspGly: 3.611 ± 0.361
0.699AspHis: 0.699 ± 0.154
3.611AspIle: 3.611 ± 0.338
4.31AspLys: 4.31 ± 0.55
4.106AspLeu: 4.106 ± 0.313
0.99AspMet: 0.99 ± 0.19
2.562AspAsn: 2.562 ± 0.293
2.213AspPro: 2.213 ± 0.234
1.776AspGln: 1.776 ± 0.191
3.203AspArg: 3.203 ± 0.258
4.28AspSer: 4.28 ± 0.505
2.795AspThr: 2.795 ± 0.283
2.883AspVal: 2.883 ± 0.328
1.077AspTrp: 1.077 ± 0.228
2.388AspTyr: 2.388 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
4.775GluAla: 4.775 ± 0.493
0.641GluCys: 0.641 ± 0.143
4.106GluAsp: 4.106 ± 0.436
4.805GluGlu: 4.805 ± 0.694
3.232GluPhe: 3.232 ± 0.299
2.97GluGly: 2.97 ± 0.291
1.048GluHis: 1.048 ± 0.217
6.144GluIle: 6.144 ± 0.481
5.212GluLys: 5.212 ± 0.477
6.989GluLeu: 6.989 ± 0.533
1.223GluMet: 1.223 ± 0.225
3.582GluAsn: 3.582 ± 0.342
1.805GluPro: 1.805 ± 0.243
2.883GluGln: 2.883 ± 0.392
3.232GluArg: 3.232 ± 0.373
4.484GluSer: 4.484 ± 0.424
3.698GluThr: 3.698 ± 0.353
4.018GluVal: 4.018 ± 0.342
0.757GluTrp: 0.757 ± 0.139
2.533GluTyr: 2.533 ± 0.309
0.0GluXaa: 0.0 ± 0.0
Phe
2.766PheAla: 2.766 ± 0.232
0.553PheCys: 0.553 ± 0.134
2.097PheAsp: 2.097 ± 0.257
2.359PheGlu: 2.359 ± 0.292
1.194PhePhe: 1.194 ± 0.239
3.203PheGly: 3.203 ± 0.308
0.553PheHis: 0.553 ± 0.136
2.009PheIle: 2.009 ± 0.274
3.057PheLys: 3.057 ± 0.355
2.766PheLeu: 2.766 ± 0.321
0.611PheMet: 0.611 ± 0.112
2.679PheAsn: 2.679 ± 0.304
1.776PhePro: 1.776 ± 0.239
1.98PheGln: 1.98 ± 0.238
1.514PheArg: 1.514 ± 0.227
3.32PheSer: 3.32 ± 0.302
2.475PheThr: 2.475 ± 0.286
1.893PheVal: 1.893 ± 0.253
0.786PheTrp: 0.786 ± 0.137
1.66PheTyr: 1.66 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
4.018GlyAla: 4.018 ± 0.728
0.699GlyCys: 0.699 ± 0.16
4.426GlyAsp: 4.426 ± 0.27
3.989GlyGlu: 3.989 ± 0.382
2.679GlyPhe: 2.679 ± 0.243
3.96GlyGly: 3.96 ± 0.433
0.932GlyHis: 0.932 ± 0.184
4.892GlyIle: 4.892 ± 0.345
5.067GlyLys: 5.067 ± 0.413
5.038GlyLeu: 5.038 ± 0.479
1.776GlyMet: 1.776 ± 0.273
3.087GlyAsn: 3.087 ± 0.263
0.582GlyPro: 0.582 ± 0.146
2.126GlyGln: 2.126 ± 0.304
2.912GlyArg: 2.912 ± 0.269
4.834GlySer: 4.834 ± 0.543
3.494GlyThr: 3.494 ± 0.42
3.873GlyVal: 3.873 ± 0.336
0.903GlyTrp: 0.903 ± 0.201
2.825GlyTyr: 2.825 ± 0.313
0.0GlyXaa: 0.0 ± 0.0
His
0.903HisAla: 0.903 ± 0.153
0.204HisCys: 0.204 ± 0.08
0.641HisAsp: 0.641 ± 0.139
1.223HisGlu: 1.223 ± 0.204
0.874HisPhe: 0.874 ± 0.172
1.077HisGly: 1.077 ± 0.196
0.553HisHis: 0.553 ± 0.132
0.961HisIle: 0.961 ± 0.166
0.961HisLys: 0.961 ± 0.188
1.602HisLeu: 1.602 ± 0.228
0.204HisMet: 0.204 ± 0.074
0.67HisAsn: 0.67 ± 0.145
0.99HisPro: 0.99 ± 0.146
0.903HisGln: 0.903 ± 0.202
0.786HisArg: 0.786 ± 0.144
1.048HisSer: 1.048 ± 0.213
0.815HisThr: 0.815 ± 0.156
0.611HisVal: 0.611 ± 0.147
0.262HisTrp: 0.262 ± 0.087
0.466HisTyr: 0.466 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
8.707IleAla: 8.707 ± 0.575
0.757IleCys: 0.757 ± 0.154
3.815IleAsp: 3.815 ± 0.339
5.212IleGlu: 5.212 ± 0.407
2.155IlePhe: 2.155 ± 0.301
4.339IleGly: 4.339 ± 0.456
0.903IleHis: 0.903 ± 0.191
3.815IleIle: 3.815 ± 0.38
5.62IleLys: 5.62 ± 0.412
5.678IleLeu: 5.678 ± 0.387
0.932IleMet: 0.932 ± 0.158
4.251IleAsn: 4.251 ± 0.363
3.844IlePro: 3.844 ± 0.295
3.145IleGln: 3.145 ± 0.312
2.941IleArg: 2.941 ± 0.307
4.455IleSer: 4.455 ± 0.485
5.096IleThr: 5.096 ± 0.495
3.32IleVal: 3.32 ± 0.316
0.757IleTrp: 0.757 ± 0.162
1.864IleTyr: 1.864 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
5.038LysAla: 5.038 ± 0.527
1.107LysCys: 1.107 ± 0.242
3.553LysAsp: 3.553 ± 0.353
5.125LysGlu: 5.125 ± 0.484
2.941LysPhe: 2.941 ± 0.288
3.465LysGly: 3.465 ± 0.36
1.369LysHis: 1.369 ± 0.218
5.445LysIle: 5.445 ± 0.454
5.795LysLys: 5.795 ± 0.683
7.251LysLeu: 7.251 ± 0.465
1.572LysMet: 1.572 ± 0.27
3.785LysAsn: 3.785 ± 0.436
3.29LysPro: 3.29 ± 0.418
4.135LysGln: 4.135 ± 0.512
3.203LysArg: 3.203 ± 0.302
5.154LysSer: 5.154 ± 0.368
4.484LysThr: 4.484 ± 0.421
4.368LysVal: 4.368 ± 0.377
1.107LysTrp: 1.107 ± 0.229
2.766LysTyr: 2.766 ± 0.298
0.0LysXaa: 0.0 ± 0.0
Leu
7.076LeuAla: 7.076 ± 0.764
0.99LeuCys: 0.99 ± 0.215
5.008LeuAsp: 5.008 ± 0.316
7.105LeuGlu: 7.105 ± 0.79
3.436LeuPhe: 3.436 ± 0.323
4.601LeuGly: 4.601 ± 0.31
1.252LeuHis: 1.252 ± 0.21
5.271LeuIle: 5.271 ± 0.401
6.231LeuLys: 6.231 ± 0.541
7.047LeuLeu: 7.047 ± 0.484
1.747LeuMet: 1.747 ± 0.276
4.426LeuAsn: 4.426 ± 0.391
4.28LeuPro: 4.28 ± 0.423
3.465LeuGln: 3.465 ± 0.477
4.018LeuArg: 4.018 ± 0.393
6.814LeuSer: 6.814 ± 0.489
5.562LeuThr: 5.562 ± 0.452
4.251LeuVal: 4.251 ± 0.371
0.932LeuTrp: 0.932 ± 0.202
1.951LeuTyr: 1.951 ± 0.21
0.0LeuXaa: 0.0 ± 0.0
Met
1.864MetAla: 1.864 ± 0.202
0.029MetCys: 0.029 ± 0.027
0.786MetAsp: 0.786 ± 0.185
1.019MetGlu: 1.019 ± 0.187
0.757MetPhe: 0.757 ± 0.147
1.252MetGly: 1.252 ± 0.214
0.058MetHis: 0.058 ± 0.042
1.339MetIle: 1.339 ± 0.219
1.427MetLys: 1.427 ± 0.232
1.427MetLeu: 1.427 ± 0.192
0.466MetMet: 0.466 ± 0.119
1.107MetAsn: 1.107 ± 0.186
0.99MetPro: 0.99 ± 0.154
0.844MetGln: 0.844 ± 0.161
0.961MetArg: 0.961 ± 0.232
1.834MetSer: 1.834 ± 0.218
1.223MetThr: 1.223 ± 0.176
1.223MetVal: 1.223 ± 0.174
0.204MetTrp: 0.204 ± 0.079
0.233MetTyr: 0.233 ± 0.08
0.0MetXaa: 0.0 ± 0.0
Asn
3.611AsnAla: 3.611 ± 0.381
0.611AsnCys: 0.611 ± 0.143
2.33AsnAsp: 2.33 ± 0.353
2.912AsnGlu: 2.912 ± 0.313
1.747AsnPhe: 1.747 ± 0.257
3.349AsnGly: 3.349 ± 0.277
1.369AsnHis: 1.369 ± 0.254
3.553AsnIle: 3.553 ± 0.314
3.261AsnLys: 3.261 ± 0.352
4.717AsnLeu: 4.717 ± 0.36
0.728AsnMet: 0.728 ± 0.14
2.97AsnAsn: 2.97 ± 0.335
3.378AsnPro: 3.378 ± 0.357
2.912AsnGln: 2.912 ± 0.326
2.388AsnArg: 2.388 ± 0.282
4.28AsnSer: 4.28 ± 0.434
3.028AsnThr: 3.028 ± 0.415
2.3AsnVal: 2.3 ± 0.295
0.757AsnTrp: 0.757 ± 0.155
2.097AsnTyr: 2.097 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
2.475ProAla: 2.475 ± 0.297
0.437ProCys: 0.437 ± 0.126
3.465ProAsp: 3.465 ± 0.412
3.232ProGlu: 3.232 ± 0.396
1.107ProPhe: 1.107 ± 0.203
2.446ProGly: 2.446 ± 0.225
0.582ProHis: 0.582 ± 0.141
2.213ProIle: 2.213 ± 0.271
3.32ProLys: 3.32 ± 0.305
2.97ProLeu: 2.97 ± 0.348
0.641ProMet: 0.641 ± 0.196
1.864ProAsn: 1.864 ± 0.224
1.66ProPro: 1.66 ± 0.188
1.834ProGln: 1.834 ± 0.277
1.427ProArg: 1.427 ± 0.208
2.883ProSer: 2.883 ± 0.307
3.028ProThr: 3.028 ± 0.355
2.737ProVal: 2.737 ± 0.295
0.32ProTrp: 0.32 ± 0.128
1.194ProTyr: 1.194 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
3.32GlnAla: 3.32 ± 0.542
0.437GlnCys: 0.437 ± 0.114
2.242GlnAsp: 2.242 ± 0.256
3.727GlnGlu: 3.727 ± 0.402
1.718GlnPhe: 1.718 ± 0.233
2.737GlnGly: 2.737 ± 0.33
0.553GlnHis: 0.553 ± 0.115
3.407GlnIle: 3.407 ± 0.407
3.378GlnLys: 3.378 ± 0.441
4.688GlnLeu: 4.688 ± 0.519
0.844GlnMet: 0.844 ± 0.15
1.951GlnAsn: 1.951 ± 0.278
1.747GlnPro: 1.747 ± 0.214
2.999GlnGln: 2.999 ± 0.519
2.504GlnArg: 2.504 ± 0.291
3.29GlnSer: 3.29 ± 0.401
1.864GlnThr: 1.864 ± 0.216
2.737GlnVal: 2.737 ± 0.337
0.495GlnTrp: 0.495 ± 0.102
1.194GlnTyr: 1.194 ± 0.152
0.0GlnXaa: 0.0 ± 0.0
Arg
2.679ArgAla: 2.679 ± 0.329
0.874ArgCys: 0.874 ± 0.202
1.864ArgAsp: 1.864 ± 0.204
3.232ArgGlu: 3.232 ± 0.325
1.572ArgPhe: 1.572 ± 0.206
2.737ArgGly: 2.737 ± 0.367
0.815ArgHis: 0.815 ± 0.214
3.727ArgIle: 3.727 ± 0.291
3.756ArgLys: 3.756 ± 0.305
3.727ArgLeu: 3.727 ± 0.399
1.077ArgMet: 1.077 ± 0.179
2.766ArgAsn: 2.766 ± 0.254
1.631ArgPro: 1.631 ± 0.255
2.766ArgGln: 2.766 ± 0.353
2.592ArgArg: 2.592 ± 0.267
2.999ArgSer: 2.999 ± 0.458
2.475ArgThr: 2.475 ± 0.247
2.737ArgVal: 2.737 ± 0.293
0.815ArgTrp: 0.815 ± 0.192
1.631ArgTyr: 1.631 ± 0.231
0.0ArgXaa: 0.0 ± 0.0
Ser
5.067SerAla: 5.067 ± 0.395
0.582SerCys: 0.582 ± 0.134
3.902SerAsp: 3.902 ± 0.317
4.805SerGlu: 4.805 ± 0.416
3.087SerPhe: 3.087 ± 0.354
5.533SerGly: 5.533 ± 0.423
1.252SerHis: 1.252 ± 0.214
5.183SerIle: 5.183 ± 0.366
4.688SerLys: 4.688 ± 0.36
6.697SerLeu: 6.697 ± 0.505
1.572SerMet: 1.572 ± 0.234
3.815SerAsn: 3.815 ± 0.383
2.854SerPro: 2.854 ± 0.314
3.494SerGln: 3.494 ± 0.367
3.553SerArg: 3.553 ± 0.318
5.678SerSer: 5.678 ± 0.495
3.698SerThr: 3.698 ± 0.643
4.31SerVal: 4.31 ± 0.359
1.077SerTrp: 1.077 ± 0.183
2.737SerTyr: 2.737 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
5.096ThrAla: 5.096 ± 0.658
0.582ThrCys: 0.582 ± 0.131
3.261ThrAsp: 3.261 ± 0.393
3.931ThrGlu: 3.931 ± 0.407
2.067ThrPhe: 2.067 ± 0.284
3.785ThrGly: 3.785 ± 0.697
1.048ThrHis: 1.048 ± 0.169
4.426ThrIle: 4.426 ± 0.392
4.048ThrLys: 4.048 ± 0.344
5.212ThrLeu: 5.212 ± 0.529
0.932ThrMet: 0.932 ± 0.154
3.028ThrAsn: 3.028 ± 0.326
3.261ThrPro: 3.261 ± 0.323
2.562ThrGln: 2.562 ± 0.385
1.747ThrArg: 1.747 ± 0.259
4.135ThrSer: 4.135 ± 0.617
4.28ThrThr: 4.28 ± 0.748
4.222ThrVal: 4.222 ± 0.441
0.757ThrTrp: 0.757 ± 0.155
1.805ThrTyr: 1.805 ± 0.263
0.0ThrXaa: 0.0 ± 0.0
Val
4.28ValAla: 4.28 ± 0.392
0.641ValCys: 0.641 ± 0.119
3.145ValAsp: 3.145 ± 0.391
4.018ValGlu: 4.018 ± 0.342
2.475ValPhe: 2.475 ± 0.279
3.931ValGly: 3.931 ± 0.315
0.903ValHis: 0.903 ± 0.179
3.873ValIle: 3.873 ± 0.379
4.572ValLys: 4.572 ± 0.495
3.465ValLeu: 3.465 ± 0.285
1.252ValMet: 1.252 ± 0.189
2.825ValAsn: 2.825 ± 0.259
1.805ValPro: 1.805 ± 0.265
1.747ValGln: 1.747 ± 0.266
2.999ValArg: 2.999 ± 0.316
4.543ValSer: 4.543 ± 0.386
3.727ValThr: 3.727 ± 0.421
3.698ValVal: 3.698 ± 0.346
0.699ValTrp: 0.699 ± 0.13
1.369ValTyr: 1.369 ± 0.187
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.138
0.204TrpCys: 0.204 ± 0.096
0.699TrpAsp: 0.699 ± 0.16
1.019TrpGlu: 1.019 ± 0.183
0.408TrpPhe: 0.408 ± 0.104
0.815TrpGly: 0.815 ± 0.144
0.175TrpHis: 0.175 ± 0.071
1.019TrpIle: 1.019 ± 0.149
0.961TrpLys: 0.961 ± 0.177
1.223TrpLeu: 1.223 ± 0.201
0.175TrpMet: 0.175 ± 0.079
1.077TrpAsn: 1.077 ± 0.197
0.116TrpPro: 0.116 ± 0.06
0.611TrpGln: 0.611 ± 0.131
0.874TrpArg: 0.874 ± 0.161
0.757TrpSer: 0.757 ± 0.118
0.786TrpThr: 0.786 ± 0.173
0.932TrpVal: 0.932 ± 0.161
0.146TrpTrp: 0.146 ± 0.075
0.553TrpTyr: 0.553 ± 0.139
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.271TyrAla: 2.271 ± 0.222
0.524TyrCys: 0.524 ± 0.133
1.864TyrAsp: 1.864 ± 0.349
1.951TyrGlu: 1.951 ± 0.235
1.543TyrPhe: 1.543 ± 0.262
2.388TyrGly: 2.388 ± 0.325
0.699TyrHis: 0.699 ± 0.157
2.009TyrIle: 2.009 ± 0.267
2.621TyrLys: 2.621 ± 0.333
3.261TyrLeu: 3.261 ± 0.299
0.553TyrMet: 0.553 ± 0.132
1.398TyrAsn: 1.398 ± 0.227
1.281TyrPro: 1.281 ± 0.173
2.213TyrGln: 2.213 ± 0.256
1.834TyrArg: 1.834 ± 0.267
2.533TyrSer: 2.533 ± 0.332
1.543TyrThr: 1.543 ± 0.203
1.31TyrVal: 1.31 ± 0.197
0.379TyrTrp: 0.379 ± 0.113
1.223TyrTyr: 1.223 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 149 proteins (34343 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski