Amino acid dipepetide frequency for Cyanophage Syn2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.183AlaAla: 7.183 ± 0.559
0.49AlaCys: 0.49 ± 0.113
4.136AlaAsp: 4.136 ± 0.315
4.263AlaGlu: 4.263 ± 0.338
3.519AlaPhe: 3.519 ± 0.317
6.367AlaGly: 6.367 ± 0.502
0.998AlaHis: 0.998 ± 0.125
3.991AlaIle: 3.991 ± 0.221
3.718AlaLys: 3.718 ± 0.294
5.024AlaLeu: 5.024 ± 0.347
1.36AlaMet: 1.36 ± 0.21
3.954AlaAsn: 3.954 ± 0.327
2.721AlaPro: 2.721 ± 0.22
2.92AlaGln: 2.92 ± 0.236
2.703AlaArg: 2.703 ± 0.257
5.297AlaSer: 5.297 ± 0.416
6.04AlaThr: 6.04 ± 0.621
4.752AlaVal: 4.752 ± 0.297
0.453AlaTrp: 0.453 ± 0.092
2.213AlaTyr: 2.213 ± 0.232
0.0AlaXaa: 0.0 ± 0.0
Cys
0.363CysAla: 0.363 ± 0.086
0.054CysCys: 0.054 ± 0.032
0.707CysAsp: 0.707 ± 0.149
0.544CysGlu: 0.544 ± 0.117
0.345CysPhe: 0.345 ± 0.078
0.508CysGly: 0.508 ± 0.125
0.181CysHis: 0.181 ± 0.067
0.399CysIle: 0.399 ± 0.084
0.508CysLys: 0.508 ± 0.104
0.617CysLeu: 0.617 ± 0.124
0.254CysMet: 0.254 ± 0.074
0.327CysAsn: 0.327 ± 0.104
0.218CysPro: 0.218 ± 0.062
0.345CysGln: 0.345 ± 0.107
0.327CysArg: 0.327 ± 0.081
0.562CysSer: 0.562 ± 0.116
0.399CysThr: 0.399 ± 0.099
0.526CysVal: 0.526 ± 0.103
0.145CysTrp: 0.145 ± 0.064
0.345CysTyr: 0.345 ± 0.076
0.0CysXaa: 0.0 ± 0.0
Asp
5.188AspAla: 5.188 ± 0.317
0.707AspCys: 0.707 ± 0.153
4.245AspAsp: 4.245 ± 0.289
3.954AspGlu: 3.954 ± 0.291
3.156AspPhe: 3.156 ± 0.235
6.185AspGly: 6.185 ± 0.332
1.07AspHis: 1.07 ± 0.153
3.9AspIle: 3.9 ± 0.284
3.374AspLys: 3.374 ± 0.251
4.517AspLeu: 4.517 ± 0.326
1.487AspMet: 1.487 ± 0.202
3.41AspAsn: 3.41 ± 0.255
3.519AspPro: 3.519 ± 0.308
2.267AspGln: 2.267 ± 0.19
2.539AspArg: 2.539 ± 0.219
3.845AspSer: 3.845 ± 0.284
4.644AspThr: 4.644 ± 0.375
4.136AspVal: 4.136 ± 0.312
1.016AspTrp: 1.016 ± 0.128
3.138AspTyr: 3.138 ± 0.216
0.0AspXaa: 0.0 ± 0.0
Glu
3.555GluAla: 3.555 ± 0.283
0.472GluCys: 0.472 ± 0.104
4.19GluAsp: 4.19 ± 0.33
4.607GluGlu: 4.607 ± 0.444
2.92GluPhe: 2.92 ± 0.242
4.063GluGly: 4.063 ± 0.348
1.016GluHis: 1.016 ± 0.152
4.371GluIle: 4.371 ± 0.364
3.592GluLys: 3.592 ± 0.396
4.861GluLeu: 4.861 ± 0.356
1.506GluMet: 1.506 ± 0.212
3.102GluAsn: 3.102 ± 0.231
1.578GluPro: 1.578 ± 0.16
2.539GluGln: 2.539 ± 0.237
2.866GluArg: 2.866 ± 0.305
3.628GluSer: 3.628 ± 0.291
3.483GluThr: 3.483 ± 0.304
4.317GluVal: 4.317 ± 0.296
0.816GluTrp: 0.816 ± 0.12
2.503GluTyr: 2.503 ± 0.25
0.0GluXaa: 0.0 ± 0.0
Phe
2.866PheAla: 2.866 ± 0.247
0.453PheCys: 0.453 ± 0.1
3.664PheAsp: 3.664 ± 0.251
2.648PheGlu: 2.648 ± 0.235
1.977PhePhe: 1.977 ± 0.22
3.374PheGly: 3.374 ± 0.399
0.689PheHis: 0.689 ± 0.131
2.685PheIle: 2.685 ± 0.236
2.485PheLys: 2.485 ± 0.272
3.12PheLeu: 3.12 ± 0.27
1.016PheMet: 1.016 ± 0.163
3.065PheAsn: 3.065 ± 0.23
1.614PhePro: 1.614 ± 0.221
1.669PheGln: 1.669 ± 0.175
1.778PheArg: 1.778 ± 0.136
3.319PheSer: 3.319 ± 0.225
3.283PheThr: 3.283 ± 0.371
2.793PheVal: 2.793 ± 0.293
0.345PheTrp: 0.345 ± 0.067
1.506PheTyr: 1.506 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
6.004GlyAla: 6.004 ± 0.487
0.544GlyCys: 0.544 ± 0.101
4.97GlyAsp: 4.97 ± 0.417
4.172GlyGlu: 4.172 ± 0.259
3.301GlyPhe: 3.301 ± 0.261
9.36GlyGly: 9.36 ± 1.359
1.451GlyHis: 1.451 ± 0.174
4.48GlyIle: 4.48 ± 0.341
4.19GlyLys: 4.19 ± 0.417
5.133GlyLeu: 5.133 ± 0.4
1.796GlyMet: 1.796 ± 0.27
4.571GlyAsn: 4.571 ± 0.375
2.032GlyPro: 2.032 ± 0.24
3.229GlyGln: 3.229 ± 0.303
3.011GlyArg: 3.011 ± 0.273
7.129GlySer: 7.129 ± 0.569
7.02GlyThr: 7.02 ± 0.644
5.133GlyVal: 5.133 ± 0.344
1.052GlyTrp: 1.052 ± 0.112
3.537GlyTyr: 3.537 ± 0.287
0.0GlyXaa: 0.0 ± 0.0
His
0.671HisAla: 0.671 ± 0.123
0.163HisCys: 0.163 ± 0.05
0.998HisAsp: 0.998 ± 0.162
0.925HisGlu: 0.925 ± 0.15
0.762HisPhe: 0.762 ± 0.115
1.179HisGly: 1.179 ± 0.169
0.308HisHis: 0.308 ± 0.071
0.834HisIle: 0.834 ± 0.13
0.744HisLys: 0.744 ± 0.118
1.197HisLeu: 1.197 ± 0.206
0.49HisMet: 0.49 ± 0.107
0.762HisAsn: 0.762 ± 0.149
0.834HisPro: 0.834 ± 0.137
0.58HisGln: 0.58 ± 0.094
0.707HisArg: 0.707 ± 0.123
0.925HisSer: 0.925 ± 0.132
1.052HisThr: 1.052 ± 0.154
0.943HisVal: 0.943 ± 0.124
0.218HisTrp: 0.218 ± 0.064
0.689HisTyr: 0.689 ± 0.128
0.0HisXaa: 0.0 ± 0.0
Ile
4.226IleAla: 4.226 ± 0.266
0.453IleCys: 0.453 ± 0.091
4.789IleAsp: 4.789 ± 0.265
3.61IleGlu: 3.61 ± 0.234
2.467IlePhe: 2.467 ± 0.23
4.136IleGly: 4.136 ± 0.352
0.653IleHis: 0.653 ± 0.115
3.573IleIle: 3.573 ± 0.267
3.936IleLys: 3.936 ± 0.325
4.263IleLeu: 4.263 ± 0.279
1.143IleMet: 1.143 ± 0.175
3.827IleAsn: 3.827 ± 0.302
2.83IlePro: 2.83 ± 0.265
2.267IleGln: 2.267 ± 0.263
2.558IleArg: 2.558 ± 0.232
4.154IleSer: 4.154 ± 0.404
5.097IleThr: 5.097 ± 0.539
3.9IleVal: 3.9 ± 0.458
0.671IleTrp: 0.671 ± 0.11
2.013IleTyr: 2.013 ± 0.245
0.0IleXaa: 0.0 ± 0.0
Lys
3.864LysAla: 3.864 ± 0.435
0.508LysCys: 0.508 ± 0.096
3.156LysAsp: 3.156 ± 0.272
3.7LysGlu: 3.7 ± 0.45
2.449LysPhe: 2.449 ± 0.264
3.682LysGly: 3.682 ± 0.331
0.78LysHis: 0.78 ± 0.147
3.845LysIle: 3.845 ± 0.247
4.444LysLys: 4.444 ± 0.627
4.716LysLeu: 4.716 ± 0.368
1.487LysMet: 1.487 ± 0.2
3.029LysAsn: 3.029 ± 0.253
2.231LysPro: 2.231 ± 0.292
2.177LysGln: 2.177 ± 0.251
2.412LysArg: 2.412 ± 0.26
3.882LysSer: 3.882 ± 0.338
3.991LysThr: 3.991 ± 0.244
4.208LysVal: 4.208 ± 0.324
0.744LysTrp: 0.744 ± 0.144
2.757LysTyr: 2.757 ± 0.261
0.0LysXaa: 0.0 ± 0.0
Leu
5.097LeuAla: 5.097 ± 0.371
0.653LeuCys: 0.653 ± 0.143
5.95LeuAsp: 5.95 ± 0.36
4.263LeuGlu: 4.263 ± 0.325
2.83LeuPhe: 2.83 ± 0.267
4.861LeuGly: 4.861 ± 0.39
1.233LeuHis: 1.233 ± 0.185
3.972LeuIle: 3.972 ± 0.27
4.771LeuLys: 4.771 ± 0.364
5.224LeuLeu: 5.224 ± 0.391
1.542LeuMet: 1.542 ± 0.194
4.444LeuAsn: 4.444 ± 0.268
3.102LeuPro: 3.102 ± 0.255
2.975LeuGln: 2.975 ± 0.212
3.247LeuArg: 3.247 ± 0.263
4.97LeuSer: 4.97 ± 0.311
5.804LeuThr: 5.804 ± 0.346
4.027LeuVal: 4.027 ± 0.278
0.599LeuTrp: 0.599 ± 0.136
2.848LeuTyr: 2.848 ± 0.25
0.0LeuXaa: 0.0 ± 0.0
Met
1.36MetAla: 1.36 ± 0.201
0.091MetCys: 0.091 ± 0.041
0.961MetAsp: 0.961 ± 0.164
1.397MetGlu: 1.397 ± 0.235
0.943MetPhe: 0.943 ± 0.159
1.324MetGly: 1.324 ± 0.208
0.363MetHis: 0.363 ± 0.091
1.342MetIle: 1.342 ± 0.175
1.814MetLys: 1.814 ± 0.27
1.633MetLeu: 1.633 ± 0.209
0.653MetMet: 0.653 ± 0.124
1.233MetAsn: 1.233 ± 0.209
0.834MetPro: 0.834 ± 0.111
0.961MetGln: 0.961 ± 0.149
1.088MetArg: 1.088 ± 0.176
1.542MetSer: 1.542 ± 0.221
1.578MetThr: 1.578 ± 0.233
1.034MetVal: 1.034 ± 0.156
0.236MetTrp: 0.236 ± 0.067
0.726MetTyr: 0.726 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
4.208AsnAla: 4.208 ± 0.288
0.417AsnCys: 0.417 ± 0.098
3.211AsnAsp: 3.211 ± 0.217
3.065AsnGlu: 3.065 ± 0.262
2.83AsnPhe: 2.83 ± 0.21
4.371AsnGly: 4.371 ± 0.466
0.744AsnHis: 0.744 ± 0.119
3.682AsnIle: 3.682 ± 0.268
2.848AsnLys: 2.848 ± 0.267
4.589AsnLeu: 4.589 ± 0.313
0.943AsnMet: 0.943 ± 0.151
3.483AsnAsn: 3.483 ± 0.251
3.229AsnPro: 3.229 ± 0.251
2.267AsnGln: 2.267 ± 0.219
2.34AsnArg: 2.34 ± 0.152
3.954AsnSer: 3.954 ± 0.285
4.245AsnThr: 4.245 ± 0.397
4.027AsnVal: 4.027 ± 0.347
0.744AsnTrp: 0.744 ± 0.11
2.322AsnTyr: 2.322 ± 0.231
0.0AsnXaa: 0.0 ± 0.0
Pro
2.848ProAla: 2.848 ± 0.294
0.181ProCys: 0.181 ± 0.055
2.594ProAsp: 2.594 ± 0.224
2.83ProGlu: 2.83 ± 0.28
1.723ProPhe: 1.723 ± 0.193
3.392ProGly: 3.392 ± 0.338
0.653ProHis: 0.653 ± 0.126
2.304ProIle: 2.304 ± 0.251
2.14ProLys: 2.14 ± 0.317
2.503ProLeu: 2.503 ± 0.228
0.617ProMet: 0.617 ± 0.116
1.923ProAsn: 1.923 ± 0.212
1.56ProPro: 1.56 ± 0.235
1.088ProGln: 1.088 ± 0.125
1.633ProArg: 1.633 ± 0.172
3.084ProSer: 3.084 ± 0.234
2.812ProThr: 2.812 ± 0.218
2.521ProVal: 2.521 ± 0.241
0.599ProTrp: 0.599 ± 0.106
1.705ProTyr: 1.705 ± 0.185
0.0ProXaa: 0.0 ± 0.0
Gln
2.34GlnAla: 2.34 ± 0.203
0.29GlnCys: 0.29 ± 0.08
2.34GlnAsp: 2.34 ± 0.175
2.358GlnGlu: 2.358 ± 0.226
1.814GlnPhe: 1.814 ± 0.189
2.485GlnGly: 2.485 ± 0.215
0.635GlnHis: 0.635 ± 0.101
2.431GlnIle: 2.431 ± 0.265
2.467GlnLys: 2.467 ± 0.293
2.812GlnLeu: 2.812 ± 0.201
0.889GlnMet: 0.889 ± 0.136
2.122GlnAsn: 2.122 ± 0.186
1.288GlnPro: 1.288 ± 0.151
1.633GlnGln: 1.633 ± 0.307
1.542GlnArg: 1.542 ± 0.186
2.521GlnSer: 2.521 ± 0.186
2.721GlnThr: 2.721 ± 0.244
2.902GlnVal: 2.902 ± 0.221
0.435GlnTrp: 0.435 ± 0.087
1.941GlnTyr: 1.941 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
2.558ArgAla: 2.558 ± 0.269
0.363ArgCys: 0.363 ± 0.084
2.449ArgAsp: 2.449 ± 0.193
2.485ArgGlu: 2.485 ± 0.273
1.995ArgPhe: 1.995 ± 0.187
3.12ArgGly: 3.12 ± 0.257
0.834ArgHis: 0.834 ± 0.124
2.957ArgIle: 2.957 ± 0.206
2.63ArgLys: 2.63 ± 0.321
3.573ArgLeu: 3.573 ± 0.241
1.106ArgMet: 1.106 ± 0.157
2.032ArgAsn: 2.032 ± 0.172
1.36ArgPro: 1.36 ± 0.15
1.669ArgGln: 1.669 ± 0.187
2.104ArgArg: 2.104 ± 0.299
2.231ArgSer: 2.231 ± 0.235
2.485ArgThr: 2.485 ± 0.219
2.83ArgVal: 2.83 ± 0.223
0.453ArgTrp: 0.453 ± 0.094
2.467ArgTyr: 2.467 ± 0.21
0.0ArgXaa: 0.0 ± 0.0
Ser
5.514SerAla: 5.514 ± 0.407
0.453SerCys: 0.453 ± 0.093
3.972SerAsp: 3.972 ± 0.322
3.356SerGlu: 3.356 ± 0.218
3.065SerPhe: 3.065 ± 0.247
7.636SerGly: 7.636 ± 0.644
0.798SerHis: 0.798 ± 0.12
4.063SerIle: 4.063 ± 0.442
3.592SerLys: 3.592 ± 0.333
4.988SerLeu: 4.988 ± 0.3
1.542SerMet: 1.542 ± 0.174
4.226SerAsn: 4.226 ± 0.372
2.34SerPro: 2.34 ± 0.254
2.394SerGln: 2.394 ± 0.193
2.594SerArg: 2.594 ± 0.248
5.659SerSer: 5.659 ± 0.534
5.442SerThr: 5.442 ± 0.4
4.861SerVal: 4.861 ± 0.396
0.744SerTrp: 0.744 ± 0.135
2.975SerTyr: 2.975 ± 0.214
0.0SerXaa: 0.0 ± 0.0
Thr
6.095ThrAla: 6.095 ± 0.661
0.417ThrCys: 0.417 ± 0.094
4.39ThrAsp: 4.39 ± 0.373
4.081ThrGlu: 4.081 ± 0.305
3.319ThrPhe: 3.319 ± 0.419
7.6ThrGly: 7.6 ± 0.722
0.871ThrHis: 0.871 ± 0.139
4.517ThrIle: 4.517 ± 0.374
3.755ThrLys: 3.755 ± 0.236
6.113ThrLeu: 6.113 ± 0.607
0.998ThrMet: 0.998 ± 0.117
4.644ThrAsn: 4.644 ± 0.483
2.993ThrPro: 2.993 ± 0.233
2.431ThrGln: 2.431 ± 0.222
2.703ThrArg: 2.703 ± 0.193
5.26ThrSer: 5.26 ± 0.474
6.24ThrThr: 6.24 ± 0.675
5.333ThrVal: 5.333 ± 0.585
0.816ThrTrp: 0.816 ± 0.119
2.757ThrTyr: 2.757 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
4.789ValAla: 4.789 ± 0.3
0.345ValCys: 0.345 ± 0.084
5.405ValAsp: 5.405 ± 0.474
4.535ValGlu: 4.535 ± 0.276
2.576ValPhe: 2.576 ± 0.24
5.151ValGly: 5.151 ± 0.349
0.889ValHis: 0.889 ± 0.159
4.063ValIle: 4.063 ± 0.38
3.954ValLys: 3.954 ± 0.297
3.882ValLeu: 3.882 ± 0.243
1.143ValMet: 1.143 ± 0.145
4.081ValAsn: 4.081 ± 0.33
2.648ValPro: 2.648 ± 0.279
2.412ValGln: 2.412 ± 0.209
2.685ValArg: 2.685 ± 0.22
4.843ValSer: 4.843 ± 0.252
5.551ValThr: 5.551 ± 0.557
4.335ValVal: 4.335 ± 0.323
0.671ValTrp: 0.671 ± 0.105
2.467ValTyr: 2.467 ± 0.244
0.0ValXaa: 0.0 ± 0.0
Trp
0.635TrpAla: 0.635 ± 0.092
0.091TrpCys: 0.091 ± 0.047
0.816TrpAsp: 0.816 ± 0.122
0.78TrpGlu: 0.78 ± 0.144
0.453TrpPhe: 0.453 ± 0.095
0.689TrpGly: 0.689 ± 0.126
0.327TrpHis: 0.327 ± 0.073
0.617TrpIle: 0.617 ± 0.118
0.943TrpLys: 0.943 ± 0.137
0.816TrpLeu: 0.816 ± 0.128
0.181TrpMet: 0.181 ± 0.065
0.726TrpAsn: 0.726 ± 0.116
0.254TrpPro: 0.254 ± 0.065
0.453TrpGln: 0.453 ± 0.091
0.508TrpArg: 0.508 ± 0.1
0.871TrpSer: 0.871 ± 0.11
0.762TrpThr: 0.762 ± 0.108
0.816TrpVal: 0.816 ± 0.127
0.127TrpTrp: 0.127 ± 0.046
0.472TrpTyr: 0.472 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.757TyrAla: 2.757 ± 0.187
0.526TyrCys: 0.526 ± 0.109
3.356TyrAsp: 3.356 ± 0.217
2.412TyrGlu: 2.412 ± 0.255
1.905TyrPhe: 1.905 ± 0.183
2.63TyrGly: 2.63 ± 0.234
0.49TyrHis: 0.49 ± 0.097
2.539TyrIle: 2.539 ± 0.212
2.159TyrLys: 2.159 ± 0.231
2.83TyrLeu: 2.83 ± 0.237
0.943TyrMet: 0.943 ± 0.154
2.558TyrAsn: 2.558 ± 0.207
1.524TyrPro: 1.524 ± 0.15
1.687TyrGln: 1.687 ± 0.177
2.358TyrArg: 2.358 ± 0.216
2.539TyrSer: 2.539 ± 0.264
2.685TyrThr: 2.685 ± 0.318
3.029TyrVal: 3.029 ± 0.251
0.453TyrTrp: 0.453 ± 0.109
1.759TyrTyr: 1.759 ± 0.221
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 201 proteins (55131 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski