Amino acid dipepetide frequency for Cyanophage S-RIM12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.572AlaAla: 6.572 ± 0.51
0.575AlaCys: 0.575 ± 0.118
3.843AlaAsp: 3.843 ± 0.291
4.076AlaGlu: 4.076 ± 0.312
2.981AlaPhe: 2.981 ± 0.24
6.824AlaGly: 6.824 ± 0.562
0.7AlaHis: 0.7 ± 0.102
4.471AlaIle: 4.471 ± 0.302
3.645AlaLys: 3.645 ± 0.336
4.938AlaLeu: 4.938 ± 0.314
1.365AlaMet: 1.365 ± 0.216
4.148AlaAsn: 4.148 ± 0.338
2.352AlaPro: 2.352 ± 0.197
2.55AlaGln: 2.55 ± 0.226
2.73AlaArg: 2.73 ± 0.251
5.477AlaSer: 5.477 ± 0.367
5.746AlaThr: 5.746 ± 0.59
4.938AlaVal: 4.938 ± 0.382
0.646AlaTrp: 0.646 ± 0.125
2.191AlaTyr: 2.191 ± 0.182
0.0AlaXaa: 0.0 ± 0.0
Cys
0.611CysAla: 0.611 ± 0.104
0.072CysCys: 0.072 ± 0.037
0.611CysAsp: 0.611 ± 0.124
0.503CysGlu: 0.503 ± 0.104
0.521CysPhe: 0.521 ± 0.118
0.575CysGly: 0.575 ± 0.113
0.215CysHis: 0.215 ± 0.06
0.467CysIle: 0.467 ± 0.105
0.646CysLys: 0.646 ± 0.113
0.629CysLeu: 0.629 ± 0.13
0.233CysMet: 0.233 ± 0.073
0.413CysAsn: 0.413 ± 0.091
0.287CysPro: 0.287 ± 0.081
0.323CysGln: 0.323 ± 0.083
0.395CysArg: 0.395 ± 0.104
0.593CysSer: 0.593 ± 0.096
0.575CysThr: 0.575 ± 0.106
0.449CysVal: 0.449 ± 0.109
0.144CysTrp: 0.144 ± 0.056
0.251CysTyr: 0.251 ± 0.069
0.0CysXaa: 0.0 ± 0.0
Asp
5.369AspAla: 5.369 ± 0.345
0.503AspCys: 0.503 ± 0.108
4.184AspAsp: 4.184 ± 0.308
3.861AspGlu: 3.861 ± 0.349
2.927AspPhe: 2.927 ± 0.285
6.07AspGly: 6.07 ± 0.456
0.593AspHis: 0.593 ± 0.111
4.112AspIle: 4.112 ± 0.374
3.358AspLys: 3.358 ± 0.337
4.274AspLeu: 4.274 ± 0.273
1.544AspMet: 1.544 ± 0.164
3.717AspAsn: 3.717 ± 0.397
2.837AspPro: 2.837 ± 0.223
2.083AspGln: 2.083 ± 0.185
2.46AspArg: 2.46 ± 0.234
4.759AspSer: 4.759 ± 0.265
4.507AspThr: 4.507 ± 0.371
4.13AspVal: 4.13 ± 0.304
0.916AspTrp: 0.916 ± 0.15
3.071AspTyr: 3.071 ± 0.218
0.0AspXaa: 0.0 ± 0.0
Glu
2.999GluAla: 2.999 ± 0.287
0.826GluCys: 0.826 ± 0.14
4.148GluAsp: 4.148 ± 0.332
4.525GluGlu: 4.525 ± 0.402
3.196GluPhe: 3.196 ± 0.209
3.933GluGly: 3.933 ± 0.324
0.844GluHis: 0.844 ± 0.139
3.843GluIle: 3.843 ± 0.322
3.466GluLys: 3.466 ± 0.394
5.118GluLeu: 5.118 ± 0.4
1.706GluMet: 1.706 ± 0.231
3.178GluAsn: 3.178 ± 0.242
1.526GluPro: 1.526 ± 0.146
2.352GluGln: 2.352 ± 0.22
2.855GluArg: 2.855 ± 0.326
3.538GluSer: 3.538 ± 0.321
4.346GluThr: 4.346 ± 0.274
4.489GluVal: 4.489 ± 0.24
0.736GluTrp: 0.736 ± 0.15
2.945GluTyr: 2.945 ± 0.23
0.0GluXaa: 0.0 ± 0.0
Phe
2.64PheAla: 2.64 ± 0.211
0.521PheCys: 0.521 ± 0.117
3.52PheAsp: 3.52 ± 0.247
2.64PheGlu: 2.64 ± 0.214
1.724PhePhe: 1.724 ± 0.188
3.125PheGly: 3.125 ± 0.276
0.467PheHis: 0.467 ± 0.118
2.712PheIle: 2.712 ± 0.227
2.263PheLys: 2.263 ± 0.199
3.089PheLeu: 3.089 ± 0.329
0.988PheMet: 0.988 ± 0.152
2.873PheAsn: 2.873 ± 0.217
1.652PhePro: 1.652 ± 0.206
1.778PheGln: 1.778 ± 0.14
1.473PheArg: 1.473 ± 0.165
3.125PheSer: 3.125 ± 0.24
3.358PheThr: 3.358 ± 0.265
2.855PheVal: 2.855 ± 0.234
0.233PheTrp: 0.233 ± 0.06
1.76PheTyr: 1.76 ± 0.137
0.0PheXaa: 0.0 ± 0.0
Gly
6.644GlyAla: 6.644 ± 0.552
0.629GlyCys: 0.629 ± 0.11
5.046GlyAsp: 5.046 ± 0.403
4.364GlyGlu: 4.364 ± 0.321
3.286GlyPhe: 3.286 ± 0.308
7.919GlyGly: 7.919 ± 0.906
0.988GlyHis: 0.988 ± 0.134
4.112GlyIle: 4.112 ± 0.333
3.681GlyLys: 3.681 ± 0.339
4.561GlyLeu: 4.561 ± 0.344
1.634GlyMet: 1.634 ± 0.267
4.436GlyAsn: 4.436 ± 0.371
1.939GlyPro: 1.939 ± 0.242
2.658GlyGln: 2.658 ± 0.206
2.747GlyArg: 2.747 ± 0.239
6.878GlySer: 6.878 ± 0.597
7.345GlyThr: 7.345 ± 0.681
5.244GlyVal: 5.244 ± 0.322
1.077GlyTrp: 1.077 ± 0.126
3.538GlyTyr: 3.538 ± 0.347
0.0GlyXaa: 0.0 ± 0.0
His
0.736HisAla: 0.736 ± 0.129
0.215HisCys: 0.215 ± 0.071
0.88HisAsp: 0.88 ± 0.131
0.557HisGlu: 0.557 ± 0.131
0.7HisPhe: 0.7 ± 0.134
0.934HisGly: 0.934 ± 0.137
0.341HisHis: 0.341 ± 0.092
0.808HisIle: 0.808 ± 0.141
0.898HisLys: 0.898 ± 0.144
0.898HisLeu: 0.898 ± 0.148
0.323HisMet: 0.323 ± 0.07
0.629HisAsn: 0.629 ± 0.111
0.79HisPro: 0.79 ± 0.125
0.395HisGln: 0.395 ± 0.091
0.539HisArg: 0.539 ± 0.116
0.88HisSer: 0.88 ± 0.125
0.898HisThr: 0.898 ± 0.133
0.916HisVal: 0.916 ± 0.141
0.215HisTrp: 0.215 ± 0.067
0.772HisTyr: 0.772 ± 0.154
0.0HisXaa: 0.0 ± 0.0
Ile
4.471IleAla: 4.471 ± 0.357
0.575IleCys: 0.575 ± 0.114
4.31IleAsp: 4.31 ± 0.368
4.112IleGlu: 4.112 ± 0.286
2.532IlePhe: 2.532 ± 0.188
4.094IleGly: 4.094 ± 0.311
0.575IleHis: 0.575 ± 0.108
3.915IleIle: 3.915 ± 0.349
3.897IleLys: 3.897 ± 0.323
4.256IleLeu: 4.256 ± 0.266
1.113IleMet: 1.113 ± 0.21
4.058IleAsn: 4.058 ± 0.231
3.053IlePro: 3.053 ± 0.279
2.406IleGln: 2.406 ± 0.215
2.101IleArg: 2.101 ± 0.186
4.795IleSer: 4.795 ± 0.542
5.854IleThr: 5.854 ± 0.476
4.184IleVal: 4.184 ± 0.337
0.664IleTrp: 0.664 ± 0.106
2.011IleTyr: 2.011 ± 0.21
0.0IleXaa: 0.0 ± 0.0
Lys
3.178LysAla: 3.178 ± 0.316
0.629LysCys: 0.629 ± 0.11
3.43LysAsp: 3.43 ± 0.33
3.897LysGlu: 3.897 ± 0.5
2.299LysPhe: 2.299 ± 0.249
3.161LysGly: 3.161 ± 0.302
0.916LysHis: 0.916 ± 0.155
3.592LysIle: 3.592 ± 0.356
4.382LysLys: 4.382 ± 0.555
4.471LysLeu: 4.471 ± 0.387
1.437LysMet: 1.437 ± 0.226
3.34LysAsn: 3.34 ± 0.277
1.903LysPro: 1.903 ± 0.248
2.083LysGln: 2.083 ± 0.263
2.281LysArg: 2.281 ± 0.27
4.058LysSer: 4.058 ± 0.352
3.502LysThr: 3.502 ± 0.264
4.022LysVal: 4.022 ± 0.258
0.754LysTrp: 0.754 ± 0.16
2.801LysTyr: 2.801 ± 0.308
0.0LysXaa: 0.0 ± 0.0
Leu
4.831LeuAla: 4.831 ± 0.327
0.718LeuCys: 0.718 ± 0.149
5.531LeuAsp: 5.531 ± 0.383
4.436LeuGlu: 4.436 ± 0.387
2.37LeuPhe: 2.37 ± 0.171
4.813LeuGly: 4.813 ± 0.411
1.275LeuHis: 1.275 ± 0.186
3.825LeuIle: 3.825 ± 0.26
5.046LeuLys: 5.046 ± 0.382
5.28LeuLeu: 5.28 ± 0.39
1.383LeuMet: 1.383 ± 0.206
4.489LeuAsn: 4.489 ± 0.238
2.981LeuPro: 2.981 ± 0.257
2.855LeuGln: 2.855 ± 0.262
3.394LeuArg: 3.394 ± 0.282
5.351LeuSer: 5.351 ± 0.29
5.459LeuThr: 5.459 ± 0.458
4.364LeuVal: 4.364 ± 0.293
0.754LeuTrp: 0.754 ± 0.137
3.358LeuTyr: 3.358 ± 0.246
0.0LeuXaa: 0.0 ± 0.0
Met
1.598MetAla: 1.598 ± 0.256
0.126MetCys: 0.126 ± 0.052
1.239MetAsp: 1.239 ± 0.188
1.257MetGlu: 1.257 ± 0.213
0.862MetPhe: 0.862 ± 0.143
1.311MetGly: 1.311 ± 0.23
0.521MetHis: 0.521 ± 0.118
1.113MetIle: 1.113 ± 0.156
1.526MetLys: 1.526 ± 0.27
1.562MetLeu: 1.562 ± 0.227
0.664MetMet: 0.664 ± 0.142
1.347MetAsn: 1.347 ± 0.194
1.042MetPro: 1.042 ± 0.167
0.952MetGln: 0.952 ± 0.185
1.042MetArg: 1.042 ± 0.206
1.652MetSer: 1.652 ± 0.233
1.544MetThr: 1.544 ± 0.238
0.952MetVal: 0.952 ± 0.136
0.287MetTrp: 0.287 ± 0.087
0.611MetTyr: 0.611 ± 0.115
0.0MetXaa: 0.0 ± 0.0
Asn
3.951AsnAla: 3.951 ± 0.316
0.305AsnCys: 0.305 ± 0.069
3.322AsnAsp: 3.322 ± 0.242
3.304AsnGlu: 3.304 ± 0.221
2.963AsnPhe: 2.963 ± 0.202
4.579AsnGly: 4.579 ± 0.347
0.646AsnHis: 0.646 ± 0.121
4.256AsnIle: 4.256 ± 0.355
2.676AsnLys: 2.676 ± 0.27
5.046AsnLeu: 5.046 ± 0.378
0.844AsnMet: 0.844 ± 0.157
3.178AsnAsn: 3.178 ± 0.338
3.017AsnPro: 3.017 ± 0.247
2.137AsnGln: 2.137 ± 0.164
2.137AsnArg: 2.137 ± 0.162
4.005AsnSer: 4.005 ± 0.288
4.274AsnThr: 4.274 ± 0.41
4.382AsnVal: 4.382 ± 0.31
0.916AsnTrp: 0.916 ± 0.13
2.37AsnTyr: 2.37 ± 0.178
0.0AsnXaa: 0.0 ± 0.0
Pro
2.514ProAla: 2.514 ± 0.225
0.251ProCys: 0.251 ± 0.077
2.406ProAsp: 2.406 ± 0.284
2.819ProGlu: 2.819 ± 0.222
1.58ProPhe: 1.58 ± 0.16
3.035ProGly: 3.035 ± 0.278
0.629ProHis: 0.629 ± 0.087
2.55ProIle: 2.55 ± 0.26
1.993ProLys: 1.993 ± 0.249
2.424ProLeu: 2.424 ± 0.178
0.7ProMet: 0.7 ± 0.137
2.281ProAsn: 2.281 ± 0.211
1.437ProPro: 1.437 ± 0.18
1.293ProGln: 1.293 ± 0.162
1.329ProArg: 1.329 ± 0.155
2.999ProSer: 2.999 ± 0.229
2.945ProThr: 2.945 ± 0.254
2.514ProVal: 2.514 ± 0.222
0.575ProTrp: 0.575 ± 0.103
1.652ProTyr: 1.652 ± 0.173
0.0ProXaa: 0.0 ± 0.0
Gln
2.299GlnAla: 2.299 ± 0.23
0.305GlnCys: 0.305 ± 0.088
2.173GlnAsp: 2.173 ± 0.186
2.64GlnGlu: 2.64 ± 0.287
1.473GlnPhe: 1.473 ± 0.177
2.676GlnGly: 2.676 ± 0.269
0.593GlnHis: 0.593 ± 0.106
2.64GlnIle: 2.64 ± 0.262
2.299GlnLys: 2.299 ± 0.269
3.286GlnLeu: 3.286 ± 0.223
0.916GlnMet: 0.916 ± 0.175
1.67GlnAsn: 1.67 ± 0.183
1.329GlnPro: 1.329 ± 0.126
1.652GlnGln: 1.652 ± 0.198
1.437GlnArg: 1.437 ± 0.176
2.299GlnSer: 2.299 ± 0.27
2.263GlnThr: 2.263 ± 0.216
2.712GlnVal: 2.712 ± 0.246
0.593GlnTrp: 0.593 ± 0.106
1.993GlnTyr: 1.993 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
2.442ArgAla: 2.442 ± 0.212
0.215ArgCys: 0.215 ± 0.053
2.191ArgAsp: 2.191 ± 0.184
2.352ArgGlu: 2.352 ± 0.313
1.634ArgPhe: 1.634 ± 0.174
2.801ArgGly: 2.801 ± 0.281
0.611ArgHis: 0.611 ± 0.107
2.658ArgIle: 2.658 ± 0.249
2.747ArgLys: 2.747 ± 0.383
3.52ArgLeu: 3.52 ± 0.232
1.095ArgMet: 1.095 ± 0.184
2.227ArgAsn: 2.227 ± 0.208
1.185ArgPro: 1.185 ± 0.138
1.688ArgGln: 1.688 ± 0.182
1.868ArgArg: 1.868 ± 0.204
2.263ArgSer: 2.263 ± 0.226
2.496ArgThr: 2.496 ± 0.253
2.963ArgVal: 2.963 ± 0.294
0.539ArgTrp: 0.539 ± 0.107
2.101ArgTyr: 2.101 ± 0.205
0.0ArgXaa: 0.0 ± 0.0
Ser
5.639SerAla: 5.639 ± 0.339
0.431SerCys: 0.431 ± 0.107
4.256SerAsp: 4.256 ± 0.35
3.609SerGlu: 3.609 ± 0.24
3.556SerPhe: 3.556 ± 0.346
7.434SerGly: 7.434 ± 0.581
0.898SerHis: 0.898 ± 0.142
4.669SerIle: 4.669 ± 0.38
3.753SerLys: 3.753 ± 0.305
4.992SerLeu: 4.992 ± 0.298
1.329SerMet: 1.329 ± 0.219
4.148SerAsn: 4.148 ± 0.325
2.676SerPro: 2.676 ± 0.292
2.604SerGln: 2.604 ± 0.219
2.55SerArg: 2.55 ± 0.215
6.177SerSer: 6.177 ± 0.553
6.339SerThr: 6.339 ± 0.506
4.992SerVal: 4.992 ± 0.483
0.611SerTrp: 0.611 ± 0.096
2.963SerTyr: 2.963 ± 0.238
0.0SerXaa: 0.0 ± 0.0
Thr
6.537ThrAla: 6.537 ± 0.571
0.449ThrCys: 0.449 ± 0.097
4.543ThrAsp: 4.543 ± 0.335
4.256ThrGlu: 4.256 ± 0.289
3.196ThrPhe: 3.196 ± 0.358
6.842ThrGly: 6.842 ± 0.707
0.88ThrHis: 0.88 ± 0.159
5.567ThrIle: 5.567 ± 0.384
3.286ThrLys: 3.286 ± 0.261
6.231ThrLeu: 6.231 ± 0.507
1.221ThrMet: 1.221 ± 0.175
4.471ThrAsn: 4.471 ± 0.452
3.34ThrPro: 3.34 ± 0.279
2.622ThrGln: 2.622 ± 0.189
2.586ThrArg: 2.586 ± 0.213
5.908ThrSer: 5.908 ± 0.612
6.788ThrThr: 6.788 ± 0.652
6.124ThrVal: 6.124 ± 0.719
0.629ThrTrp: 0.629 ± 0.096
2.765ThrTyr: 2.765 ± 0.184
0.0ThrXaa: 0.0 ± 0.0
Val
4.615ValAla: 4.615 ± 0.372
0.467ValCys: 0.467 ± 0.107
5.639ValAsp: 5.639 ± 0.42
4.13ValGlu: 4.13 ± 0.25
2.676ValPhe: 2.676 ± 0.235
5.441ValGly: 5.441 ± 0.56
0.79ValHis: 0.79 ± 0.129
4.22ValIle: 4.22 ± 0.318
3.376ValLys: 3.376 ± 0.27
4.22ValLeu: 4.22 ± 0.324
1.293ValMet: 1.293 ± 0.205
4.148ValAsn: 4.148 ± 0.326
2.783ValPro: 2.783 ± 0.223
2.622ValGln: 2.622 ± 0.187
2.873ValArg: 2.873 ± 0.295
5.495ValSer: 5.495 ± 0.322
5.944ValThr: 5.944 ± 0.65
4.723ValVal: 4.723 ± 0.393
0.664ValTrp: 0.664 ± 0.086
2.532ValTyr: 2.532 ± 0.239
0.0ValXaa: 0.0 ± 0.0
Trp
0.826TrpAla: 0.826 ± 0.099
0.162TrpCys: 0.162 ± 0.06
0.754TrpAsp: 0.754 ± 0.136
0.7TrpGlu: 0.7 ± 0.14
0.467TrpPhe: 0.467 ± 0.099
0.646TrpGly: 0.646 ± 0.1
0.305TrpHis: 0.305 ± 0.087
0.629TrpIle: 0.629 ± 0.11
0.88TrpLys: 0.88 ± 0.154
0.646TrpLeu: 0.646 ± 0.111
0.341TrpMet: 0.341 ± 0.085
0.934TrpAsn: 0.934 ± 0.162
0.162TrpPro: 0.162 ± 0.048
0.485TrpGln: 0.485 ± 0.089
0.539TrpArg: 0.539 ± 0.092
0.772TrpSer: 0.772 ± 0.109
0.844TrpThr: 0.844 ± 0.118
0.934TrpVal: 0.934 ± 0.128
0.126TrpTrp: 0.126 ± 0.047
0.413TrpTyr: 0.413 ± 0.076
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.317TyrAla: 2.317 ± 0.163
0.539TyrCys: 0.539 ± 0.107
3.25TyrAsp: 3.25 ± 0.266
2.532TyrGlu: 2.532 ± 0.22
1.85TyrPhe: 1.85 ± 0.162
2.514TyrGly: 2.514 ± 0.265
0.539TyrHis: 0.539 ± 0.09
2.819TyrIle: 2.819 ± 0.287
2.317TyrLys: 2.317 ± 0.241
3.071TyrLeu: 3.071 ± 0.246
1.059TyrMet: 1.059 ± 0.18
2.622TyrAsn: 2.622 ± 0.202
1.634TyrPro: 1.634 ± 0.159
1.706TyrGln: 1.706 ± 0.184
2.263TyrArg: 2.263 ± 0.228
2.604TyrSer: 2.604 ± 0.226
3.214TyrThr: 3.214 ± 0.375
2.765TyrVal: 2.765 ± 0.203
0.485TyrTrp: 0.485 ± 0.11
1.939TyrTyr: 1.939 ± 0.191
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 212 proteins (55688 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski