Amino acid dipeptide frequency for Planktothrix agardhii (Oscillatoria agardhii)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
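As a minimal sketch of how such frequencies could be computed, the Python function below counts overlapping pairs of adjacent residues in a list of one-letter protein sequences. The function name `dipeptide_per_mille`, the mapping of non-standard letters to `X`, and the normalisation by the total number of dipeptide windows are assumptions made for illustration; the database may normalise differently.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of two residues; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (not real proteins), used only to exercise the function.
freqs = dipeptide_per_mille(["MALLA", "ALX"])
```

Counting within each sequence separately avoids creating artificial dipeptides at the boundary between two proteins.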

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
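The uniform expectation can be checked with a few lines of Python. This is only a sketch: the `classify` helper and its simple above/below comparison are assumptions, since the exact cut-off used for the red and blue marking is not stated here. The two example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids; 21 if Xaa is included

# Uniform expectation: every ordered pair equally likely.
expected_fraction = 1 / N_STANDARD ** 2          # 1/400
expected_per_mille = 1000 * expected_fraction    # same value in per mille


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 9.408, "CysCys": 0.227}
labels = {name: classify(value) for name, value in examples.items()}
```

A stricter baseline would multiply the single amino acid frequencies of the two residues instead of assuming that all residues are equally common.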

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
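For example, the unit conversion can be written as two small helpers. The function names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.227  # AlaAla, in per mille, from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
```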

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.227 ± 0.404
AlaCys: 0.818 ± 0.161
AlaAsp: 3.636 ± 0.278
AlaGlu: 4.25 ± 0.282
AlaPhe: 2.273 ± 0.272
AlaGly: 4.113 ± 0.359
AlaHis: 1.227 ± 0.141
AlaIle: 5.75 ± 0.363
AlaLys: 4.295 ± 0.351
AlaLeu: 6.84 ± 0.459
AlaMet: 1.273 ± 0.137
AlaAsn: 2.409 ± 0.204
AlaPro: 2.045 ± 0.197
AlaGln: 3.477 ± 0.347
AlaArg: 3.0 ± 0.309
AlaSer: 3.841 ± 0.285
AlaThr: 4.227 ± 0.296
AlaVal: 4.159 ± 0.329
AlaTrp: 1.114 ± 0.147
AlaTyr: 2.341 ± 0.252
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.568 ± 0.108
CysCys: 0.227 ± 0.071
CysAsp: 0.704 ± 0.117
CysGlu: 0.659 ± 0.1
CysPhe: 0.636 ± 0.122
CysGly: 0.75 ± 0.134
CysHis: 0.545 ± 0.121
CysIle: 0.545 ± 0.11
CysLys: 0.682 ± 0.155
CysLeu: 1.341 ± 0.162
CysMet: 0.227 ± 0.071
CysAsn: 0.5 ± 0.102
CysPro: 0.704 ± 0.134
CysGln: 1.023 ± 0.162
CysArg: 0.977 ± 0.146
CysSer: 0.954 ± 0.152
CysThr: 0.477 ± 0.093
CysVal: 0.591 ± 0.114
CysTrp: 0.114 ± 0.048
CysTyr: 0.568 ± 0.132
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.227 ± 0.228
AspCys: 0.864 ± 0.111
AspAsp: 2.659 ± 0.296
AspGlu: 3.772 ± 0.316
AspPhe: 2.432 ± 0.203
AspGly: 2.659 ± 0.315
AspHis: 1.045 ± 0.18
AspIle: 3.704 ± 0.319
AspLys: 3.136 ± 0.343
AspLeu: 6.318 ± 0.393
AspMet: 1.0 ± 0.17
AspAsn: 2.227 ± 0.191
AspPro: 2.545 ± 0.307
AspGln: 2.182 ± 0.231
AspArg: 3.159 ± 0.326
AspSer: 3.886 ± 0.311
AspThr: 2.273 ± 0.252
AspVal: 2.273 ± 0.229
AspTrp: 1.295 ± 0.134
AspTyr: 2.341 ± 0.236
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.318 ± 0.329
GluCys: 0.727 ± 0.127
GluAsp: 2.841 ± 0.25
GluGlu: 4.204 ± 0.322
GluPhe: 3.045 ± 0.267
GluGly: 3.295 ± 0.247
GluHis: 1.25 ± 0.159
GluIle: 5.068 ± 0.3
GluLys: 3.932 ± 0.259
GluLeu: 8.113 ± 0.494
GluMet: 1.864 ± 0.183
GluAsn: 3.363 ± 0.268
GluPro: 2.932 ± 0.279
GluGln: 4.409 ± 0.322
GluArg: 4.432 ± 0.394
GluSer: 4.5 ± 0.326
GluThr: 3.75 ± 0.29
GluVal: 4.363 ± 0.291
GluTrp: 1.318 ± 0.179
GluTyr: 2.409 ± 0.219
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.454 ± 0.215
PheCys: 0.523 ± 0.116
PheAsp: 2.318 ± 0.221
PheGlu: 2.409 ± 0.23
PhePhe: 1.182 ± 0.168
PheGly: 2.091 ± 0.246
PheHis: 1.023 ± 0.174
PheIle: 2.682 ± 0.233
PheLys: 2.204 ± 0.27
PheLeu: 4.045 ± 0.301
PheMet: 0.977 ± 0.158
PheAsn: 2.068 ± 0.209
PhePro: 2.023 ± 0.196
PheGln: 1.568 ± 0.193
PheArg: 1.727 ± 0.188
PheSer: 2.704 ± 0.229
PheThr: 2.227 ± 0.272
PheVal: 2.273 ± 0.209
PheTrp: 0.682 ± 0.142
PheTyr: 1.591 ± 0.19
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.522 ± 0.347
GlyCys: 1.114 ± 0.19
GlyAsp: 2.841 ± 0.426
GlyGlu: 4.545 ± 0.226
GlyPhe: 2.682 ± 0.241
GlyGly: 3.727 ± 0.344
GlyHis: 1.0 ± 0.158
GlyIle: 4.909 ± 0.643
GlyLys: 3.636 ± 0.349
GlyLeu: 6.295 ± 0.407
GlyMet: 1.182 ± 0.21
GlyAsn: 2.545 ± 0.24
GlyPro: 0.773 ± 0.154
GlyGln: 2.704 ± 0.229
GlyArg: 2.886 ± 0.31
GlySer: 3.636 ± 0.283
GlyThr: 3.091 ± 0.271
GlyVal: 4.182 ± 0.279
GlyTrp: 0.886 ± 0.139
GlyTyr: 2.159 ± 0.21
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.727 ± 0.12
HisCys: 0.386 ± 0.107
HisAsp: 0.909 ± 0.141
HisGlu: 1.0 ± 0.162
HisPhe: 0.954 ± 0.154
HisGly: 1.045 ± 0.162
HisHis: 0.818 ± 0.149
HisIle: 1.227 ± 0.161
HisLys: 0.841 ± 0.136
HisLeu: 2.75 ± 0.287
HisMet: 0.136 ± 0.061
HisAsn: 0.909 ± 0.138
HisPro: 1.432 ± 0.205
HisGln: 1.432 ± 0.3
HisArg: 1.386 ± 0.165
HisSer: 1.614 ± 0.182
HisThr: 1.023 ± 0.187
HisVal: 0.909 ± 0.142
HisTrp: 0.5 ± 0.121
HisTyr: 0.954 ± 0.134
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.931 ± 0.435
IleCys: 1.023 ± 0.146
IleAsp: 4.272 ± 0.262
IleGlu: 5.977 ± 0.433
IlePhe: 2.363 ± 0.237
IleGly: 3.432 ± 0.303
IleHis: 1.659 ± 0.26
IleIle: 4.25 ± 0.392
IleLys: 4.295 ± 0.375
IleLeu: 6.431 ± 0.412
IleMet: 0.75 ± 0.131
IleAsn: 3.591 ± 0.313
IlePro: 4.068 ± 0.288
IleGln: 3.704 ± 0.364
IleArg: 2.977 ± 0.325
IleSer: 4.659 ± 0.287
IleThr: 3.227 ± 0.219
IleVal: 4.045 ± 0.315
IleTrp: 0.909 ± 0.148
IleTyr: 2.341 ± 0.201
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.091 ± 0.379
LysCys: 0.5 ± 0.105
LysAsp: 2.909 ± 0.335
LysGlu: 3.545 ± 0.31
LysPhe: 2.318 ± 0.285
LysGly: 3.636 ± 0.497
LysHis: 1.136 ± 0.138
LysIle: 4.432 ± 0.441
LysLys: 3.522 ± 0.341
LysLeu: 6.045 ± 0.377
LysMet: 0.909 ± 0.181
LysAsn: 2.977 ± 0.277
LysPro: 2.818 ± 0.219
LysGln: 2.932 ± 0.286
LysArg: 3.0 ± 0.249
LysSer: 3.932 ± 0.424
LysThr: 3.409 ± 0.217
LysVal: 3.75 ± 0.38
LysTrp: 0.75 ± 0.127
LysTyr: 1.864 ± 0.236
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.499 ± 0.39
LeuCys: 1.364 ± 0.153
LeuAsp: 6.318 ± 0.377
LeuGlu: 8.84 ± 0.543
LeuPhe: 3.409 ± 0.27
LeuGly: 6.0 ± 0.318
LeuHis: 2.182 ± 0.199
LeuIle: 6.613 ± 0.446
LeuLys: 7.568 ± 0.415
LeuLeu: 9.408 ± 0.495
LeuMet: 2.068 ± 0.202
LeuAsn: 5.431 ± 0.319
LeuPro: 5.295 ± 0.528
LeuGln: 4.795 ± 0.331
LeuArg: 5.409 ± 0.374
LeuSer: 7.454 ± 0.409
LeuThr: 6.386 ± 0.367
LeuVal: 6.772 ± 0.344
LeuTrp: 1.182 ± 0.163
LeuTyr: 3.159 ± 0.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.432 ± 0.172
MetCys: 0.114 ± 0.048
MetAsp: 0.636 ± 0.13
MetGlu: 1.159 ± 0.119
MetPhe: 0.432 ± 0.081
MetGly: 1.386 ± 0.186
MetHis: 0.205 ± 0.062
MetIle: 1.204 ± 0.167
MetLys: 1.295 ± 0.174
MetLeu: 1.432 ± 0.198
MetMet: 0.455 ± 0.085
MetAsn: 1.0 ± 0.187
MetPro: 0.909 ± 0.121
MetGln: 0.795 ± 0.131
MetArg: 0.977 ± 0.186
MetSer: 1.114 ± 0.14
MetThr: 1.159 ± 0.192
MetVal: 1.454 ± 0.177
MetTrp: 0.205 ± 0.056
MetTyr: 0.364 ± 0.098
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.363 ± 0.278
AsnCys: 0.727 ± 0.115
AsnAsp: 2.159 ± 0.299
AsnGlu: 2.318 ± 0.213
AsnPhe: 1.818 ± 0.187
AsnGly: 2.113 ± 0.184
AsnHis: 1.159 ± 0.168
AsnIle: 2.909 ± 0.272
AsnLys: 2.295 ± 0.296
AsnLeu: 5.159 ± 0.286
AsnMet: 0.795 ± 0.139
AsnAsn: 3.068 ± 0.302
AsnPro: 3.682 ± 0.335
AsnGln: 3.386 ± 0.267
AsnArg: 2.682 ± 0.275
AsnSer: 3.204 ± 0.245
AsnThr: 1.977 ± 0.201
AsnVal: 2.204 ± 0.198
AsnTrp: 1.114 ± 0.151
AsnTyr: 2.25 ± 0.213
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.704 ± 0.25
ProCys: 0.318 ± 0.071
ProAsp: 3.613 ± 0.26
ProGlu: 4.591 ± 0.342
ProPhe: 2.023 ± 0.24
ProGly: 2.841 ± 0.266
ProHis: 0.886 ± 0.141
ProIle: 2.841 ± 0.249
ProLys: 2.863 ± 0.261
ProLeu: 4.591 ± 0.297
ProMet: 0.841 ± 0.152
ProAsn: 2.159 ± 0.21
ProPro: 2.068 ± 0.322
ProGln: 2.932 ± 0.313
ProArg: 1.659 ± 0.211
ProSer: 3.068 ± 0.239
ProThr: 3.227 ± 0.354
ProVal: 2.318 ± 0.228
ProTrp: 0.523 ± 0.105
ProTyr: 1.273 ± 0.18
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.727 ± 0.329
GlnCys: 0.477 ± 0.117
GlnAsp: 2.386 ± 0.25
GlnGlu: 3.909 ± 0.29
GlnPhe: 1.977 ± 0.194
GlnGly: 3.295 ± 0.419
GlnHis: 0.954 ± 0.163
GlnIle: 4.113 ± 0.303
GlnLys: 3.25 ± 0.284
GlnLeu: 6.681 ± 0.479
GlnMet: 1.0 ± 0.161
GlnAsn: 2.409 ± 0.287
GlnPro: 2.091 ± 0.231
GlnGln: 3.5 ± 0.247
GlnArg: 2.659 ± 0.247
GlnSer: 3.613 ± 0.283
GlnThr: 3.023 ± 0.325
GlnVal: 3.023 ± 0.272
GlnTrp: 0.864 ± 0.174
GlnTyr: 1.545 ± 0.234
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.0 ± 0.275
ArgCys: 0.591 ± 0.14
ArgAsp: 2.5 ± 0.218
ArgGlu: 3.636 ± 0.408
ArgPhe: 2.318 ± 0.233
ArgGly: 2.818 ± 0.337
ArgHis: 1.273 ± 0.129
ArgIle: 3.75 ± 0.304
ArgLys: 2.341 ± 0.247
ArgLeu: 6.204 ± 0.554
ArgMet: 0.636 ± 0.111
ArgAsn: 2.454 ± 0.245
ArgPro: 2.113 ± 0.24
ArgGln: 3.295 ± 0.278
ArgArg: 2.545 ± 0.314
ArgSer: 2.954 ± 0.248
ArgThr: 2.363 ± 0.251
ArgVal: 2.886 ± 0.235
ArgTrp: 0.704 ± 0.139
ArgTyr: 1.932 ± 0.249
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.045 ± 0.337
SerCys: 0.704 ± 0.117
SerAsp: 3.954 ± 0.247
SerGlu: 5.022 ± 0.355
SerPhe: 2.5 ± 0.232
SerGly: 4.227 ± 0.34
SerHis: 1.25 ± 0.171
SerIle: 4.363 ± 0.293
SerLys: 3.318 ± 0.352
SerLeu: 7.499 ± 0.329
SerMet: 1.182 ± 0.16
SerAsn: 3.068 ± 0.28
SerPro: 3.909 ± 0.338
SerGln: 3.977 ± 0.329
SerArg: 2.704 ± 0.238
SerSer: 4.227 ± 0.341
SerThr: 3.204 ± 0.267
SerVal: 3.545 ± 0.262
SerTrp: 0.773 ± 0.115
SerTyr: 2.0 ± 0.221
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.432 ± 0.294
ThrCys: 0.614 ± 0.121
ThrAsp: 2.5 ± 0.222
ThrGlu: 3.522 ± 0.267
ThrPhe: 1.795 ± 0.214
ThrGly: 4.204 ± 0.394
ThrHis: 0.909 ± 0.127
ThrIle: 3.977 ± 0.31
ThrLys: 2.682 ± 0.24
ThrLeu: 6.59 ± 0.348
ThrMet: 0.614 ± 0.129
ThrAsn: 1.886 ± 0.21
ThrPro: 3.227 ± 0.269
ThrGln: 2.636 ± 0.269
ThrArg: 2.091 ± 0.189
ThrSer: 3.318 ± 0.331
ThrThr: 2.659 ± 0.239
ThrVal: 3.477 ± 0.253
ThrTrp: 0.773 ± 0.118
ThrTyr: 1.682 ± 0.199
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.386 ± 0.368
ValCys: 1.0 ± 0.148
ValAsp: 3.25 ± 0.284
ValGlu: 4.363 ± 0.307
ValPhe: 2.386 ± 0.223
ValGly: 3.454 ± 0.3
ValHis: 1.159 ± 0.185
ValIle: 4.704 ± 0.299
ValLys: 3.909 ± 0.285
ValLeu: 5.841 ± 0.324
ValMet: 1.045 ± 0.156
ValAsn: 3.0 ± 0.277
ValPro: 2.477 ± 0.234
ValGln: 2.295 ± 0.227
ValArg: 3.295 ± 0.313
ValSer: 3.409 ± 0.197
ValThr: 2.613 ± 0.229
ValVal: 3.704 ± 0.343
ValTrp: 0.932 ± 0.137
ValTyr: 1.773 ± 0.243
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.886 ± 0.131
TrpCys: 0.25 ± 0.074
TrpAsp: 0.841 ± 0.139
TrpGlu: 1.227 ± 0.151
TrpPhe: 0.659 ± 0.091
TrpGly: 1.023 ± 0.135
TrpHis: 0.523 ± 0.116
TrpIle: 0.886 ± 0.157
TrpLys: 1.023 ± 0.156
TrpLeu: 1.864 ± 0.172
TrpMet: 0.273 ± 0.07
TrpAsn: 0.909 ± 0.133
TrpPro: 0.227 ± 0.066
TrpGln: 1.136 ± 0.184
TrpArg: 0.614 ± 0.109
TrpSer: 0.841 ± 0.182
TrpThr: 0.5 ± 0.104
TrpVal: 1.25 ± 0.166
TrpTrp: 0.273 ± 0.083
TrpTyr: 0.386 ± 0.082
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.773 ± 0.186
TyrCys: 0.455 ± 0.095
TyrAsp: 1.75 ± 0.22
TyrGlu: 2.182 ± 0.26
TyrPhe: 1.568 ± 0.189
TyrGly: 2.204 ± 0.158
TyrHis: 0.75 ± 0.12
TyrIle: 1.977 ± 0.238
TyrLys: 1.409 ± 0.208
TyrLeu: 3.704 ± 0.338
TyrMet: 0.477 ± 0.14
TyrAsn: 1.409 ± 0.188
TyrPro: 2.0 ± 0.189
TyrGln: 2.318 ± 0.243
TyrArg: 2.136 ± 0.242
TyrSer: 2.545 ± 0.242
TyrThr: 1.773 ± 0.168
TyrVal: 1.818 ± 0.197
TyrTrp: 0.682 ± 0.113
TyrTyr: 1.364 ± 0.21
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 147 proteins (44004 amino acids)
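To work with the table programmatically, each entry of the form `AlaAla: 4.227 ± 0.404` can be parsed into a nested dictionary. The sketch below is one possible approach; the names `ENTRY`, `parse_entry` and `build_table` are hypothetical, and the regular expression simply skips anything that precedes the six-letter dipeptide label on a line.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error), or None if the line is no entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def build_table(lines):
    """Collect entries as table[first][second] = (value, error), in per mille."""
    table = {}
    for line in lines:
        parsed = parse_entry(line)
        if parsed is not None:
            first, second, value, error = parsed
            table.setdefault(first, {})[second] = (value, error)
    return table


# A few lines copied from the table above.
table = build_table(["Leu", "LeuLeu: 9.408 ± 0.495", "LeuTrp: 1.182 ± 0.163"])
```

Lines that carry only a residue name, such as the row labels, are ignored because they contain no colon-separated value.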

Note: The error has been estimated by bootstrapping (×100) at the protein level.
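Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. The following Python sketch illustrates the idea for a single dipeptide. It assumes that the reported error is the standard deviation across replicates, which is not stated here, and the function names and the fixed seed are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (not real proteins), used only to exercise the functions.
toy = ["AAAA", "LLLL", "ALAL"]
error = bootstrap_error(toy, "AL", n_boot=100, seed=1)
```

Resampling whole proteins keeps the dependence between neighbouring residues within a protein intact, which resampling individual dipeptides would not.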

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski