Amino acid dipepetide frequency for Nodularia phage vB_NspS-kac65v151

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.185AlaAla: 6.185 ± 0.572
0.409AlaCys: 0.409 ± 0.102
4.025AlaAsp: 4.025 ± 0.279
5.366AlaGlu: 5.366 ± 0.556
2.206AlaPhe: 2.206 ± 0.24
4.297AlaGly: 4.297 ± 0.453
0.796AlaHis: 0.796 ± 0.15
6.003AlaIle: 6.003 ± 0.347
5.616AlaLys: 5.616 ± 0.507
6.071AlaLeu: 6.071 ± 0.508
1.592AlaMet: 1.592 ± 0.174
3.297AlaAsn: 3.297 ± 0.227
2.729AlaPro: 2.729 ± 0.351
3.297AlaGln: 3.297 ± 0.395
3.183AlaArg: 3.183 ± 0.272
4.752AlaSer: 4.752 ± 0.416
4.798AlaThr: 4.798 ± 0.429
5.161AlaVal: 5.161 ± 0.338
0.841AlaTrp: 0.841 ± 0.149
1.796AlaTyr: 1.796 ± 0.164
0.0AlaXaa: 0.0 ± 0.0
Cys
0.409CysAla: 0.409 ± 0.101
0.205CysCys: 0.205 ± 0.072
0.819CysAsp: 0.819 ± 0.139
0.819CysGlu: 0.819 ± 0.173
0.637CysPhe: 0.637 ± 0.13
1.046CysGly: 1.046 ± 0.206
0.318CysHis: 0.318 ± 0.087
0.568CysIle: 0.568 ± 0.123
0.978CysLys: 0.978 ± 0.152
1.137CysLeu: 1.137 ± 0.197
0.159CysMet: 0.159 ± 0.063
0.523CysAsn: 0.523 ± 0.116
0.455CysPro: 0.455 ± 0.099
0.568CysGln: 0.568 ± 0.123
0.546CysArg: 0.546 ± 0.117
0.91CysSer: 0.91 ± 0.158
0.341CysThr: 0.341 ± 0.091
0.637CysVal: 0.637 ± 0.137
0.227CysTrp: 0.227 ± 0.07
0.568CysTyr: 0.568 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
3.888AspAla: 3.888 ± 0.35
0.682AspCys: 0.682 ± 0.129
3.388AspAsp: 3.388 ± 0.32
3.388AspGlu: 3.388 ± 0.296
2.91AspPhe: 2.91 ± 0.308
4.07AspGly: 4.07 ± 0.37
0.819AspHis: 0.819 ± 0.139
3.911AspIle: 3.911 ± 0.323
4.343AspLys: 4.343 ± 0.327
5.935AspLeu: 5.935 ± 0.411
0.887AspMet: 0.887 ± 0.162
2.638AspAsn: 2.638 ± 0.209
2.433AspPro: 2.433 ± 0.355
2.115AspGln: 2.115 ± 0.31
3.138AspArg: 3.138 ± 0.292
4.252AspSer: 4.252 ± 0.332
3.047AspThr: 3.047 ± 0.267
3.57AspVal: 3.57 ± 0.276
0.819AspTrp: 0.819 ± 0.136
2.456AspTyr: 2.456 ± 0.226
0.0AspXaa: 0.0 ± 0.0
Glu
5.548GluAla: 5.548 ± 0.475
0.864GluCys: 0.864 ± 0.139
3.365GluAsp: 3.365 ± 0.27
3.524GluGlu: 3.524 ± 0.453
2.979GluPhe: 2.979 ± 0.273
2.888GluGly: 2.888 ± 0.241
0.887GluHis: 0.887 ± 0.173
5.116GluIle: 5.116 ± 0.342
3.843GluLys: 3.843 ± 0.433
7.776GluLeu: 7.776 ± 0.454
1.501GluMet: 1.501 ± 0.199
2.819GluAsn: 2.819 ± 0.291
3.001GluPro: 3.001 ± 0.344
2.91GluGln: 2.91 ± 0.392
3.57GluArg: 3.57 ± 0.299
4.32GluSer: 4.32 ± 0.35
3.661GluThr: 3.661 ± 0.358
4.798GluVal: 4.798 ± 0.321
0.546GluTrp: 0.546 ± 0.116
2.865GluTyr: 2.865 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
1.978PheAla: 1.978 ± 0.229
0.5PheCys: 0.5 ± 0.116
2.751PheAsp: 2.751 ± 0.254
2.115PheGlu: 2.115 ± 0.276
1.228PhePhe: 1.228 ± 0.209
2.66PheGly: 2.66 ± 0.322
0.455PheHis: 0.455 ± 0.112
2.456PheIle: 2.456 ± 0.276
2.842PheLys: 2.842 ± 0.304
3.092PheLeu: 3.092 ± 0.272
0.932PheMet: 0.932 ± 0.177
2.365PheAsn: 2.365 ± 0.242
1.796PhePro: 1.796 ± 0.216
1.796PheGln: 1.796 ± 0.178
1.774PheArg: 1.774 ± 0.203
3.07PheSer: 3.07 ± 0.261
3.229PheThr: 3.229 ± 0.319
2.092PheVal: 2.092 ± 0.23
0.25PheTrp: 0.25 ± 0.067
1.546PheTyr: 1.546 ± 0.232
0.0PheXaa: 0.0 ± 0.0
Gly
3.752GlyAla: 3.752 ± 0.321
0.932GlyCys: 0.932 ± 0.163
4.047GlyAsp: 4.047 ± 0.309
3.888GlyGlu: 3.888 ± 0.32
3.138GlyPhe: 3.138 ± 0.264
4.116GlyGly: 4.116 ± 0.356
0.91GlyHis: 0.91 ± 0.144
3.729GlyIle: 3.729 ± 0.264
4.502GlyLys: 4.502 ± 0.341
5.48GlyLeu: 5.48 ± 0.385
1.432GlyMet: 1.432 ± 0.197
3.229GlyAsn: 3.229 ± 0.322
0.023GlyPro: 0.023 ± 0.022
2.228GlyGln: 2.228 ± 0.257
2.888GlyArg: 2.888 ± 0.306
4.98GlySer: 4.98 ± 0.409
3.865GlyThr: 3.865 ± 0.438
4.343GlyVal: 4.343 ± 0.383
1.205GlyTrp: 1.205 ± 0.202
2.888GlyTyr: 2.888 ± 0.356
0.0GlyXaa: 0.0 ± 0.0
His
0.932HisAla: 0.932 ± 0.162
0.364HisCys: 0.364 ± 0.095
0.773HisAsp: 0.773 ± 0.157
0.75HisGlu: 0.75 ± 0.144
0.659HisPhe: 0.659 ± 0.139
0.705HisGly: 0.705 ± 0.12
0.318HisHis: 0.318 ± 0.086
0.864HisIle: 0.864 ± 0.144
1.319HisLys: 1.319 ± 0.261
1.66HisLeu: 1.66 ± 0.212
0.159HisMet: 0.159 ± 0.054
0.819HisAsn: 0.819 ± 0.137
0.728HisPro: 0.728 ± 0.131
0.978HisGln: 0.978 ± 0.19
0.773HisArg: 0.773 ± 0.154
1.046HisSer: 1.046 ± 0.214
0.705HisThr: 0.705 ± 0.157
0.523HisVal: 0.523 ± 0.103
0.25HisTrp: 0.25 ± 0.097
0.841HisTyr: 0.841 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
6.298IleAla: 6.298 ± 0.378
0.546IleCys: 0.546 ± 0.109
4.32IleAsp: 4.32 ± 0.303
4.411IleGlu: 4.411 ± 0.34
1.864IlePhe: 1.864 ± 0.235
3.706IleGly: 3.706 ± 0.381
1.0IleHis: 1.0 ± 0.195
3.024IleIle: 3.024 ± 0.333
5.093IleLys: 5.093 ± 0.358
4.32IleLeu: 4.32 ± 0.368
0.796IleMet: 0.796 ± 0.132
3.479IleAsn: 3.479 ± 0.298
2.592IlePro: 2.592 ± 0.335
2.615IleGln: 2.615 ± 0.275
2.979IleArg: 2.979 ± 0.269
4.843IleSer: 4.843 ± 0.392
4.275IleThr: 4.275 ± 0.368
3.547IleVal: 3.547 ± 0.323
0.568IleTrp: 0.568 ± 0.136
2.001IleTyr: 2.001 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
5.434LysAla: 5.434 ± 0.376
0.773LysCys: 0.773 ± 0.138
3.706LysAsp: 3.706 ± 0.317
4.752LysGlu: 4.752 ± 0.336
2.751LysPhe: 2.751 ± 0.248
3.979LysGly: 3.979 ± 0.364
1.023LysHis: 1.023 ± 0.199
4.047LysIle: 4.047 ± 0.301
5.548LysLys: 5.548 ± 0.507
8.276LysLeu: 8.276 ± 0.556
1.0LysMet: 1.0 ± 0.157
3.183LysAsn: 3.183 ± 0.342
3.934LysPro: 3.934 ± 0.31
3.57LysGln: 3.57 ± 0.395
3.661LysArg: 3.661 ± 0.335
5.161LysSer: 5.161 ± 0.316
4.32LysThr: 4.32 ± 0.406
5.025LysVal: 5.025 ± 0.325
0.705LysTrp: 0.705 ± 0.121
2.137LysTyr: 2.137 ± 0.252
0.0LysXaa: 0.0 ± 0.0
Leu
7.617LeuAla: 7.617 ± 0.655
0.819LeuCys: 0.819 ± 0.169
5.503LeuAsp: 5.503 ± 0.375
6.776LeuGlu: 6.776 ± 0.368
3.365LeuPhe: 3.365 ± 0.308
5.753LeuGly: 5.753 ± 0.433
1.66LeuHis: 1.66 ± 0.251
5.139LeuIle: 5.139 ± 0.326
6.435LeuLys: 6.435 ± 0.48
8.458LeuLeu: 8.458 ± 0.49
1.296LeuMet: 1.296 ± 0.188
5.48LeuAsn: 5.48 ± 0.432
4.32LeuPro: 4.32 ± 0.323
4.479LeuGln: 4.479 ± 0.436
4.297LeuArg: 4.297 ± 0.388
6.617LeuSer: 6.617 ± 0.387
5.866LeuThr: 5.866 ± 0.428
5.775LeuVal: 5.775 ± 0.388
0.864LeuTrp: 0.864 ± 0.159
2.569LeuTyr: 2.569 ± 0.264
0.0LeuXaa: 0.0 ± 0.0
Met
1.205MetAla: 1.205 ± 0.154
0.455MetCys: 0.455 ± 0.108
0.887MetAsp: 0.887 ± 0.144
1.114MetGlu: 1.114 ± 0.179
0.5MetPhe: 0.5 ± 0.114
0.91MetGly: 0.91 ± 0.137
0.182MetHis: 0.182 ± 0.063
1.137MetIle: 1.137 ± 0.172
1.069MetLys: 1.069 ± 0.191
1.864MetLeu: 1.864 ± 0.265
0.318MetMet: 0.318 ± 0.091
0.637MetAsn: 0.637 ± 0.117
0.728MetPro: 0.728 ± 0.132
0.819MetGln: 0.819 ± 0.146
0.728MetArg: 0.728 ± 0.144
1.796MetSer: 1.796 ± 0.265
1.432MetThr: 1.432 ± 0.181
1.228MetVal: 1.228 ± 0.169
0.068MetTrp: 0.068 ± 0.036
0.659MetTyr: 0.659 ± 0.133
0.0MetXaa: 0.0 ± 0.0
Asn
3.07AsnAla: 3.07 ± 0.241
1.069AsnCys: 1.069 ± 0.188
2.41AsnAsp: 2.41 ± 0.26
2.774AsnGlu: 2.774 ± 0.233
2.092AsnPhe: 2.092 ± 0.247
3.342AsnGly: 3.342 ± 0.338
0.728AsnHis: 0.728 ± 0.175
2.979AsnIle: 2.979 ± 0.247
3.706AsnLys: 3.706 ± 0.299
4.729AsnLeu: 4.729 ± 0.398
0.773AsnMet: 0.773 ± 0.156
2.615AsnAsn: 2.615 ± 0.245
3.024AsnPro: 3.024 ± 0.252
2.524AsnGln: 2.524 ± 0.368
2.183AsnArg: 2.183 ± 0.222
3.411AsnSer: 3.411 ± 0.462
2.501AsnThr: 2.501 ± 0.278
2.615AsnVal: 2.615 ± 0.255
0.637AsnTrp: 0.637 ± 0.125
1.887AsnTyr: 1.887 ± 0.176
0.0AsnXaa: 0.0 ± 0.0
Pro
2.729ProAla: 2.729 ± 0.329
0.432ProCys: 0.432 ± 0.102
3.251ProAsp: 3.251 ± 0.268
3.865ProGlu: 3.865 ± 0.433
1.455ProPhe: 1.455 ± 0.202
2.638ProGly: 2.638 ± 0.282
0.75ProHis: 0.75 ± 0.169
1.842ProIle: 1.842 ± 0.199
3.297ProLys: 3.297 ± 0.318
2.706ProLeu: 2.706 ± 0.307
0.659ProMet: 0.659 ± 0.126
1.933ProAsn: 1.933 ± 0.203
1.614ProPro: 1.614 ± 0.266
2.183ProGln: 2.183 ± 0.209
1.16ProArg: 1.16 ± 0.158
3.115ProSer: 3.115 ± 0.346
2.933ProThr: 2.933 ± 0.284
3.206ProVal: 3.206 ± 0.305
0.296ProTrp: 0.296 ± 0.093
1.228ProTyr: 1.228 ± 0.176
0.0ProXaa: 0.0 ± 0.0
Gln
3.388GlnAla: 3.388 ± 0.398
0.364GlnCys: 0.364 ± 0.102
1.933GlnAsp: 1.933 ± 0.207
3.661GlnGlu: 3.661 ± 0.449
1.683GlnPhe: 1.683 ± 0.225
2.365GlnGly: 2.365 ± 0.241
0.728GlnHis: 0.728 ± 0.12
3.661GlnIle: 3.661 ± 0.295
2.615GlnLys: 2.615 ± 0.313
5.23GlnLeu: 5.23 ± 0.623
0.773GlnMet: 0.773 ± 0.155
1.523GlnAsn: 1.523 ± 0.185
2.433GlnPro: 2.433 ± 0.284
3.388GlnGln: 3.388 ± 0.472
2.092GlnArg: 2.092 ± 0.209
3.297GlnSer: 3.297 ± 0.48
2.433GlnThr: 2.433 ± 0.248
3.274GlnVal: 3.274 ± 0.226
0.5GlnTrp: 0.5 ± 0.116
1.273GlnTyr: 1.273 ± 0.149
0.0GlnXaa: 0.0 ± 0.0
Arg
2.569ArgAla: 2.569 ± 0.219
0.546ArgCys: 0.546 ± 0.119
2.547ArgAsp: 2.547 ± 0.241
3.365ArgGlu: 3.365 ± 0.306
2.024ArgPhe: 2.024 ± 0.241
2.456ArgGly: 2.456 ± 0.245
0.841ArgHis: 0.841 ± 0.153
2.547ArgIle: 2.547 ± 0.261
3.797ArgLys: 3.797 ± 0.292
5.252ArgLeu: 5.252 ± 0.423
0.978ArgMet: 0.978 ± 0.205
2.296ArgAsn: 2.296 ± 0.224
1.342ArgPro: 1.342 ± 0.173
2.365ArgGln: 2.365 ± 0.239
2.251ArgArg: 2.251 ± 0.262
3.615ArgSer: 3.615 ± 0.28
2.16ArgThr: 2.16 ± 0.202
3.547ArgVal: 3.547 ± 0.296
0.546ArgTrp: 0.546 ± 0.136
1.614ArgTyr: 1.614 ± 0.202
0.0ArgXaa: 0.0 ± 0.0
Ser
4.729SerAla: 4.729 ± 0.305
0.568SerCys: 0.568 ± 0.121
4.889SerAsp: 4.889 ± 0.461
4.775SerGlu: 4.775 ± 0.354
2.979SerPhe: 2.979 ± 0.262
5.639SerGly: 5.639 ± 0.524
1.251SerHis: 1.251 ± 0.232
4.138SerIle: 4.138 ± 0.35
5.389SerLys: 5.389 ± 0.383
6.799SerLeu: 6.799 ± 0.491
1.182SerMet: 1.182 ± 0.183
3.82SerAsn: 3.82 ± 0.323
3.07SerPro: 3.07 ± 0.293
3.206SerGln: 3.206 ± 0.452
3.297SerArg: 3.297 ± 0.301
5.139SerSer: 5.139 ± 0.55
4.525SerThr: 4.525 ± 0.386
4.661SerVal: 4.661 ± 0.353
0.637SerTrp: 0.637 ± 0.131
1.955SerTyr: 1.955 ± 0.189
0.0SerXaa: 0.0 ± 0.0
Thr
4.98ThrAla: 4.98 ± 0.431
0.773ThrCys: 0.773 ± 0.13
3.433ThrAsp: 3.433 ± 0.341
3.843ThrGlu: 3.843 ± 0.347
2.092ThrPhe: 2.092 ± 0.258
3.729ThrGly: 3.729 ± 0.323
1.046ThrHis: 1.046 ± 0.168
4.161ThrIle: 4.161 ± 0.315
4.57ThrLys: 4.57 ± 0.335
5.298ThrLeu: 5.298 ± 0.371
0.682ThrMet: 0.682 ± 0.126
3.001ThrAsn: 3.001 ± 0.231
3.433ThrPro: 3.433 ± 0.383
2.865ThrGln: 2.865 ± 0.306
2.615ThrArg: 2.615 ± 0.301
4.229ThrSer: 4.229 ± 0.379
4.661ThrThr: 4.661 ± 0.478
3.888ThrVal: 3.888 ± 0.353
0.546ThrTrp: 0.546 ± 0.118
2.433ThrTyr: 2.433 ± 0.238
0.0ThrXaa: 0.0 ± 0.0
Val
5.457ValAla: 5.457 ± 0.318
0.773ValCys: 0.773 ± 0.149
4.116ValAsp: 4.116 ± 0.241
5.048ValGlu: 5.048 ± 0.301
2.228ValPhe: 2.228 ± 0.255
4.047ValGly: 4.047 ± 0.442
0.614ValHis: 0.614 ± 0.116
4.388ValIle: 4.388 ± 0.389
4.525ValLys: 4.525 ± 0.297
4.593ValLeu: 4.593 ± 0.263
1.41ValMet: 1.41 ± 0.22
3.342ValAsn: 3.342 ± 0.308
2.183ValPro: 2.183 ± 0.246
2.433ValGln: 2.433 ± 0.25
3.229ValArg: 3.229 ± 0.264
4.752ValSer: 4.752 ± 0.337
4.798ValThr: 4.798 ± 0.391
3.934ValVal: 3.934 ± 0.298
0.659ValTrp: 0.659 ± 0.144
1.91ValTyr: 1.91 ± 0.218
0.0ValXaa: 0.0 ± 0.0
Trp
0.455TrpAla: 0.455 ± 0.1
0.205TrpCys: 0.205 ± 0.082
0.773TrpAsp: 0.773 ± 0.159
0.728TrpGlu: 0.728 ± 0.167
0.409TrpPhe: 0.409 ± 0.101
0.887TrpGly: 0.887 ± 0.134
0.205TrpHis: 0.205 ± 0.068
0.568TrpIle: 0.568 ± 0.126
0.841TrpLys: 0.841 ± 0.122
1.069TrpLeu: 1.069 ± 0.159
0.364TrpMet: 0.364 ± 0.096
0.455TrpAsn: 0.455 ± 0.099
0.045TrpPro: 0.045 ± 0.034
0.5TrpGln: 0.5 ± 0.113
0.682TrpArg: 0.682 ± 0.133
0.637TrpSer: 0.637 ± 0.12
0.409TrpThr: 0.409 ± 0.084
1.0TrpVal: 1.0 ± 0.175
0.25TrpTrp: 0.25 ± 0.085
0.546TrpTyr: 0.546 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.796TyrAla: 1.796 ± 0.199
0.591TyrCys: 0.591 ± 0.136
1.751TyrAsp: 1.751 ± 0.203
1.819TyrGlu: 1.819 ± 0.208
1.728TyrPhe: 1.728 ± 0.224
2.342TyrGly: 2.342 ± 0.276
0.637TyrHis: 0.637 ± 0.14
2.001TyrIle: 2.001 ± 0.248
2.66TyrLys: 2.66 ± 0.323
3.342TyrLeu: 3.342 ± 0.304
0.819TyrMet: 0.819 ± 0.145
1.614TyrAsn: 1.614 ± 0.167
1.41TyrPro: 1.41 ± 0.185
1.774TyrGln: 1.774 ± 0.192
1.637TyrArg: 1.637 ± 0.227
2.706TyrSer: 2.706 ± 0.27
2.319TyrThr: 2.319 ± 0.199
1.614TyrVal: 1.614 ± 0.225
0.614TyrTrp: 0.614 ± 0.112
1.296TyrTyr: 1.296 ± 0.166
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 206 proteins (43981 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski