Amino acid dipepetide frequency for Vibrio phage ICP1_2011_A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.832AlaAla: 3.832 ± 0.452
0.788AlaCys: 0.788 ± 0.117
2.935AlaAsp: 2.935 ± 0.349
3.886AlaGlu: 3.886 ± 0.35
2.12AlaPhe: 2.12 ± 0.22
2.908AlaGly: 2.908 ± 0.317
0.978AlaHis: 0.978 ± 0.142
3.696AlaIle: 3.696 ± 0.335
3.723AlaLys: 3.723 ± 0.35
4.973AlaLeu: 4.973 ± 0.395
1.223AlaMet: 1.223 ± 0.23
2.419AlaAsn: 2.419 ± 0.251
1.223AlaPro: 1.223 ± 0.191
1.495AlaGln: 1.495 ± 0.206
1.929AlaArg: 1.929 ± 0.26
2.853AlaSer: 2.853 ± 0.36
3.016AlaThr: 3.016 ± 0.316
2.962AlaVal: 2.962 ± 0.329
0.815AlaTrp: 0.815 ± 0.149
2.228AlaTyr: 2.228 ± 0.258
0.0AlaXaa: 0.0 ± 0.0
Cys
0.897CysAla: 0.897 ± 0.152
0.408CysCys: 0.408 ± 0.092
0.842CysAsp: 0.842 ± 0.143
1.386CysGlu: 1.386 ± 0.206
0.707CysPhe: 0.707 ± 0.118
1.766CysGly: 1.766 ± 0.241
0.462CysHis: 0.462 ± 0.132
0.951CysIle: 0.951 ± 0.169
1.794CysLys: 1.794 ± 0.259
0.897CysLeu: 0.897 ± 0.203
0.462CysMet: 0.462 ± 0.108
0.87CysAsn: 0.87 ± 0.172
0.707CysPro: 0.707 ± 0.14
0.489CysGln: 0.489 ± 0.104
0.815CysArg: 0.815 ± 0.136
1.223CysSer: 1.223 ± 0.245
1.005CysThr: 1.005 ± 0.192
1.06CysVal: 1.06 ± 0.148
0.245CysTrp: 0.245 ± 0.088
0.924CysTyr: 0.924 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
2.609AspAla: 2.609 ± 0.281
1.386AspCys: 1.386 ± 0.178
3.451AspAsp: 3.451 ± 0.354
4.62AspGlu: 4.62 ± 0.376
3.125AspPhe: 3.125 ± 0.319
4.647AspGly: 4.647 ± 0.35
0.978AspHis: 0.978 ± 0.159
4.593AspIle: 4.593 ± 0.371
5.353AspLys: 5.353 ± 0.468
5.734AspLeu: 5.734 ± 0.471
1.902AspMet: 1.902 ± 0.24
3.75AspAsn: 3.75 ± 0.272
1.712AspPro: 1.712 ± 0.214
1.875AspGln: 1.875 ± 0.28
2.582AspArg: 2.582 ± 0.261
3.478AspSer: 3.478 ± 0.276
3.804AspThr: 3.804 ± 0.293
4.076AspVal: 4.076 ± 0.352
1.141AspTrp: 1.141 ± 0.172
3.832AspTyr: 3.832 ± 0.323
0.0AspXaa: 0.0 ± 0.0
Glu
4.185GluAla: 4.185 ± 0.355
1.332GluCys: 1.332 ± 0.214
6.196GluAsp: 6.196 ± 0.434
7.69GluGlu: 7.69 ± 0.581
3.37GluPhe: 3.37 ± 0.308
4.973GluGly: 4.973 ± 0.412
1.549GluHis: 1.549 ± 0.233
4.728GluIle: 4.728 ± 0.365
5.19GluLys: 5.19 ± 0.425
6.413GluLeu: 6.413 ± 0.416
2.092GluMet: 2.092 ± 0.234
3.669GluAsn: 3.669 ± 0.298
1.522GluPro: 1.522 ± 0.247
2.582GluGln: 2.582 ± 0.306
3.016GluArg: 3.016 ± 0.346
4.783GluSer: 4.783 ± 0.325
3.016GluThr: 3.016 ± 0.304
5.625GluVal: 5.625 ± 0.386
1.359GluTrp: 1.359 ± 0.214
3.75GluTyr: 3.75 ± 0.285
0.0GluXaa: 0.0 ± 0.0
Phe
2.12PheAla: 2.12 ± 0.286
0.788PheCys: 0.788 ± 0.144
3.777PheAsp: 3.777 ± 0.315
3.315PheGlu: 3.315 ± 0.316
1.277PhePhe: 1.277 ± 0.198
2.881PheGly: 2.881 ± 0.29
0.951PheHis: 0.951 ± 0.145
2.962PheIle: 2.962 ± 0.283
3.506PheLys: 3.506 ± 0.323
2.962PheLeu: 2.962 ± 0.271
0.951PheMet: 0.951 ± 0.165
2.147PheAsn: 2.147 ± 0.224
1.359PhePro: 1.359 ± 0.212
1.277PheGln: 1.277 ± 0.181
1.386PheArg: 1.386 ± 0.168
2.772PheSer: 2.772 ± 0.296
3.234PheThr: 3.234 ± 0.26
3.071PheVal: 3.071 ± 0.268
0.734PheTrp: 0.734 ± 0.16
1.875PheTyr: 1.875 ± 0.253
0.0PheXaa: 0.0 ± 0.0
Gly
2.69GlyAla: 2.69 ± 0.296
1.277GlyCys: 1.277 ± 0.175
4.402GlyAsp: 4.402 ± 0.314
5.136GlyGlu: 5.136 ± 0.374
3.098GlyPhe: 3.098 ± 0.335
4.131GlyGly: 4.131 ± 0.397
0.815GlyHis: 0.815 ± 0.144
3.696GlyIle: 3.696 ± 0.306
5.435GlyLys: 5.435 ± 0.41
4.538GlyLeu: 4.538 ± 0.367
1.25GlyMet: 1.25 ± 0.16
3.75GlyAsn: 3.75 ± 0.298
0.245GlyPro: 0.245 ± 0.09
1.766GlyGln: 1.766 ± 0.213
2.391GlyArg: 2.391 ± 0.254
4.049GlySer: 4.049 ± 0.372
3.723GlyThr: 3.723 ± 0.363
5.68GlyVal: 5.68 ± 0.411
1.576GlyTrp: 1.576 ± 0.22
3.56GlyTyr: 3.56 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
0.707HisAla: 0.707 ± 0.128
0.462HisCys: 0.462 ± 0.111
0.924HisAsp: 0.924 ± 0.148
1.413HisGlu: 1.413 ± 0.178
1.005HisPhe: 1.005 ± 0.175
1.169HisGly: 1.169 ± 0.156
0.598HisHis: 0.598 ± 0.151
1.63HisIle: 1.63 ± 0.199
1.875HisLys: 1.875 ± 0.205
1.549HisLeu: 1.549 ± 0.182
0.543HisMet: 0.543 ± 0.128
1.169HisAsn: 1.169 ± 0.188
0.571HisPro: 0.571 ± 0.105
0.788HisGln: 0.788 ± 0.162
0.815HisArg: 0.815 ± 0.198
1.277HisSer: 1.277 ± 0.204
1.413HisThr: 1.413 ± 0.192
0.951HisVal: 0.951 ± 0.182
0.217HisTrp: 0.217 ± 0.071
0.897HisTyr: 0.897 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
3.506IleAla: 3.506 ± 0.3
0.87IleCys: 0.87 ± 0.146
4.701IleAsp: 4.701 ± 0.392
4.375IleGlu: 4.375 ± 0.328
2.31IlePhe: 2.31 ± 0.27
3.342IleGly: 3.342 ± 0.263
1.223IleHis: 1.223 ± 0.208
4.266IleIle: 4.266 ± 0.356
6.033IleLys: 6.033 ± 0.378
4.81IleLeu: 4.81 ± 0.343
1.114IleMet: 1.114 ± 0.165
3.641IleAsn: 3.641 ± 0.295
2.636IlePro: 2.636 ± 0.292
1.902IleGln: 1.902 ± 0.25
2.962IleArg: 2.962 ± 0.317
3.886IleSer: 3.886 ± 0.316
4.022IleThr: 4.022 ± 0.382
4.593IleVal: 4.593 ± 0.385
0.571IleTrp: 0.571 ± 0.133
2.69IleTyr: 2.69 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
4.022LysAla: 4.022 ± 0.413
1.304LysCys: 1.304 ± 0.194
5.326LysAsp: 5.326 ± 0.356
6.359LysGlu: 6.359 ± 0.437
3.478LysPhe: 3.478 ± 0.263
5.435LysGly: 5.435 ± 0.409
2.065LysHis: 2.065 ± 0.25
4.837LysIle: 4.837 ± 0.368
4.429LysLys: 4.429 ± 0.387
7.5LysLeu: 7.5 ± 0.426
2.255LysMet: 2.255 ± 0.234
3.315LysAsn: 3.315 ± 0.31
2.201LysPro: 2.201 ± 0.229
3.288LysGln: 3.288 ± 0.331
4.239LysArg: 4.239 ± 0.378
5.326LysSer: 5.326 ± 0.409
3.75LysThr: 3.75 ± 0.335
6.141LysVal: 6.141 ± 0.437
1.087LysTrp: 1.087 ± 0.208
4.131LysTyr: 4.131 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
4.647LeuAla: 4.647 ± 0.383
1.603LeuCys: 1.603 ± 0.214
5.843LeuAsp: 5.843 ± 0.498
6.957LeuGlu: 6.957 ± 0.443
3.098LeuPhe: 3.098 ± 0.31
5.326LeuGly: 5.326 ± 0.428
1.63LeuHis: 1.63 ± 0.218
4.239LeuIle: 4.239 ± 0.332
6.413LeuLys: 6.413 ± 0.399
7.337LeuLeu: 7.337 ± 0.454
2.174LeuMet: 2.174 ± 0.287
4.593LeuAsn: 4.593 ± 0.401
2.826LeuPro: 2.826 ± 0.283
2.935LeuGln: 2.935 ± 0.261
3.56LeuArg: 3.56 ± 0.329
6.603LeuSer: 6.603 ± 0.513
4.62LeuThr: 4.62 ± 0.352
5.353LeuVal: 5.353 ± 0.356
1.44LeuTrp: 1.44 ± 0.195
3.533LeuTyr: 3.533 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
1.549MetAla: 1.549 ± 0.251
0.435MetCys: 0.435 ± 0.119
1.359MetAsp: 1.359 ± 0.209
1.413MetGlu: 1.413 ± 0.215
1.114MetPhe: 1.114 ± 0.148
0.924MetGly: 0.924 ± 0.165
0.571MetHis: 0.571 ± 0.137
1.576MetIle: 1.576 ± 0.233
2.446MetLys: 2.446 ± 0.266
1.794MetLeu: 1.794 ± 0.223
0.815MetMet: 0.815 ± 0.164
1.766MetAsn: 1.766 ± 0.264
0.761MetPro: 0.761 ± 0.14
1.25MetGln: 1.25 ± 0.166
1.169MetArg: 1.169 ± 0.204
2.201MetSer: 2.201 ± 0.19
1.603MetThr: 1.603 ± 0.224
1.196MetVal: 1.196 ± 0.169
0.163MetTrp: 0.163 ± 0.075
1.087MetTyr: 1.087 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
2.391AsnAla: 2.391 ± 0.256
0.707AsnCys: 0.707 ± 0.16
2.962AsnAsp: 2.962 ± 0.235
2.745AsnGlu: 2.745 ± 0.238
2.609AsnPhe: 2.609 ± 0.249
4.158AsnGly: 4.158 ± 0.281
1.332AsnHis: 1.332 ± 0.191
4.185AsnIle: 4.185 ± 0.384
4.81AsnLys: 4.81 ± 0.4
4.728AsnLeu: 4.728 ± 0.359
1.304AsnMet: 1.304 ± 0.196
3.179AsnAsn: 3.179 ± 0.331
2.038AsnPro: 2.038 ± 0.229
1.929AsnGln: 1.929 ± 0.224
2.092AsnArg: 2.092 ± 0.256
3.397AsnSer: 3.397 ± 0.349
3.207AsnThr: 3.207 ± 0.338
2.745AsnVal: 2.745 ± 0.261
0.842AsnTrp: 0.842 ± 0.141
2.391AsnTyr: 2.391 ± 0.241
0.0AsnXaa: 0.0 ± 0.0
Pro
1.522ProAla: 1.522 ± 0.222
0.516ProCys: 0.516 ± 0.139
1.957ProAsp: 1.957 ± 0.224
2.69ProGlu: 2.69 ± 0.244
1.549ProPhe: 1.549 ± 0.2
0.109ProGly: 0.109 ± 0.061
1.005ProHis: 1.005 ± 0.192
1.821ProIle: 1.821 ± 0.248
2.065ProLys: 2.065 ± 0.235
2.065ProLeu: 2.065 ± 0.261
0.734ProMet: 0.734 ± 0.132
1.522ProAsn: 1.522 ± 0.2
0.87ProPro: 0.87 ± 0.184
1.169ProGln: 1.169 ± 0.17
0.842ProArg: 0.842 ± 0.157
1.821ProSer: 1.821 ± 0.244
1.739ProThr: 1.739 ± 0.213
2.255ProVal: 2.255 ± 0.292
0.462ProTrp: 0.462 ± 0.11
1.467ProTyr: 1.467 ± 0.219
0.0ProXaa: 0.0 ± 0.0
Gln
2.038GlnAla: 2.038 ± 0.272
0.462GlnCys: 0.462 ± 0.111
1.984GlnAsp: 1.984 ± 0.227
3.669GlnGlu: 3.669 ± 0.302
1.005GlnPhe: 1.005 ± 0.145
2.174GlnGly: 2.174 ± 0.264
0.788GlnHis: 0.788 ± 0.128
1.984GlnIle: 1.984 ± 0.234
2.636GlnLys: 2.636 ± 0.268
2.527GlnLeu: 2.527 ± 0.249
0.815GlnMet: 0.815 ± 0.151
1.821GlnAsn: 1.821 ± 0.192
0.761GlnPro: 0.761 ± 0.138
1.63GlnGln: 1.63 ± 0.218
1.794GlnArg: 1.794 ± 0.232
2.228GlnSer: 2.228 ± 0.219
1.875GlnThr: 1.875 ± 0.236
2.174GlnVal: 2.174 ± 0.258
0.38GlnTrp: 0.38 ± 0.11
1.44GlnTyr: 1.44 ± 0.212
0.0GlnXaa: 0.0 ± 0.0
Arg
1.603ArgAla: 1.603 ± 0.213
0.978ArgCys: 0.978 ± 0.157
2.473ArgAsp: 2.473 ± 0.257
3.207ArgGlu: 3.207 ± 0.292
2.011ArgPhe: 2.011 ± 0.249
2.772ArgGly: 2.772 ± 0.263
0.734ArgHis: 0.734 ± 0.124
2.554ArgIle: 2.554 ± 0.265
3.614ArgLys: 3.614 ± 0.337
4.076ArgLeu: 4.076 ± 0.364
1.277ArgMet: 1.277 ± 0.211
2.228ArgAsn: 2.228 ± 0.232
1.141ArgPro: 1.141 ± 0.185
1.658ArgGln: 1.658 ± 0.208
1.63ArgArg: 1.63 ± 0.197
2.31ArgSer: 2.31 ± 0.274
1.902ArgThr: 1.902 ± 0.232
2.745ArgVal: 2.745 ± 0.247
0.788ArgTrp: 0.788 ± 0.136
1.902ArgTyr: 1.902 ± 0.187
0.0ArgXaa: 0.0 ± 0.0
Ser
2.853SerAla: 2.853 ± 0.283
1.005SerCys: 1.005 ± 0.158
4.103SerAsp: 4.103 ± 0.344
4.674SerGlu: 4.674 ± 0.34
3.044SerPhe: 3.044 ± 0.334
5.027SerGly: 5.027 ± 0.357
1.06SerHis: 1.06 ± 0.176
3.669SerIle: 3.669 ± 0.32
5.924SerLys: 5.924 ± 0.378
5.707SerLeu: 5.707 ± 0.47
1.576SerMet: 1.576 ± 0.209
3.288SerAsn: 3.288 ± 0.264
1.549SerPro: 1.549 ± 0.193
1.712SerGln: 1.712 ± 0.216
2.69SerArg: 2.69 ± 0.244
4.647SerSer: 4.647 ± 0.368
3.207SerThr: 3.207 ± 0.314
4.266SerVal: 4.266 ± 0.377
1.196SerTrp: 1.196 ± 0.189
3.261SerTyr: 3.261 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
3.207ThrAla: 3.207 ± 0.357
1.005ThrCys: 1.005 ± 0.199
2.582ThrAsp: 2.582 ± 0.273
3.506ThrGlu: 3.506 ± 0.343
2.717ThrPhe: 2.717 ± 0.301
3.75ThrGly: 3.75 ± 0.327
1.114ThrHis: 1.114 ± 0.167
4.593ThrIle: 4.593 ± 0.334
4.212ThrLys: 4.212 ± 0.348
5.815ThrLeu: 5.815 ± 0.422
1.467ThrMet: 1.467 ± 0.204
2.962ThrAsn: 2.962 ± 0.304
2.826ThrPro: 2.826 ± 0.28
1.685ThrGln: 1.685 ± 0.203
2.228ThrArg: 2.228 ± 0.234
3.451ThrSer: 3.451 ± 0.34
3.696ThrThr: 3.696 ± 0.39
3.696ThrVal: 3.696 ± 0.34
0.761ThrTrp: 0.761 ± 0.126
2.446ThrTyr: 2.446 ± 0.248
0.0ThrXaa: 0.0 ± 0.0
Val
3.37ValAla: 3.37 ± 0.306
1.141ValCys: 1.141 ± 0.229
4.674ValAsp: 4.674 ± 0.326
5.625ValGlu: 5.625 ± 0.365
3.016ValPhe: 3.016 ± 0.321
4.294ValGly: 4.294 ± 0.386
0.734ValHis: 0.734 ± 0.17
3.723ValIle: 3.723 ± 0.325
5.761ValLys: 5.761 ± 0.469
5.381ValLeu: 5.381 ± 0.358
1.63ValMet: 1.63 ± 0.201
4.131ValAsn: 4.131 ± 0.334
1.658ValPro: 1.658 ± 0.184
2.283ValGln: 2.283 ± 0.273
2.5ValArg: 2.5 ± 0.21
4.103ValSer: 4.103 ± 0.289
4.756ValThr: 4.756 ± 0.415
4.728ValVal: 4.728 ± 0.414
0.951ValTrp: 0.951 ± 0.181
2.935ValTyr: 2.935 ± 0.279
0.0ValXaa: 0.0 ± 0.0
Trp
0.38TrpAla: 0.38 ± 0.111
0.408TrpCys: 0.408 ± 0.126
1.223TrpAsp: 1.223 ± 0.198
1.332TrpGlu: 1.332 ± 0.206
0.598TrpPhe: 0.598 ± 0.129
0.598TrpGly: 0.598 ± 0.144
0.245TrpHis: 0.245 ± 0.074
0.788TrpIle: 0.788 ± 0.148
1.63TrpLys: 1.63 ± 0.203
1.63TrpLeu: 1.63 ± 0.222
0.462TrpMet: 0.462 ± 0.111
0.788TrpAsn: 0.788 ± 0.175
0.19TrpPro: 0.19 ± 0.056
0.679TrpGln: 0.679 ± 0.12
0.571TrpArg: 0.571 ± 0.108
0.87TrpSer: 0.87 ± 0.148
0.734TrpThr: 0.734 ± 0.151
1.44TrpVal: 1.44 ± 0.232
0.245TrpTrp: 0.245 ± 0.072
0.707TrpTyr: 0.707 ± 0.134
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.902TyrAla: 1.902 ± 0.251
1.033TyrCys: 1.033 ± 0.162
2.935TyrAsp: 2.935 ± 0.275
3.044TyrGlu: 3.044 ± 0.29
2.092TyrPhe: 2.092 ± 0.244
2.826TyrGly: 2.826 ± 0.247
1.141TyrHis: 1.141 ± 0.18
2.962TyrIle: 2.962 ± 0.332
3.641TyrLys: 3.641 ± 0.35
4.294TyrLeu: 4.294 ± 0.367
1.114TyrMet: 1.114 ± 0.186
2.745TyrAsn: 2.745 ± 0.284
1.386TyrPro: 1.386 ± 0.164
1.794TyrGln: 1.794 ± 0.232
2.391TyrArg: 2.391 ± 0.237
3.179TyrSer: 3.179 ± 0.312
3.397TyrThr: 3.397 ± 0.344
2.636TyrVal: 2.636 ± 0.267
0.543TyrTrp: 0.543 ± 0.116
2.147TyrTyr: 2.147 ± 0.286
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 222 proteins (36800 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski