Amino acid dipepetide frequency for Streptococcus phage phi-SsuFJNP8_rum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.964AlaAla: 3.964 ± 0.691
0.56AlaCys: 0.56 ± 0.15
3.016AlaAsp: 3.016 ± 0.322
4.826AlaGlu: 4.826 ± 0.377
2.93AlaPhe: 2.93 ± 0.255
3.749AlaGly: 3.749 ± 0.396
0.776AlaHis: 0.776 ± 0.208
5.214AlaIle: 5.214 ± 0.529
4.998AlaLys: 4.998 ± 0.534
5.817AlaLeu: 5.817 ± 0.475
1.551AlaMet: 1.551 ± 0.222
3.318AlaAsn: 3.318 ± 0.384
1.594AlaPro: 1.594 ± 0.353
2.413AlaGln: 2.413 ± 0.413
2.198AlaArg: 2.198 ± 0.311
5.041AlaSer: 5.041 ± 0.511
4.438AlaThr: 4.438 ± 0.484
3.964AlaVal: 3.964 ± 0.432
0.646AlaTrp: 0.646 ± 0.161
3.059AlaTyr: 3.059 ± 0.341
0.0AlaXaa: 0.0 ± 0.0
Cys
0.474CysAla: 0.474 ± 0.167
0.172CysCys: 0.172 ± 0.072
0.259CysAsp: 0.259 ± 0.106
0.474CysGlu: 0.474 ± 0.125
0.215CysPhe: 0.215 ± 0.098
0.56CysGly: 0.56 ± 0.165
0.215CysHis: 0.215 ± 0.096
0.345CysIle: 0.345 ± 0.097
0.56CysLys: 0.56 ± 0.182
0.689CysLeu: 0.689 ± 0.211
0.086CysMet: 0.086 ± 0.06
0.302CysAsn: 0.302 ± 0.122
0.345CysPro: 0.345 ± 0.127
0.56CysGln: 0.56 ± 0.172
0.474CysArg: 0.474 ± 0.161
0.56CysSer: 0.56 ± 0.173
0.129CysThr: 0.129 ± 0.084
0.345CysVal: 0.345 ± 0.12
0.0CysTrp: 0.0 ± 0.0
0.56CysTyr: 0.56 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
3.361AspAla: 3.361 ± 0.384
0.431AspCys: 0.431 ± 0.125
3.102AspAsp: 3.102 ± 0.453
4.912AspGlu: 4.912 ± 0.624
3.232AspPhe: 3.232 ± 0.414
4.567AspGly: 4.567 ± 0.414
0.819AspHis: 0.819 ± 0.187
4.481AspIle: 4.481 ± 0.445
4.137AspLys: 4.137 ± 0.409
5.645AspLeu: 5.645 ± 0.645
1.379AspMet: 1.379 ± 0.268
2.715AspAsn: 2.715 ± 0.313
1.465AspPro: 1.465 ± 0.302
1.724AspGln: 1.724 ± 0.189
1.767AspArg: 1.767 ± 0.332
4.05AspSer: 4.05 ± 0.463
3.102AspThr: 3.102 ± 0.317
2.973AspVal: 2.973 ± 0.323
0.689AspTrp: 0.689 ± 0.157
3.189AspTyr: 3.189 ± 0.401
0.0AspXaa: 0.0 ± 0.0
Glu
4.912GluAla: 4.912 ± 0.436
0.517GluCys: 0.517 ± 0.157
3.749GluAsp: 3.749 ± 0.49
6.679GluGlu: 6.679 ± 0.805
2.284GluPhe: 2.284 ± 0.309
4.266GluGly: 4.266 ± 0.407
1.379GluHis: 1.379 ± 0.291
4.697GluIle: 4.697 ± 0.434
5.86GluLys: 5.86 ± 0.643
8.962GluLeu: 8.962 ± 0.547
2.327GluMet: 2.327 ± 0.396
4.137GluAsn: 4.137 ± 0.472
1.508GluPro: 1.508 ± 0.262
4.093GluGln: 4.093 ± 0.413
3.145GluArg: 3.145 ± 0.38
3.921GluSer: 3.921 ± 0.435
4.567GluThr: 4.567 ± 0.49
4.395GluVal: 4.395 ± 0.434
0.948GluTrp: 0.948 ± 0.234
2.456GluTyr: 2.456 ± 0.447
0.0GluXaa: 0.0 ± 0.0
Phe
2.758PheAla: 2.758 ± 0.371
0.517PheCys: 0.517 ± 0.144
2.844PheAsp: 2.844 ± 0.325
2.758PheGlu: 2.758 ± 0.314
1.206PhePhe: 1.206 ± 0.223
2.37PheGly: 2.37 ± 0.239
0.991PheHis: 0.991 ± 0.17
2.154PheIle: 2.154 ± 0.346
2.801PheLys: 2.801 ± 0.425
3.576PheLeu: 3.576 ± 0.509
0.862PheMet: 0.862 ± 0.209
2.068PheAsn: 2.068 ± 0.281
0.776PhePro: 0.776 ± 0.173
1.724PheGln: 1.724 ± 0.237
1.637PheArg: 1.637 ± 0.245
2.973PheSer: 2.973 ± 0.358
2.542PheThr: 2.542 ± 0.319
2.327PheVal: 2.327 ± 0.35
0.56PheTrp: 0.56 ± 0.157
1.939PheTyr: 1.939 ± 0.296
0.0PheXaa: 0.0 ± 0.0
Gly
3.275GlyAla: 3.275 ± 0.472
0.388GlyCys: 0.388 ± 0.091
3.404GlyAsp: 3.404 ± 0.488
3.232GlyGlu: 3.232 ± 0.365
2.628GlyPhe: 2.628 ± 0.32
3.878GlyGly: 3.878 ± 0.526
1.422GlyHis: 1.422 ± 0.253
5.472GlyIle: 5.472 ± 0.587
4.869GlyLys: 4.869 ± 0.365
5.602GlyLeu: 5.602 ± 0.517
1.896GlyMet: 1.896 ± 0.304
3.576GlyAsn: 3.576 ± 0.345
0.56GlyPro: 0.56 ± 0.151
3.016GlyGln: 3.016 ± 0.384
3.016GlyArg: 3.016 ± 0.388
3.878GlySer: 3.878 ± 0.385
3.576GlyThr: 3.576 ± 0.386
3.619GlyVal: 3.619 ± 0.402
0.733GlyTrp: 0.733 ± 0.148
3.619GlyTyr: 3.619 ± 0.341
0.0GlyXaa: 0.0 ± 0.0
His
0.991HisAla: 0.991 ± 0.183
0.129HisCys: 0.129 ± 0.07
1.163HisAsp: 1.163 ± 0.252
0.862HisGlu: 0.862 ± 0.18
1.034HisPhe: 1.034 ± 0.203
1.465HisGly: 1.465 ± 0.238
0.603HisHis: 0.603 ± 0.16
1.293HisIle: 1.293 ± 0.211
1.163HisLys: 1.163 ± 0.283
2.068HisLeu: 2.068 ± 0.251
0.259HisMet: 0.259 ± 0.095
0.819HisAsn: 0.819 ± 0.18
1.034HisPro: 1.034 ± 0.213
0.733HisGln: 0.733 ± 0.214
1.034HisArg: 1.034 ± 0.201
1.034HisSer: 1.034 ± 0.225
1.163HisThr: 1.163 ± 0.195
1.25HisVal: 1.25 ± 0.268
0.259HisTrp: 0.259 ± 0.1
0.991HisTyr: 0.991 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
4.266IleAla: 4.266 ± 0.392
0.603IleCys: 0.603 ± 0.132
4.869IleAsp: 4.869 ± 0.429
4.998IleGlu: 4.998 ± 0.503
2.198IlePhe: 2.198 ± 0.345
5.257IleGly: 5.257 ± 0.543
1.206IleHis: 1.206 ± 0.221
4.093IleIle: 4.093 ± 0.449
4.998IleLys: 4.998 ± 0.5
6.032IleLeu: 6.032 ± 0.485
0.948IleMet: 0.948 ± 0.214
3.447IleAsn: 3.447 ± 0.34
2.198IlePro: 2.198 ± 0.256
2.542IleGln: 2.542 ± 0.27
3.232IleArg: 3.232 ± 0.466
4.955IleSer: 4.955 ± 0.628
4.093IleThr: 4.093 ± 0.641
4.74IleVal: 4.74 ± 0.511
0.905IleTrp: 0.905 ± 0.205
2.542IleTyr: 2.542 ± 0.375
0.0IleXaa: 0.0 ± 0.0
Lys
5.602LysAla: 5.602 ± 0.463
0.302LysCys: 0.302 ± 0.112
3.404LysAsp: 3.404 ± 0.446
6.162LysGlu: 6.162 ± 0.599
2.413LysPhe: 2.413 ± 0.299
4.783LysGly: 4.783 ± 0.462
1.25LysHis: 1.25 ± 0.209
4.567LysIle: 4.567 ± 0.508
5.645LysLys: 5.645 ± 0.691
6.506LysLeu: 6.506 ± 0.455
1.767LysMet: 1.767 ± 0.297
3.189LysAsn: 3.189 ± 0.362
1.81LysPro: 1.81 ± 0.213
3.792LysGln: 3.792 ± 0.387
3.361LysArg: 3.361 ± 0.473
5.084LysSer: 5.084 ± 0.441
4.007LysThr: 4.007 ± 0.434
5.3LysVal: 5.3 ± 0.511
1.034LysTrp: 1.034 ± 0.204
2.37LysTyr: 2.37 ± 0.338
0.0LysXaa: 0.0 ± 0.0
Leu
6.334LeuAla: 6.334 ± 0.517
0.603LeuCys: 0.603 ± 0.187
5.903LeuAsp: 5.903 ± 0.609
7.282LeuGlu: 7.282 ± 0.735
3.576LeuPhe: 3.576 ± 0.428
5.257LeuGly: 5.257 ± 0.468
1.508LeuHis: 1.508 ± 0.299
5.386LeuIle: 5.386 ± 0.463
7.411LeuLys: 7.411 ± 0.514
8.79LeuLeu: 8.79 ± 0.72
2.499LeuMet: 2.499 ± 0.289
5.214LeuAsn: 5.214 ± 0.563
3.921LeuPro: 3.921 ± 0.578
4.395LeuGln: 4.395 ± 0.466
3.792LeuArg: 3.792 ± 0.395
8.101LeuSer: 8.101 ± 0.658
6.032LeuThr: 6.032 ± 0.585
6.075LeuVal: 6.075 ± 0.559
0.345LeuTrp: 0.345 ± 0.106
3.792LeuTyr: 3.792 ± 0.488
0.0LeuXaa: 0.0 ± 0.0
Met
1.68MetAla: 1.68 ± 0.279
0.043MetCys: 0.043 ± 0.045
1.508MetAsp: 1.508 ± 0.26
1.81MetGlu: 1.81 ± 0.282
0.819MetPhe: 0.819 ± 0.188
1.422MetGly: 1.422 ± 0.31
0.086MetHis: 0.086 ± 0.056
1.637MetIle: 1.637 ± 0.268
1.551MetLys: 1.551 ± 0.246
2.068MetLeu: 2.068 ± 0.306
0.776MetMet: 0.776 ± 0.188
0.776MetAsn: 0.776 ± 0.143
0.56MetPro: 0.56 ± 0.154
0.56MetGln: 0.56 ± 0.181
1.077MetArg: 1.077 ± 0.225
1.853MetSer: 1.853 ± 0.295
1.982MetThr: 1.982 ± 0.32
1.422MetVal: 1.422 ± 0.211
0.129MetTrp: 0.129 ± 0.066
0.733MetTyr: 0.733 ± 0.164
0.0MetXaa: 0.0 ± 0.0
Asn
3.533AsnAla: 3.533 ± 0.457
0.259AsnCys: 0.259 ± 0.1
2.758AsnAsp: 2.758 ± 0.384
3.404AsnGlu: 3.404 ± 0.361
2.241AsnPhe: 2.241 ± 0.309
3.706AsnGly: 3.706 ± 0.445
1.379AsnHis: 1.379 ± 0.199
3.275AsnIle: 3.275 ± 0.352
2.801AsnLys: 2.801 ± 0.393
4.826AsnLeu: 4.826 ± 0.556
0.862AsnMet: 0.862 ± 0.162
2.887AsnAsn: 2.887 ± 0.38
2.327AsnPro: 2.327 ± 0.277
3.232AsnGln: 3.232 ± 0.476
2.413AsnArg: 2.413 ± 0.372
2.758AsnSer: 2.758 ± 0.404
2.499AsnThr: 2.499 ± 0.387
2.585AsnVal: 2.585 ± 0.34
0.776AsnTrp: 0.776 ± 0.171
1.724AsnTyr: 1.724 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
1.077ProAla: 1.077 ± 0.177
0.215ProCys: 0.215 ± 0.08
1.853ProAsp: 1.853 ± 0.313
2.542ProGlu: 2.542 ± 0.286
1.12ProPhe: 1.12 ± 0.178
0.776ProGly: 0.776 ± 0.217
0.905ProHis: 0.905 ± 0.219
1.724ProIle: 1.724 ± 0.252
2.413ProLys: 2.413 ± 0.311
3.275ProLeu: 3.275 ± 0.5
0.603ProMet: 0.603 ± 0.142
1.594ProAsn: 1.594 ± 0.26
0.819ProPro: 0.819 ± 0.186
1.12ProGln: 1.12 ± 0.244
1.68ProArg: 1.68 ± 0.206
2.413ProSer: 2.413 ± 0.331
1.939ProThr: 1.939 ± 0.295
1.767ProVal: 1.767 ± 0.319
0.345ProTrp: 0.345 ± 0.121
1.293ProTyr: 1.293 ± 0.197
0.0ProXaa: 0.0 ± 0.0
Gln
4.137GlnAla: 4.137 ± 0.339
0.302GlnCys: 0.302 ± 0.13
2.327GlnAsp: 2.327 ± 0.348
4.093GlnGlu: 4.093 ± 0.398
1.81GlnPhe: 1.81 ± 0.263
2.025GlnGly: 2.025 ± 0.264
0.517GlnHis: 0.517 ± 0.135
2.456GlnIle: 2.456 ± 0.305
2.887GlnLys: 2.887 ± 0.332
4.524GlnLeu: 4.524 ± 0.461
1.25GlnMet: 1.25 ± 0.25
1.896GlnAsn: 1.896 ± 0.275
1.163GlnPro: 1.163 ± 0.258
1.724GlnGln: 1.724 ± 0.248
1.896GlnArg: 1.896 ± 0.332
2.801GlnSer: 2.801 ± 0.373
3.706GlnThr: 3.706 ± 0.608
3.749GlnVal: 3.749 ± 0.46
0.733GlnTrp: 0.733 ± 0.191
1.379GlnTyr: 1.379 ± 0.26
0.0GlnXaa: 0.0 ± 0.0
Arg
2.241ArgAla: 2.241 ± 0.288
0.474ArgCys: 0.474 ± 0.145
2.671ArgAsp: 2.671 ± 0.331
3.404ArgGlu: 3.404 ± 0.35
1.637ArgPhe: 1.637 ± 0.295
2.154ArgGly: 2.154 ± 0.246
0.733ArgHis: 0.733 ± 0.171
3.059ArgIle: 3.059 ± 0.428
3.232ArgLys: 3.232 ± 0.437
5.214ArgLeu: 5.214 ± 0.524
0.56ArgMet: 0.56 ± 0.155
2.111ArgAsn: 2.111 ± 0.281
1.379ArgPro: 1.379 ± 0.241
2.284ArgGln: 2.284 ± 0.303
1.81ArgArg: 1.81 ± 0.354
2.628ArgSer: 2.628 ± 0.303
2.628ArgThr: 2.628 ± 0.437
2.628ArgVal: 2.628 ± 0.369
0.733ArgTrp: 0.733 ± 0.209
1.422ArgTyr: 1.422 ± 0.27
0.0ArgXaa: 0.0 ± 0.0
Ser
4.223SerAla: 4.223 ± 0.496
0.388SerCys: 0.388 ± 0.145
4.266SerAsp: 4.266 ± 0.373
4.266SerGlu: 4.266 ± 0.449
3.318SerPhe: 3.318 ± 0.421
4.697SerGly: 4.697 ± 0.453
1.594SerHis: 1.594 ± 0.252
5.731SerIle: 5.731 ± 0.579
4.869SerLys: 4.869 ± 0.465
6.549SerLeu: 6.549 ± 0.592
1.163SerMet: 1.163 ± 0.215
3.361SerAsn: 3.361 ± 0.42
2.154SerPro: 2.154 ± 0.392
3.059SerGln: 3.059 ± 0.51
3.232SerArg: 3.232 ± 0.332
5.989SerSer: 5.989 ± 0.669
4.223SerThr: 4.223 ± 0.387
4.481SerVal: 4.481 ± 0.433
0.948SerTrp: 0.948 ± 0.19
2.456SerTyr: 2.456 ± 0.326
0.0SerXaa: 0.0 ± 0.0
Thr
4.352ThrAla: 4.352 ± 0.509
0.172ThrCys: 0.172 ± 0.091
3.447ThrAsp: 3.447 ± 0.464
4.481ThrGlu: 4.481 ± 0.487
2.37ThrPhe: 2.37 ± 0.338
3.749ThrGly: 3.749 ± 0.527
1.034ThrHis: 1.034 ± 0.214
4.74ThrIle: 4.74 ± 0.533
4.869ThrLys: 4.869 ± 0.436
5.429ThrLeu: 5.429 ± 0.397
1.25ThrMet: 1.25 ± 0.188
2.801ThrAsn: 2.801 ± 0.33
2.154ThrPro: 2.154 ± 0.326
2.499ThrGln: 2.499 ± 0.421
2.068ThrArg: 2.068 ± 0.341
4.093ThrSer: 4.093 ± 0.635
4.869ThrThr: 4.869 ± 0.519
4.783ThrVal: 4.783 ± 0.476
0.646ThrTrp: 0.646 ± 0.172
2.37ThrTyr: 2.37 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
4.137ValAla: 4.137 ± 0.446
0.517ValCys: 0.517 ± 0.178
3.878ValAsp: 3.878 ± 0.396
4.567ValGlu: 4.567 ± 0.468
2.327ValPhe: 2.327 ± 0.342
3.663ValGly: 3.663 ± 0.505
1.163ValHis: 1.163 ± 0.183
4.481ValIle: 4.481 ± 0.498
4.18ValLys: 4.18 ± 0.363
5.774ValLeu: 5.774 ± 0.427
1.637ValMet: 1.637 ± 0.243
2.844ValAsn: 2.844 ± 0.334
2.025ValPro: 2.025 ± 0.245
2.887ValGln: 2.887 ± 0.439
2.715ValArg: 2.715 ± 0.467
4.869ValSer: 4.869 ± 0.604
3.964ValThr: 3.964 ± 0.402
3.921ValVal: 3.921 ± 0.545
0.689ValTrp: 0.689 ± 0.173
2.758ValTyr: 2.758 ± 0.409
0.0ValXaa: 0.0 ± 0.0
Trp
0.603TrpAla: 0.603 ± 0.184
0.172TrpCys: 0.172 ± 0.09
0.474TrpAsp: 0.474 ± 0.118
1.25TrpGlu: 1.25 ± 0.207
0.517TrpPhe: 0.517 ± 0.182
0.603TrpGly: 0.603 ± 0.162
0.215TrpHis: 0.215 ± 0.091
0.733TrpIle: 0.733 ± 0.225
0.733TrpLys: 0.733 ± 0.174
0.905TrpLeu: 0.905 ± 0.188
0.215TrpMet: 0.215 ± 0.092
0.948TrpAsn: 0.948 ± 0.259
0.172TrpPro: 0.172 ± 0.074
0.733TrpGln: 0.733 ± 0.174
0.388TrpArg: 0.388 ± 0.168
1.12TrpSer: 1.12 ± 0.246
0.733TrpThr: 0.733 ± 0.216
0.733TrpVal: 0.733 ± 0.196
0.172TrpTrp: 0.172 ± 0.079
0.259TrpTyr: 0.259 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.154TyrAla: 2.154 ± 0.207
0.56TyrCys: 0.56 ± 0.153
2.844TyrAsp: 2.844 ± 0.387
2.801TyrGlu: 2.801 ± 0.374
1.508TyrPhe: 1.508 ± 0.283
2.758TyrGly: 2.758 ± 0.38
1.551TyrHis: 1.551 ± 0.265
2.585TyrIle: 2.585 ± 0.363
2.37TyrLys: 2.37 ± 0.394
3.964TyrLeu: 3.964 ± 0.438
0.56TyrMet: 0.56 ± 0.217
2.241TyrAsn: 2.241 ± 0.319
1.508TyrPro: 1.508 ± 0.25
2.284TyrGln: 2.284 ± 0.3
2.068TyrArg: 2.068 ± 0.326
2.844TyrSer: 2.844 ± 0.313
1.982TyrThr: 1.982 ± 0.319
1.939TyrVal: 1.939 ± 0.291
0.388TyrTrp: 0.388 ± 0.104
1.68TyrTyr: 1.68 ± 0.367
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (23209 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski