Amino acid dipepetide frequency for Sporosarcina phage Lietuvens

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.274AlaAla: 7.274 ± 1.137
0.82AlaCys: 0.82 ± 0.195
7.377AlaAsp: 7.377 ± 0.629
6.198AlaGlu: 6.198 ± 0.841
3.074AlaPhe: 3.074 ± 0.775
6.147AlaGly: 6.147 ± 0.716
1.742AlaHis: 1.742 ± 0.264
5.533AlaIle: 5.533 ± 0.567
7.172AlaLys: 7.172 ± 0.724
8.35AlaLeu: 8.35 ± 0.697
2.869AlaMet: 2.869 ± 0.504
3.637AlaAsn: 3.637 ± 0.534
2.766AlaPro: 2.766 ± 0.389
3.074AlaGln: 3.074 ± 0.406
4.252AlaArg: 4.252 ± 0.522
5.789AlaSer: 5.789 ± 0.561
5.276AlaThr: 5.276 ± 0.605
6.506AlaVal: 6.506 ± 0.784
1.127AlaTrp: 1.127 ± 0.302
3.432AlaTyr: 3.432 ± 0.488
0.0AlaXaa: 0.0 ± 0.0
Cys
0.871CysAla: 0.871 ± 0.199
0.102CysCys: 0.102 ± 0.065
0.768CysAsp: 0.768 ± 0.211
0.512CysGlu: 0.512 ± 0.199
0.256CysPhe: 0.256 ± 0.127
0.768CysGly: 0.768 ± 0.219
0.256CysHis: 0.256 ± 0.117
0.41CysIle: 0.41 ± 0.165
0.41CysLys: 0.41 ± 0.165
0.615CysLeu: 0.615 ± 0.181
0.154CysMet: 0.154 ± 0.084
0.205CysAsn: 0.205 ± 0.137
0.307CysPro: 0.307 ± 0.118
0.256CysGln: 0.256 ± 0.112
0.615CysArg: 0.615 ± 0.215
0.461CysSer: 0.461 ± 0.17
0.666CysThr: 0.666 ± 0.206
0.512CysVal: 0.512 ± 0.167
0.0CysTrp: 0.0 ± 0.0
0.359CysTyr: 0.359 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
6.813AspAla: 6.813 ± 0.555
0.666AspCys: 0.666 ± 0.161
4.559AspAsp: 4.559 ± 0.601
6.608AspGlu: 6.608 ± 0.644
2.869AspPhe: 2.869 ± 0.312
5.071AspGly: 5.071 ± 0.466
0.563AspHis: 0.563 ± 0.156
4.303AspIle: 4.303 ± 0.532
4.047AspLys: 4.047 ± 0.454
4.457AspLeu: 4.457 ± 0.536
1.742AspMet: 1.742 ± 0.269
2.664AspAsn: 2.664 ± 0.43
1.69AspPro: 1.69 ± 0.235
0.615AspGln: 0.615 ± 0.164
3.022AspArg: 3.022 ± 0.38
3.176AspSer: 3.176 ± 0.368
3.381AspThr: 3.381 ± 0.529
6.045AspVal: 6.045 ± 0.588
0.922AspTrp: 0.922 ± 0.21
3.022AspTyr: 3.022 ± 0.438
0.0AspXaa: 0.0 ± 0.0
Glu
5.379GluAla: 5.379 ± 0.585
0.922GluCys: 0.922 ± 0.272
3.944GluAsp: 3.944 ± 0.42
4.662GluGlu: 4.662 ± 0.484
2.817GluPhe: 2.817 ± 0.534
4.149GluGly: 4.149 ± 0.462
1.076GluHis: 1.076 ± 0.196
4.969GluIle: 4.969 ± 0.547
4.406GluLys: 4.406 ± 0.652
5.994GluLeu: 5.994 ± 0.567
2.049GluMet: 2.049 ± 0.311
2.459GluAsn: 2.459 ± 0.285
1.998GluPro: 1.998 ± 0.309
2.971GluGln: 2.971 ± 0.399
6.198GluArg: 6.198 ± 0.576
4.303GluSer: 4.303 ± 0.541
3.176GluThr: 3.176 ± 0.388
5.481GluVal: 5.481 ± 0.566
1.076GluTrp: 1.076 ± 0.214
4.252GluTyr: 4.252 ± 0.569
0.0GluXaa: 0.0 ± 0.0
Phe
2.715PheAla: 2.715 ± 0.551
0.256PheCys: 0.256 ± 0.124
2.664PheAsp: 2.664 ± 0.382
3.022PheGlu: 3.022 ± 0.414
0.82PhePhe: 0.82 ± 0.202
2.869PheGly: 2.869 ± 0.325
0.768PheHis: 0.768 ± 0.235
1.998PheIle: 1.998 ± 0.396
2.971PheLys: 2.971 ± 0.351
2.152PheLeu: 2.152 ± 0.327
0.768PheMet: 0.768 ± 0.215
2.203PheAsn: 2.203 ± 0.315
0.768PhePro: 0.768 ± 0.244
0.512PheGln: 0.512 ± 0.211
1.895PheArg: 1.895 ± 0.28
2.459PheSer: 2.459 ± 0.31
2.869PheThr: 2.869 ± 0.382
2.664PheVal: 2.664 ± 0.448
0.41PheTrp: 0.41 ± 0.131
1.332PheTyr: 1.332 ± 0.271
0.0PheXaa: 0.0 ± 0.0
Gly
6.455GlyAla: 6.455 ± 0.748
0.512GlyCys: 0.512 ± 0.156
4.713GlyAsp: 4.713 ± 0.633
5.994GlyGlu: 5.994 ± 0.607
3.227GlyPhe: 3.227 ± 0.434
4.867GlyGly: 4.867 ± 0.567
1.127GlyHis: 1.127 ± 0.239
4.61GlyIle: 4.61 ± 0.516
5.123GlyLys: 5.123 ± 0.597
4.252GlyLeu: 4.252 ± 0.516
1.742GlyMet: 1.742 ± 0.374
2.561GlyAsn: 2.561 ± 0.386
0.512GlyPro: 0.512 ± 0.18
1.69GlyGln: 1.69 ± 0.267
4.149GlyArg: 4.149 ± 0.465
3.74GlySer: 3.74 ± 0.472
4.303GlyThr: 4.303 ± 0.46
6.506GlyVal: 6.506 ± 0.58
0.973GlyTrp: 0.973 ± 0.22
3.586GlyTyr: 3.586 ± 0.412
0.0GlyXaa: 0.0 ± 0.0
His
1.947HisAla: 1.947 ± 0.328
0.154HisCys: 0.154 ± 0.098
1.229HisAsp: 1.229 ± 0.226
1.229HisGlu: 1.229 ± 0.305
0.973HisPhe: 0.973 ± 0.226
1.486HisGly: 1.486 ± 0.222
0.359HisHis: 0.359 ± 0.128
0.768HisIle: 0.768 ± 0.161
1.229HisLys: 1.229 ± 0.259
1.537HisLeu: 1.537 ± 0.325
0.205HisMet: 0.205 ± 0.099
0.615HisAsn: 0.615 ± 0.195
0.461HisPro: 0.461 ± 0.165
0.41HisGln: 0.41 ± 0.172
0.82HisArg: 0.82 ± 0.193
1.588HisSer: 1.588 ± 0.325
0.768HisThr: 0.768 ± 0.201
1.537HisVal: 1.537 ± 0.273
0.205HisTrp: 0.205 ± 0.087
0.82HisTyr: 0.82 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
6.711IleAla: 6.711 ± 0.784
0.359IleCys: 0.359 ± 0.142
4.354IleAsp: 4.354 ± 0.534
5.276IleGlu: 5.276 ± 0.636
1.69IlePhe: 1.69 ± 0.237
3.893IleGly: 3.893 ± 0.54
0.973IleHis: 0.973 ± 0.223
2.971IleIle: 2.971 ± 0.465
4.201IleLys: 4.201 ± 0.468
3.996IleLeu: 3.996 ± 0.44
0.973IleMet: 0.973 ± 0.194
2.254IleAsn: 2.254 ± 0.376
2.305IlePro: 2.305 ± 0.369
1.537IleGln: 1.537 ± 0.254
3.022IleArg: 3.022 ± 0.361
3.74IleSer: 3.74 ± 0.44
5.071IleThr: 5.071 ± 0.546
3.637IleVal: 3.637 ± 0.555
0.307IleTrp: 0.307 ± 0.126
2.561IleTyr: 2.561 ± 0.355
0.0IleXaa: 0.0 ± 0.0
Lys
7.838LysAla: 7.838 ± 0.771
0.461LysCys: 0.461 ± 0.208
3.483LysAsp: 3.483 ± 0.445
4.098LysGlu: 4.098 ± 0.478
2.049LysPhe: 2.049 ± 0.269
3.944LysGly: 3.944 ± 0.488
2.1LysHis: 2.1 ± 0.295
2.869LysIle: 2.869 ± 0.375
3.381LysLys: 3.381 ± 0.522
5.276LysLeu: 5.276 ± 0.421
1.281LysMet: 1.281 ± 0.305
2.817LysAsn: 2.817 ± 0.487
3.791LysPro: 3.791 ± 0.551
2.305LysGln: 2.305 ± 0.307
5.584LysArg: 5.584 ± 0.591
3.227LysSer: 3.227 ± 0.384
4.867LysThr: 4.867 ± 0.502
4.406LysVal: 4.406 ± 0.465
0.973LysTrp: 0.973 ± 0.215
3.125LysTyr: 3.125 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
7.838LeuAla: 7.838 ± 0.608
0.615LeuCys: 0.615 ± 0.206
4.354LeuAsp: 4.354 ± 0.442
5.533LeuGlu: 5.533 ± 0.513
2.356LeuPhe: 2.356 ± 0.425
4.867LeuGly: 4.867 ± 0.432
1.69LeuHis: 1.69 ± 0.271
4.098LeuIle: 4.098 ± 0.533
4.969LeuLys: 4.969 ± 0.434
6.813LeuLeu: 6.813 ± 0.642
1.947LeuMet: 1.947 ± 0.319
4.406LeuAsn: 4.406 ± 0.468
2.817LeuPro: 2.817 ± 0.388
2.254LeuGln: 2.254 ± 0.333
5.328LeuArg: 5.328 ± 0.585
5.071LeuSer: 5.071 ± 0.558
6.198LeuThr: 6.198 ± 0.536
3.893LeuVal: 3.893 ± 0.418
0.563LeuTrp: 0.563 ± 0.189
2.51LeuTyr: 2.51 ± 0.419
0.0LeuXaa: 0.0 ± 0.0
Met
2.1MetAla: 2.1 ± 0.477
0.154MetCys: 0.154 ± 0.099
1.998MetAsp: 1.998 ± 0.337
1.229MetGlu: 1.229 ± 0.245
0.871MetPhe: 0.871 ± 0.174
1.486MetGly: 1.486 ± 0.286
0.359MetHis: 0.359 ± 0.168
1.178MetIle: 1.178 ± 0.208
1.229MetLys: 1.229 ± 0.256
1.793MetLeu: 1.793 ± 0.301
0.512MetMet: 0.512 ± 0.149
0.871MetAsn: 0.871 ± 0.221
1.025MetPro: 1.025 ± 0.186
0.82MetGln: 0.82 ± 0.169
1.742MetArg: 1.742 ± 0.374
1.895MetSer: 1.895 ± 0.334
3.586MetThr: 3.586 ± 0.367
0.615MetVal: 0.615 ± 0.17
0.102MetTrp: 0.102 ± 0.072
0.615MetTyr: 0.615 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
5.123AsnAla: 5.123 ± 0.556
0.256AsnCys: 0.256 ± 0.106
2.817AsnAsp: 2.817 ± 0.335
2.715AsnGlu: 2.715 ± 0.364
1.434AsnPhe: 1.434 ± 0.314
3.842AsnGly: 3.842 ± 0.405
0.461AsnHis: 0.461 ± 0.152
2.869AsnIle: 2.869 ± 0.555
2.254AsnLys: 2.254 ± 0.369
3.33AsnLeu: 3.33 ± 0.381
1.229AsnMet: 1.229 ± 0.198
1.793AsnAsn: 1.793 ± 0.322
1.998AsnPro: 1.998 ± 0.391
0.615AsnGln: 0.615 ± 0.258
1.434AsnArg: 1.434 ± 0.267
2.203AsnSer: 2.203 ± 0.329
2.51AsnThr: 2.51 ± 0.5
3.842AsnVal: 3.842 ± 0.438
0.205AsnTrp: 0.205 ± 0.112
1.178AsnTyr: 1.178 ± 0.22
0.0AsnXaa: 0.0 ± 0.0
Pro
2.613ProAla: 2.613 ± 0.364
0.154ProCys: 0.154 ± 0.096
2.254ProAsp: 2.254 ± 0.374
2.561ProGlu: 2.561 ± 0.314
1.69ProPhe: 1.69 ± 0.331
1.895ProGly: 1.895 ± 0.304
0.615ProHis: 0.615 ± 0.157
2.408ProIle: 2.408 ± 0.34
2.715ProLys: 2.715 ± 0.49
2.356ProLeu: 2.356 ± 0.317
0.615ProMet: 0.615 ± 0.183
1.076ProAsn: 1.076 ± 0.237
1.076ProPro: 1.076 ± 0.233
1.025ProGln: 1.025 ± 0.26
1.178ProArg: 1.178 ± 0.328
1.742ProSer: 1.742 ± 0.299
2.459ProThr: 2.459 ± 0.338
2.203ProVal: 2.203 ± 0.342
0.256ProTrp: 0.256 ± 0.104
1.486ProTyr: 1.486 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
3.74GlnAla: 3.74 ± 0.513
0.205GlnCys: 0.205 ± 0.093
1.69GlnAsp: 1.69 ± 0.324
1.434GlnGlu: 1.434 ± 0.223
1.588GlnPhe: 1.588 ± 0.277
1.742GlnGly: 1.742 ± 0.283
0.82GlnHis: 0.82 ± 0.181
1.69GlnIle: 1.69 ± 0.289
1.076GlnLys: 1.076 ± 0.242
2.356GlnLeu: 2.356 ± 0.378
0.717GlnMet: 0.717 ± 0.201
0.871GlnAsn: 0.871 ± 0.171
0.973GlnPro: 0.973 ± 0.237
0.871GlnGln: 0.871 ± 0.202
2.152GlnArg: 2.152 ± 0.32
1.537GlnSer: 1.537 ± 0.342
1.588GlnThr: 1.588 ± 0.282
1.434GlnVal: 1.434 ± 0.259
0.563GlnTrp: 0.563 ± 0.164
1.434GlnTyr: 1.434 ± 0.241
0.0GlnXaa: 0.0 ± 0.0
Arg
4.867ArgAla: 4.867 ± 0.538
0.41ArgCys: 0.41 ± 0.117
3.842ArgAsp: 3.842 ± 0.522
3.688ArgGlu: 3.688 ± 0.459
1.844ArgPhe: 1.844 ± 0.372
3.791ArgGly: 3.791 ± 0.508
0.871ArgHis: 0.871 ± 0.221
4.047ArgIle: 4.047 ± 0.659
5.942ArgLys: 5.942 ± 0.575
5.584ArgLeu: 5.584 ± 0.608
1.69ArgMet: 1.69 ± 0.3
2.51ArgAsn: 2.51 ± 0.366
1.332ArgPro: 1.332 ± 0.289
1.69ArgGln: 1.69 ± 0.276
3.637ArgArg: 3.637 ± 0.486
3.535ArgSer: 3.535 ± 0.375
3.074ArgThr: 3.074 ± 0.408
3.022ArgVal: 3.022 ± 0.386
0.82ArgTrp: 0.82 ± 0.185
2.356ArgTyr: 2.356 ± 0.327
0.0ArgXaa: 0.0 ± 0.0
Ser
4.508SerAla: 4.508 ± 0.516
0.666SerCys: 0.666 ± 0.234
4.559SerAsp: 4.559 ± 0.51
4.354SerGlu: 4.354 ± 0.615
1.947SerPhe: 1.947 ± 0.284
5.123SerGly: 5.123 ± 0.434
1.281SerHis: 1.281 ± 0.248
3.842SerIle: 3.842 ± 0.486
4.252SerLys: 4.252 ± 0.513
4.354SerLeu: 4.354 ± 0.496
1.332SerMet: 1.332 ± 0.303
2.971SerAsn: 2.971 ± 0.444
2.408SerPro: 2.408 ± 0.404
1.947SerGln: 1.947 ± 0.273
2.869SerArg: 2.869 ± 0.341
2.613SerSer: 2.613 ± 0.339
2.561SerThr: 2.561 ± 0.386
4.406SerVal: 4.406 ± 0.449
0.41SerTrp: 0.41 ± 0.149
1.537SerTyr: 1.537 ± 0.292
0.0SerXaa: 0.0 ± 0.0
Thr
4.508ThrAla: 4.508 ± 0.522
0.461ThrCys: 0.461 ± 0.168
3.944ThrAsp: 3.944 ± 0.448
4.098ThrGlu: 4.098 ± 0.382
2.152ThrPhe: 2.152 ± 0.287
4.508ThrGly: 4.508 ± 0.539
1.486ThrHis: 1.486 ± 0.302
3.944ThrIle: 3.944 ± 0.471
4.098ThrLys: 4.098 ± 0.298
5.225ThrLeu: 5.225 ± 0.484
1.281ThrMet: 1.281 ± 0.338
2.971ThrAsn: 2.971 ± 0.425
3.176ThrPro: 3.176 ± 0.437
2.92ThrGln: 2.92 ± 0.419
3.483ThrArg: 3.483 ± 0.412
3.33ThrSer: 3.33 ± 0.459
3.33ThrThr: 3.33 ± 0.426
5.533ThrVal: 5.533 ± 0.541
0.717ThrTrp: 0.717 ± 0.173
2.254ThrTyr: 2.254 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
5.942ValAla: 5.942 ± 0.56
0.666ValCys: 0.666 ± 0.197
4.559ValAsp: 4.559 ± 0.415
4.918ValGlu: 4.918 ± 0.624
2.561ValPhe: 2.561 ± 0.352
6.25ValGly: 6.25 ± 0.693
0.82ValHis: 0.82 ± 0.22
4.303ValIle: 4.303 ± 0.499
5.328ValLys: 5.328 ± 0.484
5.02ValLeu: 5.02 ± 0.468
1.537ValMet: 1.537 ± 0.267
3.535ValAsn: 3.535 ± 0.606
1.947ValPro: 1.947 ± 0.35
2.1ValGln: 2.1 ± 0.357
3.893ValArg: 3.893 ± 0.502
4.815ValSer: 4.815 ± 0.542
4.406ValThr: 4.406 ± 0.53
4.61ValVal: 4.61 ± 0.552
0.82ValTrp: 0.82 ± 0.228
2.766ValTyr: 2.766 ± 0.351
0.0ValXaa: 0.0 ± 0.0
Trp
0.871TrpAla: 0.871 ± 0.216
0.102TrpCys: 0.102 ± 0.075
0.666TrpAsp: 0.666 ± 0.186
0.512TrpGlu: 0.512 ± 0.151
0.615TrpPhe: 0.615 ± 0.193
1.127TrpGly: 1.127 ± 0.28
0.359TrpHis: 0.359 ± 0.111
0.41TrpIle: 0.41 ± 0.129
0.922TrpLys: 0.922 ± 0.21
1.127TrpLeu: 1.127 ± 0.208
0.359TrpMet: 0.359 ± 0.132
0.461TrpAsn: 0.461 ± 0.122
0.051TrpPro: 0.051 ± 0.048
0.256TrpGln: 0.256 ± 0.104
0.717TrpArg: 0.717 ± 0.217
0.973TrpSer: 0.973 ± 0.202
0.512TrpThr: 0.512 ± 0.18
0.41TrpVal: 0.41 ± 0.147
0.205TrpTrp: 0.205 ± 0.143
0.359TrpTyr: 0.359 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.303TyrAla: 4.303 ± 0.444
0.563TyrCys: 0.563 ± 0.222
2.51TyrAsp: 2.51 ± 0.406
3.637TyrGlu: 3.637 ± 0.475
1.281TyrPhe: 1.281 ± 0.212
2.817TyrGly: 2.817 ± 0.358
0.615TyrHis: 0.615 ± 0.186
2.613TyrIle: 2.613 ± 0.345
2.152TyrLys: 2.152 ± 0.328
3.535TyrLeu: 3.535 ± 0.429
1.127TyrMet: 1.127 ± 0.23
1.229TyrAsn: 1.229 ± 0.289
0.973TyrPro: 0.973 ± 0.253
0.615TyrGln: 0.615 ± 0.231
2.561TyrArg: 2.561 ± 0.389
1.742TyrSer: 1.742 ± 0.262
2.817TyrThr: 2.817 ± 0.378
3.535TyrVal: 3.535 ± 0.494
0.359TyrTrp: 0.359 ± 0.123
1.793TyrTyr: 1.793 ± 0.315
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 100 proteins (19522 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski