Amino acid dipepetide frequency for Acinetobacter phage AP22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.609AlaAla: 4.609 ± 0.958
0.851AlaCys: 0.851 ± 0.264
3.474AlaAsp: 3.474 ± 0.415
4.042AlaGlu: 4.042 ± 0.528
2.765AlaPhe: 2.765 ± 0.448
4.254AlaGly: 4.254 ± 0.573
1.347AlaHis: 1.347 ± 0.314
5.389AlaIle: 5.389 ± 0.594
4.963AlaLys: 4.963 ± 0.662
6.665AlaLeu: 6.665 ± 0.736
1.985AlaMet: 1.985 ± 0.481
3.333AlaAsn: 3.333 ± 0.535
2.836AlaPro: 2.836 ± 0.337
3.474AlaGln: 3.474 ± 0.551
2.056AlaArg: 2.056 ± 0.301
4.184AlaSer: 4.184 ± 0.611
5.105AlaThr: 5.105 ± 0.733
4.184AlaVal: 4.184 ± 0.518
1.205AlaTrp: 1.205 ± 0.226
2.836AlaTyr: 2.836 ± 0.491
0.0AlaXaa: 0.0 ± 0.0
Cys
0.567CysAla: 0.567 ± 0.192
0.142CysCys: 0.142 ± 0.107
1.276CysAsp: 1.276 ± 0.298
0.993CysGlu: 0.993 ± 0.307
0.78CysPhe: 0.78 ± 0.246
0.922CysGly: 0.922 ± 0.236
0.142CysHis: 0.142 ± 0.093
0.496CysIle: 0.496 ± 0.16
1.135CysLys: 1.135 ± 0.294
1.135CysLeu: 1.135 ± 0.27
0.355CysMet: 0.355 ± 0.205
0.496CysAsn: 0.496 ± 0.201
0.355CysPro: 0.355 ± 0.148
0.355CysGln: 0.355 ± 0.13
0.496CysArg: 0.496 ± 0.227
0.709CysSer: 0.709 ± 0.236
0.567CysThr: 0.567 ± 0.184
0.851CysVal: 0.851 ± 0.237
0.142CysTrp: 0.142 ± 0.106
0.355CysTyr: 0.355 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
4.68AspAla: 4.68 ± 0.573
0.78AspCys: 0.78 ± 0.239
4.184AspAsp: 4.184 ± 0.494
3.971AspGlu: 3.971 ± 0.604
3.404AspPhe: 3.404 ± 0.439
4.751AspGly: 4.751 ± 0.705
0.638AspHis: 0.638 ± 0.222
3.049AspIle: 3.049 ± 0.472
4.751AspLys: 4.751 ± 0.767
5.389AspLeu: 5.389 ± 0.508
1.631AspMet: 1.631 ± 0.334
3.12AspAsn: 3.12 ± 0.426
1.844AspPro: 1.844 ± 0.298
2.482AspGln: 2.482 ± 0.387
2.624AspArg: 2.624 ± 0.386
3.404AspSer: 3.404 ± 0.611
2.907AspThr: 2.907 ± 0.444
4.609AspVal: 4.609 ± 0.532
0.922AspTrp: 0.922 ± 0.226
2.553AspTyr: 2.553 ± 0.347
0.0AspXaa: 0.0 ± 0.0
Glu
4.538GluAla: 4.538 ± 0.607
0.355GluCys: 0.355 ± 0.186
3.758GluAsp: 3.758 ± 0.631
3.616GluGlu: 3.616 ± 0.614
3.474GluPhe: 3.474 ± 0.648
4.254GluGly: 4.254 ± 0.539
1.064GluHis: 1.064 ± 0.259
5.389GluIle: 5.389 ± 0.651
5.318GluLys: 5.318 ± 0.713
5.673GluLeu: 5.673 ± 0.768
2.127GluMet: 2.127 ± 0.4
3.616GluAsn: 3.616 ± 0.423
1.773GluPro: 1.773 ± 0.411
3.191GluGln: 3.191 ± 0.46
2.694GluArg: 2.694 ± 0.442
4.822GluSer: 4.822 ± 0.523
2.907GluThr: 2.907 ± 0.404
4.042GluVal: 4.042 ± 0.485
0.993GluTrp: 0.993 ± 0.264
3.262GluTyr: 3.262 ± 0.471
0.0GluXaa: 0.0 ± 0.0
Phe
3.474PheAla: 3.474 ± 0.47
0.78PheCys: 0.78 ± 0.254
2.907PheAsp: 2.907 ± 0.419
3.049PheGlu: 3.049 ± 0.416
1.773PhePhe: 1.773 ± 0.399
3.9PheGly: 3.9 ± 0.536
0.425PheHis: 0.425 ± 0.185
3.262PheIle: 3.262 ± 0.496
3.333PheLys: 3.333 ± 0.4
3.474PheLeu: 3.474 ± 0.473
1.276PheMet: 1.276 ± 0.341
2.836PheAsn: 2.836 ± 0.57
1.347PhePro: 1.347 ± 0.337
1.418PheGln: 1.418 ± 0.356
1.418PheArg: 1.418 ± 0.325
1.844PheSer: 1.844 ± 0.32
2.482PheThr: 2.482 ± 0.457
2.198PheVal: 2.198 ± 0.391
0.78PheTrp: 0.78 ± 0.293
2.269PheTyr: 2.269 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
5.389GlyAla: 5.389 ± 0.625
0.922GlyCys: 0.922 ± 0.279
4.254GlyAsp: 4.254 ± 0.611
4.538GlyGlu: 4.538 ± 0.578
3.9GlyPhe: 3.9 ± 0.512
5.46GlyGly: 5.46 ± 0.74
1.205GlyHis: 1.205 ± 0.256
4.893GlyIle: 4.893 ± 0.593
4.893GlyLys: 4.893 ± 0.624
5.034GlyLeu: 5.034 ± 0.536
2.907GlyMet: 2.907 ± 0.514
3.687GlyAsn: 3.687 ± 0.565
0.638GlyPro: 0.638 ± 0.281
2.056GlyGln: 2.056 ± 0.411
2.269GlyArg: 2.269 ± 0.375
5.46GlySer: 5.46 ± 0.599
2.978GlyThr: 2.978 ± 0.471
5.673GlyVal: 5.673 ± 0.701
0.922GlyTrp: 0.922 ± 0.262
3.262GlyTyr: 3.262 ± 0.435
0.0GlyXaa: 0.0 ± 0.0
His
1.205HisAla: 1.205 ± 0.295
0.355HisCys: 0.355 ± 0.132
1.135HisAsp: 1.135 ± 0.281
1.418HisGlu: 1.418 ± 0.355
0.496HisPhe: 0.496 ± 0.223
1.418HisGly: 1.418 ± 0.352
0.355HisHis: 0.355 ± 0.157
1.276HisIle: 1.276 ± 0.24
1.418HisLys: 1.418 ± 0.315
1.347HisLeu: 1.347 ± 0.308
0.355HisMet: 0.355 ± 0.155
0.922HisAsn: 0.922 ± 0.271
0.496HisPro: 0.496 ± 0.186
0.993HisGln: 0.993 ± 0.266
0.355HisArg: 0.355 ± 0.164
0.709HisSer: 0.709 ± 0.23
0.638HisThr: 0.638 ± 0.197
0.851HisVal: 0.851 ± 0.226
0.355HisTrp: 0.355 ± 0.132
0.425HisTyr: 0.425 ± 0.164
0.0HisXaa: 0.0 ± 0.0
Ile
4.467IleAla: 4.467 ± 0.616
0.78IleCys: 0.78 ± 0.283
4.893IleAsp: 4.893 ± 0.537
6.594IleGlu: 6.594 ± 0.608
1.914IlePhe: 1.914 ± 0.318
5.247IleGly: 5.247 ± 0.641
1.205IleHis: 1.205 ± 0.32
3.687IleIle: 3.687 ± 0.519
6.453IleLys: 6.453 ± 0.693
4.68IleLeu: 4.68 ± 0.65
1.347IleMet: 1.347 ± 0.298
3.687IleAsn: 3.687 ± 0.414
3.971IlePro: 3.971 ± 0.561
2.198IleGln: 2.198 ± 0.339
2.482IleArg: 2.482 ± 0.389
4.538IleSer: 4.538 ± 0.584
3.9IleThr: 3.9 ± 0.6
4.396IleVal: 4.396 ± 0.505
0.709IleTrp: 0.709 ± 0.231
2.056IleTyr: 2.056 ± 0.375
0.0IleXaa: 0.0 ± 0.0
Lys
5.389LysAla: 5.389 ± 0.871
0.851LysCys: 0.851 ± 0.263
4.68LysAsp: 4.68 ± 0.521
4.893LysGlu: 4.893 ± 0.665
3.12LysPhe: 3.12 ± 0.489
6.24LysGly: 6.24 ± 0.594
1.702LysHis: 1.702 ± 0.359
4.751LysIle: 4.751 ± 0.592
5.814LysLys: 5.814 ± 0.781
6.382LysLeu: 6.382 ± 0.542
2.694LysMet: 2.694 ± 0.531
4.467LysAsn: 4.467 ± 0.46
2.127LysPro: 2.127 ± 0.462
2.482LysGln: 2.482 ± 0.433
3.758LysArg: 3.758 ± 0.54
3.616LysSer: 3.616 ± 0.472
4.963LysThr: 4.963 ± 0.619
5.814LysVal: 5.814 ± 0.652
0.709LysTrp: 0.709 ± 0.229
2.34LysTyr: 2.34 ± 0.419
0.0LysXaa: 0.0 ± 0.0
Leu
6.24LeuAla: 6.24 ± 0.638
0.78LeuCys: 0.78 ± 0.227
5.389LeuAsp: 5.389 ± 0.607
5.885LeuGlu: 5.885 ± 0.67
3.262LeuPhe: 3.262 ± 0.601
5.176LeuGly: 5.176 ± 0.69
1.56LeuHis: 1.56 ± 0.416
7.303LeuIle: 7.303 ± 0.98
7.303LeuLys: 7.303 ± 0.633
5.743LeuLeu: 5.743 ± 0.575
2.127LeuMet: 2.127 ± 0.459
6.523LeuAsn: 6.523 ± 0.633
1.844LeuPro: 1.844 ± 0.387
1.844LeuGln: 1.844 ± 0.362
3.404LeuArg: 3.404 ± 0.476
5.247LeuSer: 5.247 ± 0.694
4.609LeuThr: 4.609 ± 0.52
4.963LeuVal: 4.963 ± 0.555
0.922LeuTrp: 0.922 ± 0.248
2.482LeuTyr: 2.482 ± 0.379
0.0LeuXaa: 0.0 ± 0.0
Met
2.482MetAla: 2.482 ± 0.388
0.355MetCys: 0.355 ± 0.162
1.631MetAsp: 1.631 ± 0.439
1.064MetGlu: 1.064 ± 0.263
0.922MetPhe: 0.922 ± 0.279
2.127MetGly: 2.127 ± 0.439
0.567MetHis: 0.567 ± 0.255
1.773MetIle: 1.773 ± 0.408
2.056MetLys: 2.056 ± 0.429
2.269MetLeu: 2.269 ± 0.382
0.709MetMet: 0.709 ± 0.248
2.553MetAsn: 2.553 ± 0.375
0.922MetPro: 0.922 ± 0.228
1.489MetGln: 1.489 ± 0.386
1.347MetArg: 1.347 ± 0.347
2.907MetSer: 2.907 ± 0.473
2.269MetThr: 2.269 ± 0.338
1.064MetVal: 1.064 ± 0.31
0.213MetTrp: 0.213 ± 0.127
0.425MetTyr: 0.425 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
3.616AsnAla: 3.616 ± 0.477
1.064AsnCys: 1.064 ± 0.309
3.687AsnAsp: 3.687 ± 0.456
4.467AsnGlu: 4.467 ± 0.502
2.269AsnPhe: 2.269 ± 0.427
5.956AsnGly: 5.956 ± 0.673
0.851AsnHis: 0.851 ± 0.268
3.687AsnIle: 3.687 ± 0.554
3.12AsnLys: 3.12 ± 0.416
4.609AsnLeu: 4.609 ± 0.601
1.205AsnMet: 1.205 ± 0.289
3.404AsnAsn: 3.404 ± 0.57
2.765AsnPro: 2.765 ± 0.465
2.198AsnGln: 2.198 ± 0.315
2.056AsnArg: 2.056 ± 0.374
3.829AsnSer: 3.829 ± 0.594
3.616AsnThr: 3.616 ± 0.447
2.978AsnVal: 2.978 ± 0.617
0.709AsnTrp: 0.709 ± 0.214
2.694AsnTyr: 2.694 ± 0.456
0.0AsnXaa: 0.0 ± 0.0
Pro
1.418ProAla: 1.418 ± 0.313
0.425ProCys: 0.425 ± 0.142
1.773ProAsp: 1.773 ± 0.322
2.765ProGlu: 2.765 ± 0.38
1.205ProPhe: 1.205 ± 0.355
0.0ProGly: 0.0 ± 0.0
0.567ProHis: 0.567 ± 0.213
2.127ProIle: 2.127 ± 0.369
2.836ProLys: 2.836 ± 0.519
3.12ProLeu: 3.12 ± 0.487
1.276ProMet: 1.276 ± 0.306
2.482ProAsn: 2.482 ± 0.359
0.851ProPro: 0.851 ± 0.213
1.773ProGln: 1.773 ± 0.346
0.78ProArg: 0.78 ± 0.215
2.624ProSer: 2.624 ± 0.364
1.56ProThr: 1.56 ± 0.276
2.127ProVal: 2.127 ± 0.309
0.213ProTrp: 0.213 ± 0.125
1.985ProTyr: 1.985 ± 0.393
0.0ProXaa: 0.0 ± 0.0
Gln
3.191GlnAla: 3.191 ± 0.534
0.496GlnCys: 0.496 ± 0.219
2.553GlnAsp: 2.553 ± 0.437
2.127GlnGlu: 2.127 ± 0.346
1.702GlnPhe: 1.702 ± 0.416
2.127GlnGly: 2.127 ± 0.485
0.851GlnHis: 0.851 ± 0.258
2.694GlnIle: 2.694 ± 0.47
2.482GlnLys: 2.482 ± 0.358
3.758GlnLeu: 3.758 ± 0.729
0.993GlnMet: 0.993 ± 0.296
1.844GlnAsn: 1.844 ± 0.372
0.78GlnPro: 0.78 ± 0.229
1.489GlnGln: 1.489 ± 0.284
0.993GlnArg: 0.993 ± 0.317
2.624GlnSer: 2.624 ± 0.462
1.773GlnThr: 1.773 ± 0.343
2.553GlnVal: 2.553 ± 0.418
0.78GlnTrp: 0.78 ± 0.241
2.198GlnTyr: 2.198 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
2.127ArgAla: 2.127 ± 0.394
0.851ArgCys: 0.851 ± 0.26
2.127ArgAsp: 2.127 ± 0.347
2.978ArgGlu: 2.978 ± 0.547
1.56ArgPhe: 1.56 ± 0.358
1.914ArgGly: 1.914 ± 0.362
1.064ArgHis: 1.064 ± 0.249
2.34ArgIle: 2.34 ± 0.418
3.12ArgLys: 3.12 ± 0.53
3.404ArgLeu: 3.404 ± 0.478
0.993ArgMet: 0.993 ± 0.241
1.631ArgAsn: 1.631 ± 0.322
1.135ArgPro: 1.135 ± 0.244
1.631ArgGln: 1.631 ± 0.358
1.276ArgArg: 1.276 ± 0.4
2.765ArgSer: 2.765 ± 0.439
1.56ArgThr: 1.56 ± 0.361
2.694ArgVal: 2.694 ± 0.511
0.567ArgTrp: 0.567 ± 0.176
1.276ArgTyr: 1.276 ± 0.263
0.0ArgXaa: 0.0 ± 0.0
Ser
4.254SerAla: 4.254 ± 0.559
0.496SerCys: 0.496 ± 0.159
3.333SerAsp: 3.333 ± 0.46
3.687SerGlu: 3.687 ± 0.563
2.836SerPhe: 2.836 ± 0.591
4.893SerGly: 4.893 ± 0.599
0.355SerHis: 0.355 ± 0.14
5.673SerIle: 5.673 ± 0.635
4.893SerLys: 4.893 ± 0.538
5.176SerLeu: 5.176 ± 0.444
2.198SerMet: 2.198 ± 0.432
3.687SerAsn: 3.687 ± 0.496
1.631SerPro: 1.631 ± 0.334
2.836SerGln: 2.836 ± 0.467
2.269SerArg: 2.269 ± 0.428
2.836SerSer: 2.836 ± 0.526
3.616SerThr: 3.616 ± 0.512
4.325SerVal: 4.325 ± 0.499
1.489SerTrp: 1.489 ± 0.318
1.985SerTyr: 1.985 ± 0.419
0.0SerXaa: 0.0 ± 0.0
Thr
4.325ThrAla: 4.325 ± 0.676
0.638ThrCys: 0.638 ± 0.207
3.262ThrAsp: 3.262 ± 0.537
2.34ThrGlu: 2.34 ± 0.409
2.127ThrPhe: 2.127 ± 0.394
3.829ThrGly: 3.829 ± 0.545
0.993ThrHis: 0.993 ± 0.295
3.12ThrIle: 3.12 ± 0.45
3.758ThrLys: 3.758 ± 0.596
4.538ThrLeu: 4.538 ± 0.629
1.489ThrMet: 1.489 ± 0.254
2.836ThrAsn: 2.836 ± 0.462
2.198ThrPro: 2.198 ± 0.406
1.914ThrGln: 1.914 ± 0.399
2.056ThrArg: 2.056 ± 0.39
2.836ThrSer: 2.836 ± 0.451
2.553ThrThr: 2.553 ± 0.514
4.68ThrVal: 4.68 ± 0.707
1.56ThrTrp: 1.56 ± 0.311
1.773ThrTyr: 1.773 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
4.113ValAla: 4.113 ± 0.586
0.355ValCys: 0.355 ± 0.173
3.474ValAsp: 3.474 ± 0.423
4.963ValGlu: 4.963 ± 0.584
3.616ValPhe: 3.616 ± 0.467
4.467ValGly: 4.467 ± 0.628
0.851ValHis: 0.851 ± 0.275
5.034ValIle: 5.034 ± 0.576
5.531ValLys: 5.531 ± 0.678
5.885ValLeu: 5.885 ± 0.701
2.056ValMet: 2.056 ± 0.428
4.184ValAsn: 4.184 ± 0.584
2.34ValPro: 2.34 ± 0.37
2.198ValGln: 2.198 ± 0.593
2.127ValArg: 2.127 ± 0.42
3.687ValSer: 3.687 ± 0.515
3.049ValThr: 3.049 ± 0.521
4.042ValVal: 4.042 ± 0.565
1.135ValTrp: 1.135 ± 0.276
2.624ValTyr: 2.624 ± 0.454
0.0ValXaa: 0.0 ± 0.0
Trp
0.993TrpAla: 0.993 ± 0.298
0.496TrpCys: 0.496 ± 0.162
0.78TrpAsp: 0.78 ± 0.238
0.922TrpGlu: 0.922 ± 0.293
0.922TrpPhe: 0.922 ± 0.22
0.567TrpGly: 0.567 ± 0.168
0.284TrpHis: 0.284 ± 0.138
1.064TrpIle: 1.064 ± 0.252
0.993TrpLys: 0.993 ± 0.302
1.56TrpLeu: 1.56 ± 0.283
0.496TrpMet: 0.496 ± 0.196
1.276TrpAsn: 1.276 ± 0.323
0.284TrpPro: 0.284 ± 0.136
0.567TrpGln: 0.567 ± 0.198
0.709TrpArg: 0.709 ± 0.24
1.064TrpSer: 1.064 ± 0.317
0.567TrpThr: 0.567 ± 0.196
1.56TrpVal: 1.56 ± 0.366
0.284TrpTrp: 0.284 ± 0.14
0.142TrpTyr: 0.142 ± 0.125
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.482TyrAla: 2.482 ± 0.368
0.496TyrCys: 0.496 ± 0.18
3.12TyrAsp: 3.12 ± 0.497
2.553TyrGlu: 2.553 ± 0.44
2.198TyrPhe: 2.198 ± 0.339
2.836TyrGly: 2.836 ± 0.467
0.496TyrHis: 0.496 ± 0.236
2.198TyrIle: 2.198 ± 0.365
2.624TyrLys: 2.624 ± 0.485
2.765TyrLeu: 2.765 ± 0.473
0.851TyrMet: 0.851 ± 0.254
2.269TyrAsn: 2.269 ± 0.343
1.773TyrPro: 1.773 ± 0.39
1.418TyrGln: 1.418 ± 0.328
1.844TyrArg: 1.844 ± 0.38
2.765TyrSer: 2.765 ± 0.433
1.205TyrThr: 1.205 ± 0.306
2.127TyrVal: 2.127 ± 0.359
0.922TyrTrp: 0.922 ± 0.268
1.56TyrTyr: 1.56 ± 0.332
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (14104 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski