Amino acid dipeptide frequency for Pseudoalteromonas phage XCL1123

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
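As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name, the toy sequences, and the choice to normalise by the total number of pairs are assumptions made for illustration; the page does not state how the database itself counts or normalises.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping adjacent pairs are counted, pairs never span two
    proteins, and the total number of pairs is used as the denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


toy_proteins = ["MAAK", "MKA"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / 400
```

With the toy input there are five pairs in total (MA, AA, AK, MK, KA), so each observed dipeptide gets 200‰ and every other entry is 0.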

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
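The conversion can be illustrated with a short Python sketch that takes the AlaAla value from the table (7.547‰), turns it into a fraction, and compares it with the uniform expectation of 1/400. The helper name is hypothetical, and the ratio is only a rough indicator because it ignores the differing abundances of the individual amino acids.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_per_mille = 7.547  # AlaAla entry from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)

uniform_fraction = 1 / 400  # 0.25% if all 400 dipeptides were equally likely
ratio_to_uniform = ala_ala_fraction / uniform_fraction

print(f"fraction: {ala_ala_fraction:.6f}, ratio to uniform: {ratio_to_uniform:.2f}")
```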

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.547 ± 0.838
AlaCys: 1.068 ± 0.25
AlaAsp: 4.699 ± 0.592
AlaGlu: 4.77 ± 0.517
AlaPhe: 2.919 ± 0.482
AlaGly: 5.838 ± 0.737
AlaHis: 1.282 ± 0.27
AlaIle: 4.343 ± 0.589
AlaLys: 4.699 ± 0.52
AlaLeu: 7.12 ± 0.834
AlaMet: 2.777 ± 0.447
AlaAsn: 5.126 ± 0.639
AlaPro: 2.919 ± 0.454
AlaGln: 2.919 ± 0.518
AlaArg: 3.489 ± 0.533
AlaSer: 5.055 ± 0.598
AlaThr: 3.987 ± 0.57
AlaVal: 5.126 ± 0.487
AlaTrp: 0.926 ± 0.269
AlaTyr: 3.062 ± 0.364
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.068 ± 0.32
CysCys: 0.356 ± 0.162
CysAsp: 0.712 ± 0.237
CysGlu: 1.282 ± 0.296
CysPhe: 0.427 ± 0.191
CysGly: 1.139 ± 0.318
CysHis: 0.071 ± 0.075
CysIle: 0.926 ± 0.24
CysLys: 1.21 ± 0.345
CysLeu: 1.068 ± 0.302
CysMet: 0.142 ± 0.093
CysAsn: 0.712 ± 0.225
CysPro: 0.356 ± 0.15
CysGln: 0.285 ± 0.134
CysArg: 0.427 ± 0.181
CysSer: 0.57 ± 0.19
CysThr: 0.285 ± 0.133
CysVal: 1.21 ± 0.361
CysTrp: 0.142 ± 0.105
CysTyr: 0.783 ± 0.241
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.842 ± 0.711
AspCys: 1.068 ± 0.303
AspAsp: 4.557 ± 0.492
AspGlu: 4.699 ± 0.613
AspPhe: 2.492 ± 0.357
AspGly: 4.557 ± 0.655
AspHis: 0.498 ± 0.203
AspIle: 4.058 ± 0.648
AspLys: 5.126 ± 0.574
AspLeu: 4.557 ± 0.522
AspMet: 1.353 ± 0.302
AspAsn: 3.275 ± 0.595
AspPro: 1.851 ± 0.271
AspGln: 1.566 ± 0.277
AspArg: 1.851 ± 0.341
AspSer: 4.699 ± 0.586
AspThr: 3.56 ± 0.385
AspVal: 3.702 ± 0.619
AspTrp: 0.997 ± 0.316
AspTyr: 3.275 ± 0.427
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.838 ± 0.526
GluCys: 0.926 ± 0.277
GluAsp: 2.99 ± 0.44
GluGlu: 3.133 ± 0.527
GluPhe: 3.133 ± 0.388
GluGly: 4.913 ± 0.462
GluHis: 1.353 ± 0.34
GluIle: 5.34 ± 0.714
GluLys: 3.702 ± 0.726
GluLeu: 6.266 ± 0.706
GluMet: 1.709 ± 0.339
GluAsn: 2.848 ± 0.428
GluPro: 1.353 ± 0.327
GluGln: 4.628 ± 0.643
GluArg: 2.777 ± 0.416
GluSer: 4.201 ± 0.474
GluThr: 2.492 ± 0.3
GluVal: 4.201 ± 0.599
GluTrp: 0.997 ± 0.255
GluTyr: 3.204 ± 0.448
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.848 ± 0.489
PheCys: 0.712 ± 0.226
PheAsp: 2.99 ± 0.473
PheGlu: 2.919 ± 0.524
PhePhe: 0.926 ± 0.212
PheGly: 2.563 ± 0.416
PheHis: 0.783 ± 0.22
PheIle: 2.207 ± 0.378
PheLys: 2.634 ± 0.404
PheLeu: 2.136 ± 0.443
PheMet: 0.997 ± 0.302
PheAsn: 3.702 ± 0.526
PhePro: 0.783 ± 0.205
PheGln: 0.854 ± 0.303
PheArg: 0.712 ± 0.25
PheSer: 2.99 ± 0.457
PheThr: 2.563 ± 0.443
PheVal: 3.204 ± 0.424
PheTrp: 0.641 ± 0.241
PheTyr: 1.709 ± 0.249
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.91 ± 0.819
GlyCys: 1.068 ± 0.252
GlyAsp: 4.699 ± 0.757
GlyGlu: 4.913 ± 0.59
GlyPhe: 3.418 ± 0.471
GlyGly: 5.91 ± 0.814
GlyHis: 0.783 ± 0.242
GlyIle: 4.486 ± 0.517
GlyLys: 5.554 ± 0.71
GlyLeu: 5.34 ± 0.723
GlyMet: 1.21 ± 0.311
GlyAsn: 4.272 ± 0.71
GlyPro: 0.997 ± 0.229
GlyGln: 2.278 ± 0.511
GlyArg: 2.99 ± 0.408
GlySer: 5.34 ± 0.742
GlyThr: 4.414 ± 0.86
GlyVal: 5.055 ± 0.436
GlyTrp: 0.997 ± 0.267
GlyTyr: 2.706 ± 0.446
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.21 ± 0.316
HisCys: 0.57 ± 0.2
HisAsp: 0.427 ± 0.166
HisGlu: 0.926 ± 0.249
HisPhe: 0.997 ± 0.255
HisGly: 1.424 ± 0.382
HisHis: 0.214 ± 0.124
HisIle: 0.997 ± 0.248
HisLys: 0.712 ± 0.227
HisLeu: 1.21 ± 0.248
HisMet: 0.427 ± 0.191
HisAsn: 0.926 ± 0.253
HisPro: 0.356 ± 0.162
HisGln: 0.498 ± 0.202
HisArg: 0.854 ± 0.219
HisSer: 0.783 ± 0.229
HisThr: 0.427 ± 0.24
HisVal: 0.498 ± 0.202
HisTrp: 0.0 ± 0.0
HisTyr: 0.783 ± 0.293
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.767 ± 0.677
IleCys: 0.427 ± 0.16
IleAsp: 4.201 ± 0.52
IleGlu: 4.984 ± 0.495
IlePhe: 1.566 ± 0.374
IleGly: 3.774 ± 0.578
IleHis: 0.783 ± 0.24
IleIle: 3.204 ± 0.528
IleLys: 5.411 ± 0.625
IleLeu: 3.845 ± 0.572
IleMet: 1.638 ± 0.268
IleAsn: 4.913 ± 0.745
IlePro: 2.421 ± 0.372
IleGln: 2.563 ± 0.384
IleArg: 2.848 ± 0.488
IleSer: 3.56 ± 0.535
IleThr: 3.489 ± 0.434
IleVal: 3.916 ± 0.471
IleTrp: 0.57 ± 0.219
IleTyr: 2.563 ± 0.459
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.625 ± 0.815
LysCys: 0.783 ± 0.249
LysAsp: 3.56 ± 0.483
LysGlu: 3.987 ± 0.522
LysPhe: 2.35 ± 0.354
LysGly: 4.699 ± 0.605
LysHis: 1.068 ± 0.304
LysIle: 3.916 ± 0.496
LysLys: 5.055 ± 0.754
LysLeu: 6.194 ± 0.594
LysMet: 2.563 ± 0.503
LysAsn: 3.56 ± 0.53
LysPro: 3.489 ± 0.565
LysGln: 3.275 ± 0.421
LysArg: 2.848 ± 0.462
LysSer: 4.414 ± 0.512
LysThr: 4.13 ± 0.478
LysVal: 4.557 ± 0.513
LysTrp: 0.997 ± 0.255
LysTyr: 2.919 ± 0.538
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.337 ± 0.805
LeuCys: 1.068 ± 0.301
LeuAsp: 5.34 ± 0.591
LeuGlu: 4.201 ± 0.521
LeuPhe: 2.634 ± 0.416
LeuGly: 4.984 ± 0.643
LeuHis: 1.495 ± 0.311
LeuIle: 4.628 ± 0.56
LeuLys: 4.628 ± 0.654
LeuLeu: 5.482 ± 0.666
LeuMet: 2.136 ± 0.33
LeuAsn: 4.486 ± 0.546
LeuPro: 2.278 ± 0.366
LeuGln: 2.99 ± 0.479
LeuArg: 3.275 ± 0.476
LeuSer: 6.337 ± 0.675
LeuThr: 5.482 ± 0.665
LeuVal: 5.126 ± 0.544
LeuTrp: 0.712 ± 0.222
LeuTyr: 1.495 ± 0.314
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.851 ± 0.327
MetCys: 0.071 ± 0.066
MetAsp: 1.282 ± 0.317
MetGlu: 1.922 ± 0.378
MetPhe: 0.641 ± 0.21
MetGly: 1.282 ± 0.379
MetHis: 0.57 ± 0.173
MetIle: 1.78 ± 0.312
MetLys: 2.065 ± 0.402
MetLeu: 1.994 ± 0.369
MetMet: 0.641 ± 0.228
MetAsn: 2.421 ± 0.435
MetPro: 0.641 ± 0.205
MetGln: 0.926 ± 0.202
MetArg: 1.922 ± 0.311
MetSer: 2.207 ± 0.458
MetThr: 1.78 ± 0.331
MetVal: 1.353 ± 0.372
MetTrp: 0.427 ± 0.168
MetTyr: 0.854 ± 0.225
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.554 ± 0.732
AsnCys: 0.214 ± 0.113
AsnAsp: 4.414 ± 0.587
AsnGlu: 3.845 ± 0.466
AsnPhe: 2.706 ± 0.494
AsnGly: 4.842 ± 0.555
AsnHis: 1.139 ± 0.304
AsnIle: 3.275 ± 0.477
AsnLys: 4.984 ± 0.521
AsnLeu: 4.414 ± 0.514
AsnMet: 1.709 ± 0.317
AsnAsn: 3.56 ± 0.464
AsnPro: 3.133 ± 0.484
AsnGln: 1.78 ± 0.377
AsnArg: 2.065 ± 0.367
AsnSer: 4.13 ± 0.512
AsnThr: 2.99 ± 0.434
AsnVal: 2.99 ± 0.534
AsnTrp: 0.926 ± 0.278
AsnTyr: 1.922 ± 0.396
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.35 ± 0.367
ProCys: 0.498 ± 0.186
ProAsp: 2.065 ± 0.466
ProGlu: 2.563 ± 0.402
ProPhe: 1.566 ± 0.304
ProGly: 0.712 ± 0.209
ProHis: 0.641 ± 0.208
ProIle: 3.062 ± 0.41
ProLys: 1.638 ± 0.421
ProLeu: 2.065 ± 0.411
ProMet: 0.641 ± 0.21
ProAsn: 1.638 ± 0.291
ProPro: 1.353 ± 0.306
ProGln: 1.566 ± 0.417
ProArg: 0.712 ± 0.246
ProSer: 2.136 ± 0.444
ProThr: 2.563 ± 0.431
ProVal: 2.421 ± 0.396
ProTrp: 0.498 ± 0.171
ProTyr: 1.495 ± 0.265
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.56 ± 0.495
GlnCys: 0.427 ± 0.156
GlnAsp: 2.278 ± 0.359
GlnGlu: 2.492 ± 0.326
GlnPhe: 1.353 ± 0.298
GlnGly: 2.563 ± 0.474
GlnHis: 0.57 ± 0.197
GlnIle: 2.777 ± 0.369
GlnLys: 1.994 ± 0.479
GlnLeu: 3.275 ± 0.483
GlnMet: 0.783 ± 0.265
GlnAsn: 2.207 ± 0.287
GlnPro: 1.21 ± 0.256
GlnGln: 2.777 ± 0.762
GlnArg: 1.424 ± 0.4
GlnSer: 3.204 ± 0.551
GlnThr: 2.777 ± 0.572
GlnVal: 2.706 ± 0.507
GlnTrp: 0.783 ± 0.219
GlnTyr: 1.353 ± 0.269
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.848 ± 0.465
ArgCys: 0.641 ± 0.221
ArgAsp: 2.421 ± 0.405
ArgGlu: 2.777 ± 0.447
ArgPhe: 1.78 ± 0.344
ArgGly: 2.777 ± 0.382
ArgHis: 0.356 ± 0.148
ArgIle: 2.065 ± 0.378
ArgLys: 2.848 ± 0.515
ArgLeu: 2.634 ± 0.569
ArgMet: 1.495 ± 0.337
ArgAsn: 1.78 ± 0.351
ArgPro: 1.068 ± 0.212
ArgGln: 1.78 ± 0.306
ArgArg: 1.21 ± 0.294
ArgSer: 2.634 ± 0.373
ArgThr: 2.421 ± 0.331
ArgVal: 3.845 ± 0.597
ArgTrp: 0.854 ± 0.234
ArgTyr: 1.068 ± 0.231
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.482 ± 0.583
SerCys: 0.712 ± 0.234
SerAsp: 4.557 ± 0.546
SerGlu: 5.269 ± 0.523
SerPhe: 2.563 ± 0.334
SerGly: 5.767 ± 0.785
SerHis: 0.498 ± 0.167
SerIle: 4.77 ± 0.564
SerLys: 4.913 ± 0.604
SerLeu: 4.628 ± 0.515
SerMet: 2.065 ± 0.41
SerAsn: 3.845 ± 0.52
SerPro: 2.207 ± 0.413
SerGln: 3.133 ± 0.501
SerArg: 2.848 ± 0.56
SerSer: 5.126 ± 0.711
SerThr: 4.13 ± 0.735
SerVal: 4.557 ± 0.667
SerTrp: 0.926 ± 0.204
SerTyr: 2.492 ± 0.481
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.13 ± 0.666
ThrCys: 0.854 ± 0.262
ThrAsp: 3.489 ± 0.394
ThrGlu: 4.058 ± 0.535
ThrPhe: 2.492 ± 0.467
ThrGly: 4.984 ± 0.673
ThrHis: 1.068 ± 0.283
ThrIle: 3.916 ± 0.59
ThrLys: 4.557 ± 0.702
ThrLeu: 5.126 ± 0.396
ThrMet: 0.854 ± 0.238
ThrAsn: 2.848 ± 0.432
ThrPro: 2.278 ± 0.477
ThrGln: 1.994 ± 0.289
ThrArg: 1.78 ± 0.324
ThrSer: 3.987 ± 0.653
ThrThr: 3.418 ± 0.736
ThrVal: 4.058 ± 0.614
ThrTrp: 0.641 ± 0.227
ThrTyr: 1.922 ± 0.395
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.058 ± 0.612
ValCys: 0.57 ± 0.209
ValAsp: 4.343 ± 0.411
ValGlu: 3.987 ± 0.615
ValPhe: 2.278 ± 0.363
ValGly: 5.554 ± 0.461
ValHis: 0.427 ± 0.184
ValIle: 3.631 ± 0.473
ValLys: 4.628 ± 0.626
ValLeu: 4.13 ± 0.49
ValMet: 2.278 ± 0.425
ValAsn: 5.554 ± 0.58
ValPro: 1.353 ± 0.296
ValGln: 1.994 ± 0.342
ValArg: 1.994 ± 0.401
ValSer: 5.411 ± 0.667
ValThr: 4.699 ± 0.659
ValVal: 3.133 ± 0.474
ValTrp: 0.712 ± 0.205
ValTyr: 3.56 ± 0.563
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.783 ± 0.266
TrpCys: 0.142 ± 0.105
TrpAsp: 0.57 ± 0.178
TrpGlu: 0.854 ± 0.272
TrpPhe: 1.068 ± 0.277
TrpGly: 0.854 ± 0.228
TrpHis: 0.285 ± 0.136
TrpIle: 0.997 ± 0.266
TrpLys: 1.068 ± 0.369
TrpLeu: 0.854 ± 0.228
TrpMet: 0.427 ± 0.153
TrpAsn: 0.712 ± 0.269
TrpPro: 0.427 ± 0.182
TrpGln: 0.783 ± 0.246
TrpArg: 0.926 ± 0.254
TrpSer: 0.854 ± 0.236
TrpThr: 0.57 ± 0.212
TrpVal: 0.854 ± 0.265
TrpTrp: 0.142 ± 0.112
TrpTyr: 0.285 ± 0.139
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.065 ± 0.443
TyrCys: 0.997 ± 0.244
TyrAsp: 2.99 ± 0.552
TyrGlu: 2.421 ± 0.425
TyrPhe: 1.566 ± 0.266
TyrGly: 3.204 ± 0.464
TyrHis: 0.356 ± 0.148
TyrIle: 2.065 ± 0.489
TyrLys: 2.563 ± 0.511
TyrLeu: 2.777 ± 0.412
TyrMet: 0.712 ± 0.219
TyrAsn: 2.207 ± 0.471
TyrPro: 1.709 ± 0.345
TyrGln: 1.994 ± 0.373
TyrArg: 2.35 ± 0.382
TyrSer: 2.848 ± 0.525
TyrThr: 2.278 ± 0.323
TyrVal: 1.78 ± 0.341
TyrTrp: 0.57 ± 0.191
TyrTyr: 2.207 ± 0.461
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14046 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
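Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the spread of those estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics


def single_dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(single_dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteins = ["MAAKAA", "MKLV", "MAAL", "MGGS"]  # hypothetical sequences
error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error estimate.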

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski