Amino acid dipepetide frequency for Pseudomonas phage vB_PaeS_SCUT-S3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.832AlaAla: 14.832 ± 2.306
0.671AlaCys: 0.671 ± 0.198
5.366AlaAsp: 5.366 ± 0.594
6.41AlaGlu: 6.41 ± 0.773
3.727AlaPhe: 3.727 ± 0.602
8.795AlaGly: 8.795 ± 1.118
2.236AlaHis: 2.236 ± 0.537
6.112AlaIle: 6.112 ± 1.068
7.081AlaLys: 7.081 ± 0.772
10.285AlaLeu: 10.285 ± 1.301
2.46AlaMet: 2.46 ± 0.396
6.335AlaAsn: 6.335 ± 0.937
5.366AlaPro: 5.366 ± 0.834
4.994AlaGln: 4.994 ± 0.951
4.919AlaArg: 4.919 ± 0.527
6.559AlaSer: 6.559 ± 0.809
6.633AlaThr: 6.633 ± 1.409
7.155AlaVal: 7.155 ± 0.904
2.012AlaTrp: 2.012 ± 0.514
3.727AlaTyr: 3.727 ± 0.34
0.0AlaXaa: 0.0 ± 0.0
Cys
1.043CysAla: 1.043 ± 0.262
0.149CysCys: 0.149 ± 0.108
0.82CysAsp: 0.82 ± 0.215
0.894CysGlu: 0.894 ± 0.315
0.447CysPhe: 0.447 ± 0.185
1.043CysGly: 1.043 ± 0.348
0.447CysHis: 0.447 ± 0.215
0.373CysIle: 0.373 ± 0.155
0.373CysLys: 0.373 ± 0.213
0.671CysLeu: 0.671 ± 0.241
0.0CysMet: 0.0 ± 0.0
0.522CysAsn: 0.522 ± 0.191
0.596CysPro: 0.596 ± 0.196
0.075CysGln: 0.075 ± 0.087
0.522CysArg: 0.522 ± 0.21
0.596CysSer: 0.596 ± 0.216
0.373CysThr: 0.373 ± 0.166
0.522CysVal: 0.522 ± 0.169
0.224CysTrp: 0.224 ± 0.125
0.522CysTyr: 0.522 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
6.335AspAla: 6.335 ± 0.87
0.447AspCys: 0.447 ± 0.166
3.13AspAsp: 3.13 ± 0.505
5.814AspGlu: 5.814 ± 0.554
2.385AspPhe: 2.385 ± 0.461
4.845AspGly: 4.845 ± 0.733
0.596AspHis: 0.596 ± 0.253
1.938AspIle: 1.938 ± 0.377
2.385AspLys: 2.385 ± 0.495
4.323AspLeu: 4.323 ± 0.468
0.969AspMet: 0.969 ± 0.202
2.087AspAsn: 2.087 ± 0.365
2.683AspPro: 2.683 ± 0.433
0.894AspGln: 0.894 ± 0.318
3.727AspArg: 3.727 ± 0.484
2.236AspSer: 2.236 ± 0.356
2.683AspThr: 2.683 ± 0.579
3.056AspVal: 3.056 ± 0.35
0.745AspTrp: 0.745 ± 0.21
1.938AspTyr: 1.938 ± 0.464
0.0AspXaa: 0.0 ± 0.0
Glu
9.018GluAla: 9.018 ± 0.991
1.118GluCys: 1.118 ± 0.268
3.056GluAsp: 3.056 ± 0.517
4.397GluGlu: 4.397 ± 0.671
2.758GluPhe: 2.758 ± 0.311
4.099GluGly: 4.099 ± 0.477
0.745GluHis: 0.745 ± 0.259
4.025GluIle: 4.025 ± 0.554
3.056GluLys: 3.056 ± 0.637
6.932GluLeu: 6.932 ± 0.642
1.789GluMet: 1.789 ± 0.369
2.161GluAsn: 2.161 ± 0.388
1.789GluPro: 1.789 ± 0.428
2.385GluGln: 2.385 ± 0.547
3.95GluArg: 3.95 ± 0.418
2.832GluSer: 2.832 ± 0.575
3.652GluThr: 3.652 ± 0.439
5.217GluVal: 5.217 ± 0.616
1.342GluTrp: 1.342 ± 0.411
1.789GluTyr: 1.789 ± 0.407
0.0GluXaa: 0.0 ± 0.0
Phe
3.13PheAla: 3.13 ± 0.644
0.671PheCys: 0.671 ± 0.204
3.578PheAsp: 3.578 ± 0.512
3.056PheGlu: 3.056 ± 0.513
1.565PhePhe: 1.565 ± 0.402
3.279PheGly: 3.279 ± 0.469
0.373PheHis: 0.373 ± 0.189
2.758PheIle: 2.758 ± 0.447
1.342PheLys: 1.342 ± 0.293
1.267PheLeu: 1.267 ± 0.263
1.118PheMet: 1.118 ± 0.278
2.311PheAsn: 2.311 ± 0.365
0.969PhePro: 0.969 ± 0.261
0.745PheGln: 0.745 ± 0.184
2.46PheArg: 2.46 ± 0.36
1.267PheSer: 1.267 ± 0.384
3.056PheThr: 3.056 ± 0.529
2.385PheVal: 2.385 ± 0.354
0.671PheTrp: 0.671 ± 0.246
1.416PheTyr: 1.416 ± 0.313
0.0PheXaa: 0.0 ± 0.0
Gly
7.081GlyAla: 7.081 ± 1.062
0.671GlyCys: 0.671 ± 0.244
4.77GlyAsp: 4.77 ± 0.585
5.292GlyGlu: 5.292 ± 0.596
3.727GlyPhe: 3.727 ± 0.485
6.037GlyGly: 6.037 ± 1.019
1.193GlyHis: 1.193 ± 0.344
3.428GlyIle: 3.428 ± 0.613
4.77GlyLys: 4.77 ± 0.542
6.559GlyLeu: 6.559 ± 0.815
1.565GlyMet: 1.565 ± 0.395
3.056GlyAsn: 3.056 ± 0.663
2.981GlyPro: 2.981 ± 0.638
3.428GlyGln: 3.428 ± 0.651
3.503GlyArg: 3.503 ± 0.529
4.845GlySer: 4.845 ± 0.813
4.248GlyThr: 4.248 ± 0.638
5.515GlyVal: 5.515 ± 0.679
1.342GlyTrp: 1.342 ± 0.39
2.534GlyTyr: 2.534 ± 0.38
0.0GlyXaa: 0.0 ± 0.0
His
1.342HisAla: 1.342 ± 0.338
0.298HisCys: 0.298 ± 0.137
0.373HisAsp: 0.373 ± 0.153
0.894HisGlu: 0.894 ± 0.265
0.447HisPhe: 0.447 ± 0.185
0.745HisGly: 0.745 ± 0.183
0.075HisHis: 0.075 ± 0.068
1.267HisIle: 1.267 ± 0.329
0.671HisLys: 0.671 ± 0.204
1.267HisLeu: 1.267 ± 0.396
0.373HisMet: 0.373 ± 0.17
0.82HisAsn: 0.82 ± 0.269
0.745HisPro: 0.745 ± 0.301
0.149HisGln: 0.149 ± 0.109
0.522HisArg: 0.522 ± 0.212
0.745HisSer: 0.745 ± 0.217
0.522HisThr: 0.522 ± 0.235
1.565HisVal: 1.565 ± 0.428
0.075HisTrp: 0.075 ± 0.062
0.373HisTyr: 0.373 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
6.261IleAla: 6.261 ± 0.545
0.82IleCys: 0.82 ± 0.274
3.876IleAsp: 3.876 ± 0.63
4.323IleGlu: 4.323 ± 0.614
1.118IlePhe: 1.118 ± 0.402
4.174IleGly: 4.174 ± 0.616
0.596IleHis: 0.596 ± 0.19
1.938IleIle: 1.938 ± 0.362
2.907IleLys: 2.907 ± 0.437
1.938IleLeu: 1.938 ± 0.345
0.894IleMet: 0.894 ± 0.292
3.13IleAsn: 3.13 ± 0.628
2.609IlePro: 2.609 ± 0.399
1.267IleGln: 1.267 ± 0.336
2.981IleArg: 2.981 ± 0.409
2.758IleSer: 2.758 ± 0.418
2.46IleThr: 2.46 ± 0.415
3.652IleVal: 3.652 ± 0.492
0.745IleTrp: 0.745 ± 0.21
1.863IleTyr: 1.863 ± 0.428
0.0IleXaa: 0.0 ± 0.0
Lys
7.081LysAla: 7.081 ± 0.784
0.373LysCys: 0.373 ± 0.149
2.609LysAsp: 2.609 ± 0.686
3.354LysGlu: 3.354 ± 0.585
1.863LysPhe: 1.863 ± 0.41
3.279LysGly: 3.279 ± 0.48
0.596LysHis: 0.596 ± 0.22
2.534LysIle: 2.534 ± 0.404
4.323LysLys: 4.323 ± 0.816
6.112LysLeu: 6.112 ± 0.606
1.043LysMet: 1.043 ± 0.235
2.534LysAsn: 2.534 ± 0.499
2.758LysPro: 2.758 ± 0.633
1.342LysGln: 1.342 ± 0.317
2.236LysArg: 2.236 ± 0.462
3.205LysSer: 3.205 ± 0.47
3.279LysThr: 3.279 ± 0.457
3.652LysVal: 3.652 ± 0.517
0.969LysTrp: 0.969 ± 0.3
1.416LysTyr: 1.416 ± 0.323
0.0LysXaa: 0.0 ± 0.0
Leu
9.242LeuAla: 9.242 ± 1.231
0.596LeuCys: 0.596 ± 0.208
3.279LeuAsp: 3.279 ± 0.523
4.472LeuGlu: 4.472 ± 0.555
3.056LeuPhe: 3.056 ± 0.429
4.845LeuGly: 4.845 ± 0.648
1.043LeuHis: 1.043 ± 0.256
3.801LeuIle: 3.801 ± 0.507
4.174LeuLys: 4.174 ± 0.614
6.037LeuLeu: 6.037 ± 0.746
1.565LeuMet: 1.565 ± 0.423
4.025LeuAsn: 4.025 ± 0.541
4.025LeuPro: 4.025 ± 0.615
3.428LeuGln: 3.428 ± 0.739
5.217LeuArg: 5.217 ± 0.701
6.708LeuSer: 6.708 ± 0.559
4.174LeuThr: 4.174 ± 0.556
4.546LeuVal: 4.546 ± 0.608
1.118LeuTrp: 1.118 ± 0.303
3.428LeuTyr: 3.428 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
2.683MetAla: 2.683 ± 0.346
0.149MetCys: 0.149 ± 0.108
0.745MetAsp: 0.745 ± 0.24
0.149MetGlu: 0.149 ± 0.09
0.745MetPhe: 0.745 ± 0.233
1.789MetGly: 1.789 ± 0.363
0.447MetHis: 0.447 ± 0.212
0.82MetIle: 0.82 ± 0.237
1.416MetLys: 1.416 ± 0.415
2.012MetLeu: 2.012 ± 0.416
0.373MetMet: 0.373 ± 0.196
1.043MetAsn: 1.043 ± 0.306
1.789MetPro: 1.789 ± 0.348
1.267MetGln: 1.267 ± 0.317
1.118MetArg: 1.118 ± 0.237
1.714MetSer: 1.714 ± 0.408
1.267MetThr: 1.267 ± 0.265
0.82MetVal: 0.82 ± 0.21
0.075MetTrp: 0.075 ± 0.062
0.373MetTyr: 0.373 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
5.888AsnAla: 5.888 ± 0.868
0.522AsnCys: 0.522 ± 0.238
2.012AsnAsp: 2.012 ± 0.406
3.428AsnGlu: 3.428 ± 0.51
0.969AsnPhe: 0.969 ± 0.211
4.994AsnGly: 4.994 ± 0.654
0.447AsnHis: 0.447 ± 0.15
2.161AsnIle: 2.161 ± 0.381
2.832AsnLys: 2.832 ± 0.613
2.907AsnLeu: 2.907 ± 0.396
0.745AsnMet: 0.745 ± 0.207
2.311AsnAsn: 2.311 ± 0.54
3.13AsnPro: 3.13 ± 0.392
1.043AsnGln: 1.043 ± 0.277
2.758AsnArg: 2.758 ± 0.468
2.385AsnSer: 2.385 ± 0.476
2.832AsnThr: 2.832 ± 0.543
5.292AsnVal: 5.292 ± 0.668
0.82AsnTrp: 0.82 ± 0.294
1.043AsnTyr: 1.043 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
5.068ProAla: 5.068 ± 0.931
0.224ProCys: 0.224 ± 0.129
2.683ProAsp: 2.683 ± 0.415
3.727ProGlu: 3.727 ± 0.566
1.491ProPhe: 1.491 ± 0.416
3.652ProGly: 3.652 ± 0.743
1.043ProHis: 1.043 ± 0.271
2.087ProIle: 2.087 ± 0.362
2.534ProLys: 2.534 ± 0.648
3.205ProLeu: 3.205 ± 0.457
0.745ProMet: 0.745 ± 0.242
3.354ProAsn: 3.354 ± 0.514
2.236ProPro: 2.236 ± 0.458
2.46ProGln: 2.46 ± 0.752
1.714ProArg: 1.714 ± 0.487
3.13ProSer: 3.13 ± 0.603
4.174ProThr: 4.174 ± 0.605
3.205ProVal: 3.205 ± 0.55
0.894ProTrp: 0.894 ± 0.3
1.565ProTyr: 1.565 ± 0.395
0.0ProXaa: 0.0 ± 0.0
Gln
4.323GlnAla: 4.323 ± 0.57
0.522GlnCys: 0.522 ± 0.185
1.565GlnAsp: 1.565 ± 0.418
1.491GlnGlu: 1.491 ± 0.691
1.342GlnPhe: 1.342 ± 0.363
2.683GlnGly: 2.683 ± 0.723
0.447GlnHis: 0.447 ± 0.201
1.938GlnIle: 1.938 ± 0.455
1.64GlnLys: 1.64 ± 0.349
3.876GlnLeu: 3.876 ± 0.757
1.118GlnMet: 1.118 ± 0.302
2.311GlnAsn: 2.311 ± 0.571
2.012GlnPro: 2.012 ± 0.866
3.279GlnGln: 3.279 ± 1.129
2.832GlnArg: 2.832 ± 0.639
2.311GlnSer: 2.311 ± 0.371
2.236GlnThr: 2.236 ± 0.452
2.534GlnVal: 2.534 ± 0.385
0.447GlnTrp: 0.447 ± 0.183
1.416GlnTyr: 1.416 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
5.739ArgAla: 5.739 ± 0.95
0.298ArgCys: 0.298 ± 0.152
3.354ArgAsp: 3.354 ± 0.454
3.801ArgGlu: 3.801 ± 0.6
2.012ArgPhe: 2.012 ± 0.478
3.056ArgGly: 3.056 ± 0.451
0.596ArgHis: 0.596 ± 0.208
3.95ArgIle: 3.95 ± 0.491
2.832ArgLys: 2.832 ± 0.603
4.696ArgLeu: 4.696 ± 0.556
1.64ArgMet: 1.64 ± 0.327
2.012ArgAsn: 2.012 ± 0.497
2.236ArgPro: 2.236 ± 0.465
2.832ArgGln: 2.832 ± 0.34
3.578ArgArg: 3.578 ± 0.641
2.907ArgSer: 2.907 ± 0.553
2.683ArgThr: 2.683 ± 0.466
2.758ArgVal: 2.758 ± 0.43
0.745ArgTrp: 0.745 ± 0.203
1.789ArgTyr: 1.789 ± 0.368
0.0ArgXaa: 0.0 ± 0.0
Ser
5.664SerAla: 5.664 ± 0.811
0.596SerCys: 0.596 ± 0.215
3.279SerAsp: 3.279 ± 0.434
3.279SerGlu: 3.279 ± 0.534
2.609SerPhe: 2.609 ± 0.405
5.143SerGly: 5.143 ± 1.037
0.522SerHis: 0.522 ± 0.257
2.832SerIle: 2.832 ± 0.42
4.025SerLys: 4.025 ± 0.67
4.248SerLeu: 4.248 ± 0.592
0.82SerMet: 0.82 ± 0.229
2.981SerAsn: 2.981 ± 0.43
2.907SerPro: 2.907 ± 0.491
3.354SerGln: 3.354 ± 0.807
2.46SerArg: 2.46 ± 0.375
2.758SerSer: 2.758 ± 0.476
3.205SerThr: 3.205 ± 0.443
4.025SerVal: 4.025 ± 0.515
0.82SerTrp: 0.82 ± 0.291
2.087SerTyr: 2.087 ± 0.516
0.0SerXaa: 0.0 ± 0.0
Thr
7.751ThrAla: 7.751 ± 1.448
0.373ThrCys: 0.373 ± 0.169
3.13ThrAsp: 3.13 ± 0.428
3.95ThrGlu: 3.95 ± 0.648
1.714ThrPhe: 1.714 ± 0.402
5.143ThrGly: 5.143 ± 0.618
0.298ThrHis: 0.298 ± 0.146
2.609ThrIle: 2.609 ± 0.455
3.205ThrLys: 3.205 ± 0.472
4.248ThrLeu: 4.248 ± 0.395
0.894ThrMet: 0.894 ± 0.224
2.311ThrAsn: 2.311 ± 0.404
3.876ThrPro: 3.876 ± 0.406
2.609ThrGln: 2.609 ± 0.531
2.087ThrArg: 2.087 ± 0.413
2.832ThrSer: 2.832 ± 0.632
3.056ThrThr: 3.056 ± 0.492
4.025ThrVal: 4.025 ± 0.914
0.82ThrTrp: 0.82 ± 0.271
2.012ThrTyr: 2.012 ± 0.402
0.0ThrXaa: 0.0 ± 0.0
Val
8.646ValAla: 8.646 ± 0.791
0.671ValCys: 0.671 ± 0.197
2.758ValAsp: 2.758 ± 0.602
4.621ValGlu: 4.621 ± 0.695
3.205ValPhe: 3.205 ± 0.504
5.143ValGly: 5.143 ± 0.547
0.82ValHis: 0.82 ± 0.222
3.205ValIle: 3.205 ± 0.572
3.205ValLys: 3.205 ± 0.367
4.621ValLeu: 4.621 ± 0.577
1.863ValMet: 1.863 ± 0.429
2.385ValAsn: 2.385 ± 0.403
3.876ValPro: 3.876 ± 0.59
2.758ValGln: 2.758 ± 0.529
3.428ValArg: 3.428 ± 0.603
4.845ValSer: 4.845 ± 0.701
4.472ValThr: 4.472 ± 0.533
4.77ValVal: 4.77 ± 0.586
1.193ValTrp: 1.193 ± 0.309
1.938ValTyr: 1.938 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
1.64TrpAla: 1.64 ± 0.35
0.298TrpCys: 0.298 ± 0.131
0.522TrpAsp: 0.522 ± 0.216
0.596TrpGlu: 0.596 ± 0.188
0.82TrpPhe: 0.82 ± 0.254
0.671TrpGly: 0.671 ± 0.281
0.373TrpHis: 0.373 ± 0.221
0.82TrpIle: 0.82 ± 0.213
0.447TrpLys: 0.447 ± 0.173
0.969TrpLeu: 0.969 ± 0.276
0.224TrpMet: 0.224 ± 0.112
0.894TrpAsn: 0.894 ± 0.228
1.193TrpPro: 1.193 ± 0.294
0.894TrpGln: 0.894 ± 0.243
1.565TrpArg: 1.565 ± 0.414
0.671TrpSer: 0.671 ± 0.27
0.522TrpThr: 0.522 ± 0.193
1.193TrpVal: 1.193 ± 0.309
0.224TrpTrp: 0.224 ± 0.149
0.969TrpTyr: 0.969 ± 0.248
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.907TyrAla: 2.907 ± 0.591
0.745TyrCys: 0.745 ± 0.258
2.311TyrAsp: 2.311 ± 0.502
2.236TyrGlu: 2.236 ± 0.423
1.416TyrPhe: 1.416 ± 0.394
3.13TyrGly: 3.13 ± 0.622
0.373TyrHis: 0.373 ± 0.155
1.714TyrIle: 1.714 ± 0.315
1.342TyrLys: 1.342 ± 0.413
2.609TyrLeu: 2.609 ± 0.447
0.522TyrMet: 0.522 ± 0.291
1.938TyrAsn: 1.938 ± 0.351
1.416TyrPro: 1.416 ± 0.326
1.118TyrGln: 1.118 ± 0.288
1.938TyrArg: 1.938 ± 0.358
2.311TyrSer: 2.311 ± 0.465
1.416TyrThr: 1.416 ± 0.274
2.534TyrVal: 2.534 ± 0.57
0.224TyrTrp: 0.224 ± 0.105
0.894TyrTyr: 0.894 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski