Amino acid dipepetide frequency for Raoultella phage RP180

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.676AlaAla: 10.676 ± 1.436
1.252AlaCys: 1.252 ± 0.329
5.375AlaAsp: 5.375 ± 0.605
7.215AlaGlu: 7.215 ± 0.847
3.755AlaPhe: 3.755 ± 0.631
7.068AlaGly: 7.068 ± 0.875
1.693AlaHis: 1.693 ± 0.316
5.007AlaIle: 5.007 ± 0.792
6.258AlaLys: 6.258 ± 0.786
8.099AlaLeu: 8.099 ± 0.772
1.988AlaMet: 1.988 ± 0.331
3.608AlaAsn: 3.608 ± 0.581
4.27AlaPro: 4.27 ± 0.649
3.166AlaGln: 3.166 ± 0.664
4.491AlaArg: 4.491 ± 0.523
5.964AlaSer: 5.964 ± 0.755
5.596AlaThr: 5.596 ± 0.762
6.7AlaVal: 6.7 ± 0.753
1.546AlaTrp: 1.546 ± 0.245
3.092AlaTyr: 3.092 ± 0.422
0.0AlaXaa: 0.0 ± 0.0
Cys
1.104CysAla: 1.104 ± 0.3
0.074CysCys: 0.074 ± 0.075
0.957CysAsp: 0.957 ± 0.252
1.104CysGlu: 1.104 ± 0.419
0.295CysPhe: 0.295 ± 0.128
1.473CysGly: 1.473 ± 0.348
0.0CysHis: 0.0 ± 0.0
0.515CysIle: 0.515 ± 0.174
0.884CysLys: 0.884 ± 0.271
0.736CysLeu: 0.736 ± 0.272
0.147CysMet: 0.147 ± 0.082
0.442CysAsn: 0.442 ± 0.185
0.295CysPro: 0.295 ± 0.154
0.221CysGln: 0.221 ± 0.127
0.515CysArg: 0.515 ± 0.203
0.368CysSer: 0.368 ± 0.16
0.81CysThr: 0.81 ± 0.289
1.104CysVal: 1.104 ± 0.291
0.221CysTrp: 0.221 ± 0.11
0.515CysTyr: 0.515 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
6.921AspAla: 6.921 ± 0.805
0.663AspCys: 0.663 ± 0.225
4.491AspAsp: 4.491 ± 0.627
4.565AspGlu: 4.565 ± 0.684
2.798AspPhe: 2.798 ± 0.443
5.448AspGly: 5.448 ± 0.583
1.031AspHis: 1.031 ± 0.267
3.387AspIle: 3.387 ± 0.467
3.387AspLys: 3.387 ± 0.478
4.344AspLeu: 4.344 ± 0.515
1.62AspMet: 1.62 ± 0.329
3.092AspAsn: 3.092 ± 0.494
1.546AspPro: 1.546 ± 0.306
0.884AspGln: 0.884 ± 0.215
2.503AspArg: 2.503 ± 0.508
3.387AspSer: 3.387 ± 0.42
3.976AspThr: 3.976 ± 0.484
3.829AspVal: 3.829 ± 0.543
1.104AspTrp: 1.104 ± 0.263
2.135AspTyr: 2.135 ± 0.338
0.0AspXaa: 0.0 ± 0.0
Glu
5.743GluAla: 5.743 ± 0.826
0.515GluCys: 0.515 ± 0.257
4.049GluAsp: 4.049 ± 0.614
5.08GluGlu: 5.08 ± 0.839
2.871GluPhe: 2.871 ± 0.504
3.681GluGly: 3.681 ± 0.537
1.104GluHis: 1.104 ± 0.324
4.123GluIle: 4.123 ± 0.583
4.197GluLys: 4.197 ± 0.712
5.743GluLeu: 5.743 ± 0.815
2.503GluMet: 2.503 ± 0.437
2.43GluAsn: 2.43 ± 0.477
2.356GluPro: 2.356 ± 0.468
3.313GluGln: 3.313 ± 0.74
4.197GluArg: 4.197 ± 0.596
3.313GluSer: 3.313 ± 0.511
3.829GluThr: 3.829 ± 0.454
4.786GluVal: 4.786 ± 0.597
1.104GluTrp: 1.104 ± 0.302
2.724GluTyr: 2.724 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
2.503PheAla: 2.503 ± 0.442
0.515PheCys: 0.515 ± 0.206
4.197PheAsp: 4.197 ± 0.575
2.945PheGlu: 2.945 ± 0.549
0.515PhePhe: 0.515 ± 0.182
3.166PheGly: 3.166 ± 0.422
0.81PheHis: 0.81 ± 0.218
2.871PheIle: 2.871 ± 0.45
1.767PheLys: 1.767 ± 0.384
1.841PheLeu: 1.841 ± 0.316
0.368PheMet: 0.368 ± 0.135
1.767PheAsn: 1.767 ± 0.355
1.178PhePro: 1.178 ± 0.357
1.104PheGln: 1.104 ± 0.254
2.062PheArg: 2.062 ± 0.327
3.166PheSer: 3.166 ± 0.624
3.313PheThr: 3.313 ± 0.509
2.651PheVal: 2.651 ± 0.53
0.515PheTrp: 0.515 ± 0.221
1.473PheTyr: 1.473 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
6.921GlyAla: 6.921 ± 0.73
1.031GlyCys: 1.031 ± 0.274
4.565GlyAsp: 4.565 ± 0.524
5.08GlyGlu: 5.08 ± 0.569
3.608GlyPhe: 3.608 ± 0.63
6.626GlyGly: 6.626 ± 0.853
1.546GlyHis: 1.546 ± 0.277
2.724GlyIle: 2.724 ± 0.462
5.007GlyLys: 5.007 ± 0.661
5.89GlyLeu: 5.89 ± 0.737
2.503GlyMet: 2.503 ± 0.382
3.755GlyAsn: 3.755 ± 0.656
1.693GlyPro: 1.693 ± 0.315
2.577GlyGln: 2.577 ± 0.513
4.638GlyArg: 4.638 ± 0.566
4.565GlySer: 4.565 ± 0.642
4.049GlyThr: 4.049 ± 0.705
5.154GlyVal: 5.154 ± 0.615
0.736GlyTrp: 0.736 ± 0.188
2.945GlyTyr: 2.945 ± 0.505
0.0GlyXaa: 0.0 ± 0.0
His
1.325HisAla: 1.325 ± 0.317
0.442HisCys: 0.442 ± 0.167
1.104HisAsp: 1.104 ± 0.252
1.252HisGlu: 1.252 ± 0.347
0.663HisPhe: 0.663 ± 0.207
1.325HisGly: 1.325 ± 0.279
0.663HisHis: 0.663 ± 0.205
0.957HisIle: 0.957 ± 0.266
1.178HisLys: 1.178 ± 0.313
1.399HisLeu: 1.399 ± 0.307
0.442HisMet: 0.442 ± 0.159
0.957HisAsn: 0.957 ± 0.229
1.252HisPro: 1.252 ± 0.321
0.81HisGln: 0.81 ± 0.204
1.252HisArg: 1.252 ± 0.309
0.515HisSer: 0.515 ± 0.214
0.884HisThr: 0.884 ± 0.319
1.178HisVal: 1.178 ± 0.347
0.074HisTrp: 0.074 ± 0.081
0.663HisTyr: 0.663 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
5.08IleAla: 5.08 ± 0.726
0.663IleCys: 0.663 ± 0.205
3.902IleAsp: 3.902 ± 0.507
3.681IleGlu: 3.681 ± 0.455
1.325IlePhe: 1.325 ± 0.272
2.945IleGly: 2.945 ± 0.463
0.589IleHis: 0.589 ± 0.192
2.871IleIle: 2.871 ± 0.43
3.387IleLys: 3.387 ± 0.459
3.902IleLeu: 3.902 ± 0.538
1.104IleMet: 1.104 ± 0.251
2.577IleAsn: 2.577 ± 0.425
2.577IlePro: 2.577 ± 0.443
2.209IleGln: 2.209 ± 0.404
2.577IleArg: 2.577 ± 0.327
3.829IleSer: 3.829 ± 0.653
4.197IleThr: 4.197 ± 0.596
3.46IleVal: 3.46 ± 0.461
1.104IleTrp: 1.104 ± 0.316
1.399IleTyr: 1.399 ± 0.269
0.0IleXaa: 0.0 ± 0.0
Lys
6.332LysAla: 6.332 ± 0.821
0.442LysCys: 0.442 ± 0.214
3.019LysAsp: 3.019 ± 0.556
3.902LysGlu: 3.902 ± 0.794
2.209LysPhe: 2.209 ± 0.323
3.46LysGly: 3.46 ± 0.453
1.178LysHis: 1.178 ± 0.298
2.282LysIle: 2.282 ± 0.46
3.24LysLys: 3.24 ± 0.478
5.08LysLeu: 5.08 ± 0.61
1.914LysMet: 1.914 ± 0.525
3.092LysAsn: 3.092 ± 0.496
2.798LysPro: 2.798 ± 0.552
2.356LysGln: 2.356 ± 0.491
3.976LysArg: 3.976 ± 0.613
3.534LysSer: 3.534 ± 0.617
4.049LysThr: 4.049 ± 0.336
3.092LysVal: 3.092 ± 0.532
0.515LysTrp: 0.515 ± 0.173
2.135LysTyr: 2.135 ± 0.345
0.0LysXaa: 0.0 ± 0.0
Leu
7.436LeuAla: 7.436 ± 0.701
0.884LeuCys: 0.884 ± 0.302
4.049LeuAsp: 4.049 ± 0.575
4.786LeuGlu: 4.786 ± 0.803
2.577LeuPhe: 2.577 ± 0.557
5.007LeuGly: 5.007 ± 0.526
1.473LeuHis: 1.473 ± 0.354
4.859LeuIle: 4.859 ± 0.455
3.755LeuLys: 3.755 ± 0.732
6.185LeuLeu: 6.185 ± 0.825
1.767LeuMet: 1.767 ± 0.29
4.344LeuAsn: 4.344 ± 0.723
4.197LeuPro: 4.197 ± 0.567
2.724LeuGln: 2.724 ± 0.404
5.743LeuArg: 5.743 ± 0.768
4.123LeuSer: 4.123 ± 0.551
5.448LeuThr: 5.448 ± 0.815
4.638LeuVal: 4.638 ± 0.658
1.031LeuTrp: 1.031 ± 0.313
2.356LeuTyr: 2.356 ± 0.36
0.0LeuXaa: 0.0 ± 0.0
Met
2.724MetAla: 2.724 ± 0.445
0.295MetCys: 0.295 ± 0.13
0.957MetAsp: 0.957 ± 0.325
1.252MetGlu: 1.252 ± 0.364
0.884MetPhe: 0.884 ± 0.252
1.914MetGly: 1.914 ± 0.39
0.663MetHis: 0.663 ± 0.218
1.473MetIle: 1.473 ± 0.384
1.62MetLys: 1.62 ± 0.339
1.767MetLeu: 1.767 ± 0.329
0.589MetMet: 0.589 ± 0.207
1.104MetAsn: 1.104 ± 0.315
1.325MetPro: 1.325 ± 0.255
0.515MetGln: 0.515 ± 0.187
1.252MetArg: 1.252 ± 0.304
1.914MetSer: 1.914 ± 0.303
1.693MetThr: 1.693 ± 0.329
1.473MetVal: 1.473 ± 0.282
0.368MetTrp: 0.368 ± 0.149
0.368MetTyr: 0.368 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
4.27AsnAla: 4.27 ± 0.578
0.81AsnCys: 0.81 ± 0.272
2.798AsnAsp: 2.798 ± 0.415
2.724AsnGlu: 2.724 ± 0.426
1.178AsnPhe: 1.178 ± 0.25
4.418AsnGly: 4.418 ± 0.642
0.736AsnHis: 0.736 ± 0.226
2.577AsnIle: 2.577 ± 0.385
2.135AsnLys: 2.135 ± 0.423
3.46AsnLeu: 3.46 ± 0.533
0.736AsnMet: 0.736 ± 0.244
2.798AsnAsn: 2.798 ± 0.562
2.43AsnPro: 2.43 ± 0.438
1.104AsnGln: 1.104 ± 0.281
2.356AsnArg: 2.356 ± 0.409
2.798AsnSer: 2.798 ± 0.37
2.577AsnThr: 2.577 ± 0.498
3.534AsnVal: 3.534 ± 0.523
0.663AsnTrp: 0.663 ± 0.277
1.325AsnTyr: 1.325 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
2.945ProAla: 2.945 ± 0.533
0.736ProCys: 0.736 ± 0.22
3.166ProAsp: 3.166 ± 0.481
3.608ProGlu: 3.608 ± 0.526
1.767ProPhe: 1.767 ± 0.365
3.166ProGly: 3.166 ± 0.558
0.736ProHis: 0.736 ± 0.224
2.577ProIle: 2.577 ± 0.425
1.841ProLys: 1.841 ± 0.336
3.166ProLeu: 3.166 ± 0.531
0.957ProMet: 0.957 ± 0.348
1.473ProAsn: 1.473 ± 0.314
0.884ProPro: 0.884 ± 0.245
1.325ProGln: 1.325 ± 0.364
1.693ProArg: 1.693 ± 0.346
2.577ProSer: 2.577 ± 0.48
1.841ProThr: 1.841 ± 0.382
3.902ProVal: 3.902 ± 0.357
0.221ProTrp: 0.221 ± 0.124
1.473ProTyr: 1.473 ± 0.398
0.0ProXaa: 0.0 ± 0.0
Gln
3.829GlnAla: 3.829 ± 0.658
0.442GlnCys: 0.442 ± 0.201
1.693GlnAsp: 1.693 ± 0.427
2.724GlnGlu: 2.724 ± 0.443
1.399GlnPhe: 1.399 ± 0.299
2.062GlnGly: 2.062 ± 0.312
1.031GlnHis: 1.031 ± 0.263
1.693GlnIle: 1.693 ± 0.353
2.503GlnLys: 2.503 ± 0.481
3.019GlnLeu: 3.019 ± 0.46
0.736GlnMet: 0.736 ± 0.231
1.546GlnAsn: 1.546 ± 0.446
1.473GlnPro: 1.473 ± 0.269
2.282GlnGln: 2.282 ± 0.671
1.988GlnArg: 1.988 ± 0.364
1.988GlnSer: 1.988 ± 0.399
1.841GlnThr: 1.841 ± 0.388
2.282GlnVal: 2.282 ± 0.36
0.736GlnTrp: 0.736 ± 0.266
1.252GlnTyr: 1.252 ± 0.25
0.0GlnXaa: 0.0 ± 0.0
Arg
4.344ArgAla: 4.344 ± 0.502
0.442ArgCys: 0.442 ± 0.183
2.871ArgAsp: 2.871 ± 0.388
3.166ArgGlu: 3.166 ± 0.51
1.841ArgPhe: 1.841 ± 0.428
4.049ArgGly: 4.049 ± 0.577
0.81ArgHis: 0.81 ± 0.209
2.945ArgIle: 2.945 ± 0.529
4.049ArgLys: 4.049 ± 0.547
4.049ArgLeu: 4.049 ± 0.381
1.841ArgMet: 1.841 ± 0.322
2.577ArgAsn: 2.577 ± 0.485
1.546ArgPro: 1.546 ± 0.382
3.46ArgGln: 3.46 ± 0.38
5.228ArgArg: 5.228 ± 0.737
3.24ArgSer: 3.24 ± 0.426
3.092ArgThr: 3.092 ± 0.43
4.344ArgVal: 4.344 ± 0.587
0.736ArgTrp: 0.736 ± 0.202
1.767ArgTyr: 1.767 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
5.964SerAla: 5.964 ± 0.852
0.295SerCys: 0.295 ± 0.14
3.534SerAsp: 3.534 ± 0.385
3.24SerGlu: 3.24 ± 0.474
3.092SerPhe: 3.092 ± 0.416
6.258SerGly: 6.258 ± 0.652
1.031SerHis: 1.031 ± 0.252
2.798SerIle: 2.798 ± 0.574
3.24SerLys: 3.24 ± 0.468
5.007SerLeu: 5.007 ± 0.683
1.325SerMet: 1.325 ± 0.308
3.24SerAsn: 3.24 ± 0.566
2.577SerPro: 2.577 ± 0.52
1.693SerGln: 1.693 ± 0.439
2.651SerArg: 2.651 ± 0.46
3.534SerSer: 3.534 ± 0.643
4.123SerThr: 4.123 ± 0.729
4.491SerVal: 4.491 ± 0.508
0.442SerTrp: 0.442 ± 0.162
1.62SerTyr: 1.62 ± 0.31
0.0SerXaa: 0.0 ± 0.0
Thr
6.995ThrAla: 6.995 ± 0.925
0.663ThrCys: 0.663 ± 0.328
3.681ThrAsp: 3.681 ± 0.493
3.387ThrGlu: 3.387 ± 0.396
3.092ThrPhe: 3.092 ± 0.5
6.332ThrGly: 6.332 ± 0.896
1.104ThrHis: 1.104 ± 0.331
3.24ThrIle: 3.24 ± 0.409
3.608ThrLys: 3.608 ± 0.408
4.565ThrLeu: 4.565 ± 0.529
1.252ThrMet: 1.252 ± 0.289
1.914ThrAsn: 1.914 ± 0.353
4.123ThrPro: 4.123 ± 0.515
1.62ThrGln: 1.62 ± 0.291
2.724ThrArg: 2.724 ± 0.463
3.24ThrSer: 3.24 ± 0.429
3.313ThrThr: 3.313 ± 0.632
4.491ThrVal: 4.491 ± 0.653
0.663ThrTrp: 0.663 ± 0.245
2.503ThrTyr: 2.503 ± 0.537
0.0ThrXaa: 0.0 ± 0.0
Val
7.068ValAla: 7.068 ± 0.665
0.957ValCys: 0.957 ± 0.255
3.608ValAsp: 3.608 ± 0.43
5.08ValGlu: 5.08 ± 0.708
2.356ValPhe: 2.356 ± 0.451
3.681ValGly: 3.681 ± 0.493
0.884ValHis: 0.884 ± 0.225
3.976ValIle: 3.976 ± 0.43
3.608ValLys: 3.608 ± 0.497
4.933ValLeu: 4.933 ± 0.6
1.252ValMet: 1.252 ± 0.462
2.798ValAsn: 2.798 ± 0.597
2.282ValPro: 2.282 ± 0.368
2.651ValGln: 2.651 ± 0.46
3.755ValArg: 3.755 ± 0.456
5.08ValSer: 5.08 ± 0.56
5.89ValThr: 5.89 ± 0.709
4.786ValVal: 4.786 ± 0.64
1.178ValTrp: 1.178 ± 0.279
2.356ValTyr: 2.356 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
1.399TrpAla: 1.399 ± 0.457
0.221TrpCys: 0.221 ± 0.129
0.589TrpAsp: 0.589 ± 0.224
0.663TrpGlu: 0.663 ± 0.238
0.736TrpPhe: 0.736 ± 0.263
1.031TrpGly: 1.031 ± 0.244
0.442TrpHis: 0.442 ± 0.18
0.442TrpIle: 0.442 ± 0.147
0.663TrpLys: 0.663 ± 0.215
1.62TrpLeu: 1.62 ± 0.267
0.515TrpMet: 0.515 ± 0.169
0.515TrpAsn: 0.515 ± 0.201
0.147TrpPro: 0.147 ± 0.134
0.957TrpGln: 0.957 ± 0.245
0.884TrpArg: 0.884 ± 0.311
0.515TrpSer: 0.515 ± 0.2
0.442TrpThr: 0.442 ± 0.18
1.031TrpVal: 1.031 ± 0.284
0.295TrpTrp: 0.295 ± 0.144
0.442TrpTyr: 0.442 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.24TyrAla: 3.24 ± 0.466
0.442TyrCys: 0.442 ± 0.181
2.503TyrAsp: 2.503 ± 0.546
2.209TyrGlu: 2.209 ± 0.404
1.767TyrPhe: 1.767 ± 0.383
2.577TyrGly: 2.577 ± 0.377
0.81TyrHis: 0.81 ± 0.217
2.062TyrIle: 2.062 ± 0.447
2.282TyrLys: 2.282 ± 0.371
2.577TyrLeu: 2.577 ± 0.387
0.515TyrMet: 0.515 ± 0.184
1.399TyrAsn: 1.399 ± 0.261
1.104TyrPro: 1.104 ± 0.281
1.546TyrGln: 1.546 ± 0.327
1.914TyrArg: 1.914 ± 0.409
2.503TyrSer: 2.503 ± 0.467
1.546TyrThr: 1.546 ± 0.399
1.252TyrVal: 1.252 ± 0.348
0.295TyrTrp: 0.295 ± 0.162
1.104TyrTyr: 1.104 ± 0.282
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13583 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski