Amino acid dipepetide frequency for Ralstonia phage RPSC1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.121AlaAla: 15.121 ± 1.951
0.591AlaCys: 0.591 ± 0.177
7.518AlaAsp: 7.518 ± 0.733
6.082AlaGlu: 6.082 ± 0.502
4.308AlaPhe: 4.308 ± 0.592
8.785AlaGly: 8.785 ± 1.162
1.689AlaHis: 1.689 ± 0.404
5.66AlaIle: 5.66 ± 0.675
6.42AlaLys: 6.42 ± 0.903
9.968AlaLeu: 9.968 ± 1.331
3.379AlaMet: 3.379 ± 0.481
3.379AlaAsn: 3.379 ± 0.606
5.406AlaPro: 5.406 ± 0.83
4.477AlaGln: 4.477 ± 0.749
6.589AlaArg: 6.589 ± 0.806
5.829AlaSer: 5.829 ± 0.722
7.011AlaThr: 7.011 ± 0.801
6.927AlaVal: 6.927 ± 0.547
2.112AlaTrp: 2.112 ± 0.579
2.872AlaTyr: 2.872 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.676CysAla: 0.676 ± 0.249
0.0CysCys: 0.0 ± 0.0
0.422CysAsp: 0.422 ± 0.23
0.676CysGlu: 0.676 ± 0.21
0.76CysPhe: 0.76 ± 0.295
0.929CysGly: 0.929 ± 0.305
0.507CysHis: 0.507 ± 0.204
0.422CysIle: 0.422 ± 0.153
0.507CysLys: 0.507 ± 0.225
1.014CysLeu: 1.014 ± 0.356
0.676CysMet: 0.676 ± 0.221
0.253CysAsn: 0.253 ± 0.138
0.253CysPro: 0.253 ± 0.167
0.253CysGln: 0.253 ± 0.152
0.76CysArg: 0.76 ± 0.342
0.169CysSer: 0.169 ± 0.107
0.338CysThr: 0.338 ± 0.189
0.169CysVal: 0.169 ± 0.11
0.169CysTrp: 0.169 ± 0.121
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.18AspAla: 7.18 ± 0.77
0.591AspCys: 0.591 ± 0.261
3.97AspAsp: 3.97 ± 0.615
3.379AspGlu: 3.379 ± 0.551
2.534AspPhe: 2.534 ± 0.483
6.758AspGly: 6.758 ± 0.75
1.774AspHis: 1.774 ± 0.411
2.281AspIle: 2.281 ± 0.361
3.21AspLys: 3.21 ± 0.623
5.322AspLeu: 5.322 ± 0.788
1.689AspMet: 1.689 ± 0.352
2.281AspAsn: 2.281 ± 0.452
4.055AspPro: 4.055 ± 0.641
1.943AspGln: 1.943 ± 0.431
3.379AspArg: 3.379 ± 0.455
2.619AspSer: 2.619 ± 0.471
4.308AspThr: 4.308 ± 0.518
4.224AspVal: 4.224 ± 0.665
1.014AspTrp: 1.014 ± 0.283
2.281AspTyr: 2.281 ± 0.452
0.0AspXaa: 0.0 ± 0.0
Glu
7.265GluAla: 7.265 ± 0.851
0.845GluCys: 0.845 ± 0.246
3.379GluAsp: 3.379 ± 0.59
3.717GluGlu: 3.717 ± 0.591
2.619GluPhe: 2.619 ± 0.554
4.731GluGly: 4.731 ± 0.693
0.929GluHis: 0.929 ± 0.266
2.027GluIle: 2.027 ± 0.456
2.703GluLys: 2.703 ± 0.56
3.886GluLeu: 3.886 ± 0.485
2.027GluMet: 2.027 ± 0.306
1.521GluAsn: 1.521 ± 0.276
1.605GluPro: 1.605 ± 0.433
1.521GluGln: 1.521 ± 0.287
4.477GluArg: 4.477 ± 0.606
2.957GluSer: 2.957 ± 0.615
2.788GluThr: 2.788 ± 0.391
4.899GluVal: 4.899 ± 0.591
0.845GluTrp: 0.845 ± 0.327
1.352GluTyr: 1.352 ± 0.325
0.0GluXaa: 0.0 ± 0.0
Phe
2.872PheAla: 2.872 ± 0.311
0.422PheCys: 0.422 ± 0.216
2.788PheAsp: 2.788 ± 0.385
1.183PheGlu: 1.183 ± 0.276
1.436PhePhe: 1.436 ± 0.376
2.196PheGly: 2.196 ± 0.345
0.422PheHis: 0.422 ± 0.246
1.267PheIle: 1.267 ± 0.222
1.436PheLys: 1.436 ± 0.309
3.21PheLeu: 3.21 ± 0.556
1.436PheMet: 1.436 ± 0.307
1.943PheAsn: 1.943 ± 0.28
2.281PhePro: 2.281 ± 0.484
2.027PheGln: 2.027 ± 0.463
2.112PheArg: 2.112 ± 0.4
2.027PheSer: 2.027 ± 0.437
1.943PheThr: 1.943 ± 0.335
2.196PheVal: 2.196 ± 0.409
0.084PheTrp: 0.084 ± 0.089
1.267PheTyr: 1.267 ± 0.327
0.0PheXaa: 0.0 ± 0.0
Gly
8.025GlyAla: 8.025 ± 1.08
0.507GlyCys: 0.507 ± 0.249
5.322GlyAsp: 5.322 ± 0.679
5.068GlyGlu: 5.068 ± 0.612
2.112GlyPhe: 2.112 ± 0.526
6.336GlyGly: 6.336 ± 0.719
1.521GlyHis: 1.521 ± 0.375
4.646GlyIle: 4.646 ± 0.616
5.829GlyLys: 5.829 ± 1.209
6.758GlyLeu: 6.758 ± 0.638
2.112GlyMet: 2.112 ± 0.353
3.041GlyAsn: 3.041 ± 0.391
2.45GlyPro: 2.45 ± 0.332
3.632GlyGln: 3.632 ± 0.579
3.97GlyArg: 3.97 ± 0.566
5.153GlySer: 5.153 ± 0.863
4.984GlyThr: 4.984 ± 0.539
5.322GlyVal: 5.322 ± 0.787
2.196GlyTrp: 2.196 ± 0.488
2.872GlyTyr: 2.872 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
1.774HisAla: 1.774 ± 0.383
0.253HisCys: 0.253 ± 0.14
2.027HisAsp: 2.027 ± 0.421
1.014HisGlu: 1.014 ± 0.331
0.591HisPhe: 0.591 ± 0.257
1.605HisGly: 1.605 ± 0.408
0.253HisHis: 0.253 ± 0.149
1.098HisIle: 1.098 ± 0.354
1.098HisLys: 1.098 ± 0.278
2.281HisLeu: 2.281 ± 0.543
0.676HisMet: 0.676 ± 0.212
0.507HisAsn: 0.507 ± 0.217
1.014HisPro: 1.014 ± 0.345
0.253HisGln: 0.253 ± 0.195
1.352HisArg: 1.352 ± 0.325
1.352HisSer: 1.352 ± 0.325
1.605HisThr: 1.605 ± 0.509
1.183HisVal: 1.183 ± 0.3
0.338HisTrp: 0.338 ± 0.164
0.76HisTyr: 0.76 ± 0.28
0.0HisXaa: 0.0 ± 0.0
Ile
5.068IleAla: 5.068 ± 0.763
0.76IleCys: 0.76 ± 0.32
3.801IleAsp: 3.801 ± 0.687
2.196IleGlu: 2.196 ± 0.395
0.591IlePhe: 0.591 ± 0.306
2.703IleGly: 2.703 ± 0.387
1.183IleHis: 1.183 ± 0.225
2.112IleIle: 2.112 ± 0.483
2.872IleLys: 2.872 ± 0.604
3.041IleLeu: 3.041 ± 0.593
1.436IleMet: 1.436 ± 0.49
2.281IleAsn: 2.281 ± 0.434
2.45IlePro: 2.45 ± 0.608
1.943IleGln: 1.943 ± 0.562
2.957IleArg: 2.957 ± 0.396
1.521IleSer: 1.521 ± 0.304
3.126IleThr: 3.126 ± 0.388
3.801IleVal: 3.801 ± 0.502
0.929IleTrp: 0.929 ± 0.223
0.76IleTyr: 0.76 ± 0.243
0.0IleXaa: 0.0 ± 0.0
Lys
6.758LysAla: 6.758 ± 0.839
0.422LysCys: 0.422 ± 0.243
3.379LysAsp: 3.379 ± 0.594
2.619LysGlu: 2.619 ± 0.482
2.365LysPhe: 2.365 ± 0.374
3.97LysGly: 3.97 ± 0.594
1.352LysHis: 1.352 ± 0.339
1.436LysIle: 1.436 ± 0.293
2.619LysLys: 2.619 ± 0.615
5.322LysLeu: 5.322 ± 0.791
1.943LysMet: 1.943 ± 0.352
1.858LysAsn: 1.858 ± 0.53
2.872LysPro: 2.872 ± 0.458
2.196LysGln: 2.196 ± 0.467
3.21LysArg: 3.21 ± 0.612
3.21LysSer: 3.21 ± 0.473
2.703LysThr: 2.703 ± 0.535
4.308LysVal: 4.308 ± 0.637
0.76LysTrp: 0.76 ± 0.24
1.943LysTyr: 1.943 ± 0.461
0.0LysXaa: 0.0 ± 0.0
Leu
10.559LeuAla: 10.559 ± 1.128
0.169LeuCys: 0.169 ± 0.117
4.393LeuAsp: 4.393 ± 0.655
5.237LeuGlu: 5.237 ± 0.953
1.689LeuPhe: 1.689 ± 0.292
6.336LeuGly: 6.336 ± 1.018
1.521LeuHis: 1.521 ± 0.305
4.055LeuIle: 4.055 ± 0.522
5.322LeuLys: 5.322 ± 0.702
5.068LeuLeu: 5.068 ± 0.715
2.112LeuMet: 2.112 ± 0.427
2.703LeuAsn: 2.703 ± 0.479
3.548LeuPro: 3.548 ± 0.571
3.632LeuGln: 3.632 ± 0.555
4.984LeuArg: 4.984 ± 0.499
4.815LeuSer: 4.815 ± 0.643
4.477LeuThr: 4.477 ± 0.415
5.491LeuVal: 5.491 ± 0.591
1.605LeuTrp: 1.605 ± 0.381
2.365LeuTyr: 2.365 ± 0.478
0.0LeuXaa: 0.0 ± 0.0
Met
4.815MetAla: 4.815 ± 0.637
0.338MetCys: 0.338 ± 0.197
2.45MetAsp: 2.45 ± 0.381
1.183MetGlu: 1.183 ± 0.281
0.929MetPhe: 0.929 ± 0.328
3.126MetGly: 3.126 ± 0.452
0.253MetHis: 0.253 ± 0.135
1.098MetIle: 1.098 ± 0.287
1.183MetLys: 1.183 ± 0.308
1.689MetLeu: 1.689 ± 0.446
0.845MetMet: 0.845 ± 0.29
0.929MetAsn: 0.929 ± 0.256
1.858MetPro: 1.858 ± 0.392
0.676MetGln: 0.676 ± 0.255
1.689MetArg: 1.689 ± 0.352
1.774MetSer: 1.774 ± 0.403
2.112MetThr: 2.112 ± 0.392
2.45MetVal: 2.45 ± 0.584
0.0MetTrp: 0.0 ± 0.0
0.422MetTyr: 0.422 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
4.731AsnAla: 4.731 ± 0.646
0.0AsnCys: 0.0 ± 0.0
1.521AsnAsp: 1.521 ± 0.391
1.943AsnGlu: 1.943 ± 0.467
0.929AsnPhe: 0.929 ± 0.29
2.534AsnGly: 2.534 ± 0.506
0.845AsnHis: 0.845 ± 0.263
2.45AsnIle: 2.45 ± 0.418
1.943AsnLys: 1.943 ± 0.355
2.534AsnLeu: 2.534 ± 0.382
0.929AsnMet: 0.929 ± 0.235
0.929AsnAsn: 0.929 ± 0.247
2.027AsnPro: 2.027 ± 0.506
0.676AsnGln: 0.676 ± 0.222
2.027AsnArg: 2.027 ± 0.447
1.521AsnSer: 1.521 ± 0.323
2.196AsnThr: 2.196 ± 0.409
2.957AsnVal: 2.957 ± 0.385
0.676AsnTrp: 0.676 ± 0.229
1.605AsnTyr: 1.605 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
4.646ProAla: 4.646 ± 0.498
0.591ProCys: 0.591 ± 0.247
3.21ProAsp: 3.21 ± 0.549
3.717ProGlu: 3.717 ± 0.655
1.858ProPhe: 1.858 ± 0.521
3.548ProGly: 3.548 ± 0.514
1.183ProHis: 1.183 ± 0.33
1.858ProIle: 1.858 ± 0.477
1.943ProLys: 1.943 ± 0.405
2.788ProLeu: 2.788 ± 0.529
1.521ProMet: 1.521 ± 0.362
1.943ProAsn: 1.943 ± 0.469
2.45ProPro: 2.45 ± 0.462
1.774ProGln: 1.774 ± 0.393
1.774ProArg: 1.774 ± 0.479
2.534ProSer: 2.534 ± 0.508
3.126ProThr: 3.126 ± 0.462
3.463ProVal: 3.463 ± 0.384
0.676ProTrp: 0.676 ± 0.272
1.436ProTyr: 1.436 ± 0.407
0.0ProXaa: 0.0 ± 0.0
Gln
5.322GlnAla: 5.322 ± 0.714
0.338GlnCys: 0.338 ± 0.157
2.027GlnAsp: 2.027 ± 0.39
2.281GlnGlu: 2.281 ± 0.548
1.521GlnPhe: 1.521 ± 0.467
3.548GlnGly: 3.548 ± 0.417
1.183GlnHis: 1.183 ± 0.279
1.689GlnIle: 1.689 ± 0.383
1.267GlnLys: 1.267 ± 0.34
2.788GlnLeu: 2.788 ± 0.524
1.183GlnMet: 1.183 ± 0.277
1.098GlnAsn: 1.098 ± 0.27
0.929GlnPro: 0.929 ± 0.262
2.196GlnGln: 2.196 ± 0.34
2.365GlnArg: 2.365 ± 0.416
2.45GlnSer: 2.45 ± 0.484
1.521GlnThr: 1.521 ± 0.371
3.041GlnVal: 3.041 ± 0.772
0.338GlnTrp: 0.338 ± 0.163
1.098GlnTyr: 1.098 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
5.744ArgAla: 5.744 ± 0.641
1.014ArgCys: 1.014 ± 0.36
3.97ArgAsp: 3.97 ± 0.589
2.534ArgGlu: 2.534 ± 0.412
2.365ArgPhe: 2.365 ± 0.405
4.731ArgGly: 4.731 ± 0.537
1.521ArgHis: 1.521 ± 0.483
2.872ArgIle: 2.872 ± 0.6
3.97ArgLys: 3.97 ± 0.635
5.322ArgLeu: 5.322 ± 0.658
1.858ArgMet: 1.858 ± 0.398
2.365ArgAsn: 2.365 ± 0.382
2.365ArgPro: 2.365 ± 0.41
1.774ArgGln: 1.774 ± 0.31
3.463ArgArg: 3.463 ± 0.431
2.957ArgSer: 2.957 ± 0.382
3.294ArgThr: 3.294 ± 0.417
3.717ArgVal: 3.717 ± 0.514
0.591ArgTrp: 0.591 ± 0.219
1.605ArgTyr: 1.605 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
5.491SerAla: 5.491 ± 0.622
0.422SerCys: 0.422 ± 0.19
4.477SerAsp: 4.477 ± 0.543
2.703SerGlu: 2.703 ± 0.479
2.45SerPhe: 2.45 ± 0.461
5.237SerGly: 5.237 ± 0.633
1.183SerHis: 1.183 ± 0.313
2.619SerIle: 2.619 ± 0.435
2.872SerLys: 2.872 ± 0.462
4.055SerLeu: 4.055 ± 0.676
1.267SerMet: 1.267 ± 0.322
2.027SerAsn: 2.027 ± 0.625
2.788SerPro: 2.788 ± 0.569
1.689SerGln: 1.689 ± 0.281
2.703SerArg: 2.703 ± 0.38
3.717SerSer: 3.717 ± 0.852
3.294SerThr: 3.294 ± 0.493
3.379SerVal: 3.379 ± 0.524
0.76SerTrp: 0.76 ± 0.3
1.689SerTyr: 1.689 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
5.66ThrAla: 5.66 ± 0.795
0.676ThrCys: 0.676 ± 0.232
3.463ThrAsp: 3.463 ± 0.466
4.139ThrGlu: 4.139 ± 0.52
2.365ThrPhe: 2.365 ± 0.502
5.153ThrGly: 5.153 ± 0.611
1.014ThrHis: 1.014 ± 0.33
2.872ThrIle: 2.872 ± 0.511
3.97ThrLys: 3.97 ± 0.582
5.406ThrLeu: 5.406 ± 0.885
1.267ThrMet: 1.267 ± 0.275
1.521ThrAsn: 1.521 ± 0.333
3.632ThrPro: 3.632 ± 0.489
2.112ThrGln: 2.112 ± 0.47
2.957ThrArg: 2.957 ± 0.483
3.294ThrSer: 3.294 ± 0.547
3.717ThrThr: 3.717 ± 0.656
4.393ThrVal: 4.393 ± 0.552
0.76ThrTrp: 0.76 ± 0.195
2.027ThrTyr: 2.027 ± 0.455
0.0ThrXaa: 0.0 ± 0.0
Val
8.025ValAla: 8.025 ± 0.749
0.507ValCys: 0.507 ± 0.186
3.379ValAsp: 3.379 ± 0.543
4.477ValGlu: 4.477 ± 0.748
2.027ValPhe: 2.027 ± 0.346
4.984ValGly: 4.984 ± 0.572
1.689ValHis: 1.689 ± 0.474
3.463ValIle: 3.463 ± 0.558
3.21ValLys: 3.21 ± 0.505
5.744ValLeu: 5.744 ± 0.515
1.943ValMet: 1.943 ± 0.499
3.041ValAsn: 3.041 ± 0.474
2.365ValPro: 2.365 ± 0.379
3.379ValGln: 3.379 ± 0.444
4.393ValArg: 4.393 ± 0.45
4.308ValSer: 4.308 ± 0.689
4.984ValThr: 4.984 ± 0.839
4.055ValVal: 4.055 ± 0.683
0.929ValTrp: 0.929 ± 0.3
2.619ValTyr: 2.619 ± 0.432
0.0ValXaa: 0.0 ± 0.0
Trp
1.267TrpAla: 1.267 ± 0.312
0.169TrpCys: 0.169 ± 0.122
1.352TrpAsp: 1.352 ± 0.321
0.507TrpGlu: 0.507 ± 0.259
0.507TrpPhe: 0.507 ± 0.248
1.605TrpGly: 1.605 ± 0.466
0.507TrpHis: 0.507 ± 0.213
0.76TrpIle: 0.76 ± 0.192
1.014TrpLys: 1.014 ± 0.342
1.183TrpLeu: 1.183 ± 0.336
0.591TrpMet: 0.591 ± 0.286
0.507TrpAsn: 0.507 ± 0.161
0.676TrpPro: 0.676 ± 0.219
0.591TrpGln: 0.591 ± 0.159
0.76TrpArg: 0.76 ± 0.256
0.76TrpSer: 0.76 ± 0.299
0.845TrpThr: 0.845 ± 0.235
1.689TrpVal: 1.689 ± 0.463
0.507TrpTrp: 0.507 ± 0.189
0.338TrpTyr: 0.338 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.126TyrAla: 3.126 ± 0.484
0.338TyrCys: 0.338 ± 0.133
1.943TyrAsp: 1.943 ± 0.424
1.436TyrGlu: 1.436 ± 0.464
0.591TyrPhe: 0.591 ± 0.269
3.041TyrGly: 3.041 ± 0.514
0.507TyrHis: 0.507 ± 0.259
0.845TyrIle: 0.845 ± 0.253
1.858TyrLys: 1.858 ± 0.385
2.703TyrLeu: 2.703 ± 0.586
0.929TyrMet: 0.929 ± 0.284
0.845TyrAsn: 0.845 ± 0.252
1.014TyrPro: 1.014 ± 0.307
1.436TyrGln: 1.436 ± 0.323
2.112TyrArg: 2.112 ± 0.409
1.858TyrSer: 1.858 ± 0.407
2.027TyrThr: 2.027 ± 0.503
1.858TyrVal: 1.858 ± 0.428
0.845TyrTrp: 0.845 ± 0.233
0.507TyrTyr: 0.507 ± 0.271
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (11839 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski