Amino acid dipepetide frequency for Clostridium phage susfortuna

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
1.016AlaCys: 1.016 ± 0.29
2.878AlaAsp: 2.878 ± 0.586
2.37AlaGlu: 2.37 ± 0.664
1.524AlaPhe: 1.524 ± 0.453
5.418AlaGly: 5.418 ± 3.477
1.185AlaHis: 1.185 ± 0.473
2.878AlaIle: 2.878 ± 0.689
3.386AlaLys: 3.386 ± 0.563
2.709AlaLeu: 2.709 ± 0.481
0.847AlaMet: 0.847 ± 0.374
2.201AlaAsn: 2.201 ± 0.646
1.524AlaPro: 1.524 ± 0.453
1.524AlaGln: 1.524 ± 0.783
0.677AlaArg: 0.677 ± 0.26
1.355AlaSer: 1.355 ± 0.405
1.524AlaThr: 1.524 ± 0.491
1.693AlaVal: 1.693 ± 0.536
0.0AlaTrp: 0.0 ± 0.0
2.37AlaTyr: 2.37 ± 0.52
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.508CysCys: 0.508 ± 0.215
0.677CysAsp: 0.677 ± 0.275
1.185CysGlu: 1.185 ± 0.403
1.524CysPhe: 1.524 ± 0.653
1.355CysGly: 1.355 ± 0.532
0.339CysHis: 0.339 ± 0.232
0.508CysIle: 0.508 ± 0.361
1.016CysLys: 1.016 ± 0.388
1.016CysLeu: 1.016 ± 0.465
0.339CysMet: 0.339 ± 0.202
0.847CysAsn: 0.847 ± 0.347
0.339CysPro: 0.339 ± 0.208
0.508CysGln: 0.508 ± 0.297
0.339CysArg: 0.339 ± 0.251
1.016CysSer: 1.016 ± 0.519
1.016CysThr: 1.016 ± 0.331
0.508CysVal: 0.508 ± 0.277
0.0CysTrp: 0.0 ± 0.0
1.185CysTyr: 1.185 ± 0.43
0.0CysXaa: 0.0 ± 0.0
Asp
1.693AspAla: 1.693 ± 0.535
1.016AspCys: 1.016 ± 0.425
2.878AspAsp: 2.878 ± 0.562
4.233AspGlu: 4.233 ± 0.852
3.894AspPhe: 3.894 ± 0.788
6.095AspGly: 6.095 ± 1.361
0.847AspHis: 0.847 ± 0.34
5.418AspIle: 5.418 ± 0.806
5.757AspLys: 5.757 ± 1.103
5.588AspLeu: 5.588 ± 0.937
0.847AspMet: 0.847 ± 0.373
4.402AspAsn: 4.402 ± 0.863
1.016AspPro: 1.016 ± 0.395
0.847AspGln: 0.847 ± 0.372
3.556AspArg: 3.556 ± 0.654
3.556AspSer: 3.556 ± 0.728
2.54AspThr: 2.54 ± 0.6
2.709AspVal: 2.709 ± 0.561
1.355AspTrp: 1.355 ± 0.945
4.233AspTyr: 4.233 ± 0.807
0.0AspXaa: 0.0 ± 0.0
Glu
1.693GluAla: 1.693 ± 0.607
2.201GluCys: 2.201 ± 0.663
3.556GluAsp: 3.556 ± 0.615
5.588GluGlu: 5.588 ± 1.419
2.201GluPhe: 2.201 ± 0.748
4.741GluGly: 4.741 ± 0.929
1.016GluHis: 1.016 ± 0.365
8.635GluIle: 8.635 ± 1.111
8.466GluLys: 8.466 ± 1.456
9.821GluLeu: 9.821 ± 1.923
3.386GluMet: 3.386 ± 0.804
8.127GluAsn: 8.127 ± 1.599
2.032GluPro: 2.032 ± 0.582
3.556GluGln: 3.556 ± 0.938
3.894GluArg: 3.894 ± 0.739
2.878GluSer: 2.878 ± 0.767
3.556GluThr: 3.556 ± 0.822
4.064GluVal: 4.064 ± 0.747
0.508GluTrp: 0.508 ± 0.254
3.894GluTyr: 3.894 ± 0.639
0.0GluXaa: 0.0 ± 0.0
Phe
2.37PheAla: 2.37 ± 0.72
0.677PheCys: 0.677 ± 0.332
3.386PheAsp: 3.386 ± 1.107
3.386PheGlu: 3.386 ± 0.846
1.355PhePhe: 1.355 ± 0.522
2.878PheGly: 2.878 ± 0.496
0.677PheHis: 0.677 ± 0.315
5.588PheIle: 5.588 ± 1.277
2.878PheLys: 2.878 ± 0.596
3.725PheLeu: 3.725 ± 0.93
1.524PheMet: 1.524 ± 0.537
3.048PheAsn: 3.048 ± 0.643
1.524PhePro: 1.524 ± 0.643
1.355PheGln: 1.355 ± 0.434
2.54PheArg: 2.54 ± 0.622
2.032PheSer: 2.032 ± 0.658
3.386PheThr: 3.386 ± 0.677
2.878PheVal: 2.878 ± 0.522
0.169PheTrp: 0.169 ± 0.204
2.032PheTyr: 2.032 ± 0.513
0.0PheXaa: 0.0 ± 0.0
Gly
2.878GlyAla: 2.878 ± 0.886
1.185GlyCys: 1.185 ± 0.437
4.741GlyAsp: 4.741 ± 1.136
5.588GlyGlu: 5.588 ± 1.087
2.54GlyPhe: 2.54 ± 0.674
4.233GlyGly: 4.233 ± 1.118
0.677GlyHis: 0.677 ± 0.391
4.91GlyIle: 4.91 ± 0.842
5.588GlyLys: 5.588 ± 1.063
6.434GlyLeu: 6.434 ± 0.93
0.847GlyMet: 0.847 ± 0.319
3.386GlyAsn: 3.386 ± 0.718
5.588GlyPro: 5.588 ± 3.913
1.185GlyGln: 1.185 ± 0.418
2.032GlyArg: 2.032 ± 0.58
3.386GlySer: 3.386 ± 0.694
2.54GlyThr: 2.54 ± 0.864
6.942GlyVal: 6.942 ± 1.162
0.339GlyTrp: 0.339 ± 0.218
2.878GlyTyr: 2.878 ± 0.527
0.0GlyXaa: 0.0 ± 0.0
His
0.169HisAla: 0.169 ± 0.164
0.169HisCys: 0.169 ± 0.215
0.847HisAsp: 0.847 ± 0.38
1.185HisGlu: 1.185 ± 0.477
1.185HisPhe: 1.185 ± 0.403
1.524HisGly: 1.524 ± 0.627
0.339HisHis: 0.339 ± 0.227
1.355HisIle: 1.355 ± 0.456
0.508HisLys: 0.508 ± 0.283
1.355HisLeu: 1.355 ± 0.493
0.169HisMet: 0.169 ± 0.164
1.524HisAsn: 1.524 ± 0.505
0.169HisPro: 0.169 ± 0.166
0.0HisGln: 0.0 ± 0.0
0.169HisArg: 0.169 ± 0.184
1.185HisSer: 1.185 ± 0.446
0.677HisThr: 0.677 ± 0.382
1.016HisVal: 1.016 ± 0.356
0.339HisTrp: 0.339 ± 0.202
0.847HisTyr: 0.847 ± 0.37
0.0HisXaa: 0.0 ± 0.0
Ile
2.032IleAla: 2.032 ± 0.535
0.677IleCys: 0.677 ± 0.323
6.095IleAsp: 6.095 ± 0.848
7.111IleGlu: 7.111 ± 1.059
4.233IlePhe: 4.233 ± 0.927
5.588IleGly: 5.588 ± 1.381
0.677IleHis: 0.677 ± 0.278
5.757IleIle: 5.757 ± 1.134
7.619IleLys: 7.619 ± 1.307
5.08IleLeu: 5.08 ± 1.048
2.201IleMet: 2.201 ± 0.53
7.619IleAsn: 7.619 ± 1.251
2.201IlePro: 2.201 ± 0.72
2.878IleGln: 2.878 ± 0.752
2.878IleArg: 2.878 ± 0.738
4.741IleSer: 4.741 ± 0.837
3.556IleThr: 3.556 ± 0.779
4.233IleVal: 4.233 ± 0.776
0.847IleTrp: 0.847 ± 0.27
3.894IleTyr: 3.894 ± 0.866
0.0IleXaa: 0.0 ± 0.0
Lys
4.572LysAla: 4.572 ± 0.92
1.016LysCys: 1.016 ± 0.496
5.588LysAsp: 5.588 ± 0.929
10.159LysGlu: 10.159 ± 1.879
3.048LysPhe: 3.048 ± 0.67
7.281LysGly: 7.281 ± 1.059
1.524LysHis: 1.524 ± 0.476
8.466LysIle: 8.466 ± 1.189
6.942LysLys: 6.942 ± 1.253
8.466LysLeu: 8.466 ± 1.268
2.201LysMet: 2.201 ± 0.608
7.281LysAsn: 7.281 ± 1.183
1.524LysPro: 1.524 ± 0.405
2.54LysGln: 2.54 ± 0.808
3.386LysArg: 3.386 ± 0.856
4.91LysSer: 4.91 ± 0.821
3.725LysThr: 3.725 ± 0.745
4.402LysVal: 4.402 ± 0.921
1.355LysTrp: 1.355 ± 0.489
3.894LysTyr: 3.894 ± 0.821
0.0LysXaa: 0.0 ± 0.0
Leu
2.54LeuAla: 2.54 ± 0.64
0.677LeuCys: 0.677 ± 0.256
4.91LeuAsp: 4.91 ± 1.134
7.281LeuGlu: 7.281 ± 1.196
3.894LeuPhe: 3.894 ± 1.04
5.418LeuGly: 5.418 ± 1.067
1.693LeuHis: 1.693 ± 0.54
5.588LeuIle: 5.588 ± 1.101
10.328LeuLys: 10.328 ± 1.553
6.773LeuLeu: 6.773 ± 1.045
1.524LeuMet: 1.524 ± 0.561
7.111LeuAsn: 7.111 ± 1.314
2.37LeuPro: 2.37 ± 0.58
3.217LeuGln: 3.217 ± 0.636
2.878LeuArg: 2.878 ± 0.663
4.064LeuSer: 4.064 ± 0.779
3.386LeuThr: 3.386 ± 0.659
3.386LeuVal: 3.386 ± 0.713
0.677LeuTrp: 0.677 ± 0.309
4.572LeuTyr: 4.572 ± 0.984
0.0LeuXaa: 0.0 ± 0.0
Met
1.016MetAla: 1.016 ± 0.522
0.169MetCys: 0.169 ± 0.155
1.524MetAsp: 1.524 ± 0.435
1.693MetGlu: 1.693 ± 0.69
1.185MetPhe: 1.185 ± 0.479
0.677MetGly: 0.677 ± 0.374
0.0MetHis: 0.0 ± 0.0
1.693MetIle: 1.693 ± 0.421
2.709MetLys: 2.709 ± 0.706
1.355MetLeu: 1.355 ± 0.419
1.016MetMet: 1.016 ± 0.493
1.185MetAsn: 1.185 ± 0.371
0.508MetPro: 0.508 ± 0.291
0.508MetGln: 0.508 ± 0.25
0.677MetArg: 0.677 ± 0.296
1.693MetSer: 1.693 ± 0.474
1.863MetThr: 1.863 ± 0.445
1.355MetVal: 1.355 ± 0.556
0.169MetTrp: 0.169 ± 0.155
1.524MetTyr: 1.524 ± 0.473
0.0MetXaa: 0.0 ± 0.0
Asn
2.709AsnAla: 2.709 ± 0.666
0.677AsnCys: 0.677 ± 0.295
5.08AsnAsp: 5.08 ± 1.196
5.08AsnGlu: 5.08 ± 0.967
2.878AsnPhe: 2.878 ± 0.839
5.249AsnGly: 5.249 ± 1.184
0.847AsnHis: 0.847 ± 0.389
6.773AsnIle: 6.773 ± 1.595
8.297AsnLys: 8.297 ± 1.193
5.08AsnLeu: 5.08 ± 0.943
1.355AsnMet: 1.355 ± 0.359
5.926AsnAsn: 5.926 ± 0.947
2.201AsnPro: 2.201 ± 0.507
2.54AsnGln: 2.54 ± 0.66
3.725AsnArg: 3.725 ± 0.926
5.757AsnSer: 5.757 ± 1.542
2.54AsnThr: 2.54 ± 0.736
4.572AsnVal: 4.572 ± 0.701
1.016AsnTrp: 1.016 ± 0.317
4.572AsnTyr: 4.572 ± 0.995
0.0AsnXaa: 0.0 ± 0.0
Pro
3.894ProAla: 3.894 ± 3.004
0.169ProCys: 0.169 ± 0.164
1.355ProAsp: 1.355 ± 0.286
2.709ProGlu: 2.709 ± 0.693
0.677ProPhe: 0.677 ± 0.406
0.169ProGly: 0.169 ± 0.155
0.508ProHis: 0.508 ± 0.251
2.37ProIle: 2.37 ± 0.655
2.201ProLys: 2.201 ± 0.692
1.693ProLeu: 1.693 ± 0.392
0.339ProMet: 0.339 ± 0.203
1.524ProAsn: 1.524 ± 0.458
0.677ProPro: 0.677 ± 0.355
1.863ProGln: 1.863 ± 0.489
1.016ProArg: 1.016 ± 0.452
2.37ProSer: 2.37 ± 0.665
0.847ProThr: 0.847 ± 0.382
2.54ProVal: 2.54 ± 0.678
0.0ProTrp: 0.0 ± 0.0
2.709ProTyr: 2.709 ± 0.59
0.0ProXaa: 0.0 ± 0.0
Gln
1.524GlnAla: 1.524 ± 0.556
0.508GlnCys: 0.508 ± 0.283
1.524GlnAsp: 1.524 ± 0.468
3.556GlnGlu: 3.556 ± 0.915
1.355GlnPhe: 1.355 ± 0.336
2.878GlnGly: 2.878 ± 0.821
0.677GlnHis: 0.677 ± 0.353
1.185GlnIle: 1.185 ± 0.406
1.863GlnLys: 1.863 ± 0.592
2.37GlnLeu: 2.37 ± 0.664
0.339GlnMet: 0.339 ± 0.206
2.709GlnAsn: 2.709 ± 0.66
0.677GlnPro: 0.677 ± 0.332
2.37GlnGln: 2.37 ± 0.549
1.524GlnArg: 1.524 ± 0.54
1.693GlnSer: 1.693 ± 0.489
1.185GlnThr: 1.185 ± 0.372
1.355GlnVal: 1.355 ± 0.66
0.847GlnTrp: 0.847 ± 0.326
1.355GlnTyr: 1.355 ± 0.382
0.0GlnXaa: 0.0 ± 0.0
Arg
1.524ArgAla: 1.524 ± 0.54
0.847ArgCys: 0.847 ± 0.321
1.863ArgAsp: 1.863 ± 0.66
4.572ArgGlu: 4.572 ± 0.882
2.709ArgPhe: 2.709 ± 0.6
2.54ArgGly: 2.54 ± 0.769
0.169ArgHis: 0.169 ± 0.157
3.217ArgIle: 3.217 ± 0.897
3.556ArgLys: 3.556 ± 0.949
3.556ArgLeu: 3.556 ± 0.742
1.016ArgMet: 1.016 ± 0.347
2.54ArgAsn: 2.54 ± 0.626
1.185ArgPro: 1.185 ± 0.375
0.508ArgGln: 0.508 ± 0.28
1.524ArgArg: 1.524 ± 0.45
1.524ArgSer: 1.524 ± 0.666
1.016ArgThr: 1.016 ± 0.435
2.878ArgVal: 2.878 ± 0.667
0.847ArgTrp: 0.847 ± 0.301
1.355ArgTyr: 1.355 ± 0.404
0.0ArgXaa: 0.0 ± 0.0
Ser
2.878SerAla: 2.878 ± 0.832
0.847SerCys: 0.847 ± 0.432
3.725SerAsp: 3.725 ± 0.916
4.741SerGlu: 4.741 ± 0.795
3.386SerPhe: 3.386 ± 0.913
2.201SerGly: 2.201 ± 0.7
0.677SerHis: 0.677 ± 0.365
3.386SerIle: 3.386 ± 0.596
4.91SerLys: 4.91 ± 1.171
4.572SerLeu: 4.572 ± 0.85
1.016SerMet: 1.016 ± 0.342
5.08SerAsn: 5.08 ± 1.12
0.847SerPro: 0.847 ± 0.426
1.355SerGln: 1.355 ± 0.407
1.524SerArg: 1.524 ± 0.489
2.032SerSer: 2.032 ± 1.006
2.201SerThr: 2.201 ± 0.82
2.878SerVal: 2.878 ± 0.853
0.339SerTrp: 0.339 ± 0.229
2.54SerTyr: 2.54 ± 0.721
0.0SerXaa: 0.0 ± 0.0
Thr
0.677ThrAla: 0.677 ± 0.268
0.677ThrCys: 0.677 ± 0.379
2.54ThrAsp: 2.54 ± 0.458
4.402ThrGlu: 4.402 ± 1.305
2.54ThrPhe: 2.54 ± 0.83
3.386ThrGly: 3.386 ± 0.733
0.339ThrHis: 0.339 ± 0.175
3.048ThrIle: 3.048 ± 0.695
4.064ThrLys: 4.064 ± 0.751
4.233ThrLeu: 4.233 ± 0.866
0.847ThrMet: 0.847 ± 0.343
3.556ThrAsn: 3.556 ± 1.022
1.524ThrPro: 1.524 ± 0.489
1.524ThrGln: 1.524 ± 0.389
1.863ThrArg: 1.863 ± 0.59
2.37ThrSer: 2.37 ± 0.563
4.233ThrThr: 4.233 ± 1.132
2.37ThrVal: 2.37 ± 0.416
0.677ThrTrp: 0.677 ± 0.316
2.032ThrTyr: 2.032 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
2.032ValAla: 2.032 ± 0.632
0.169ValCys: 0.169 ± 0.184
5.249ValAsp: 5.249 ± 0.623
3.725ValGlu: 3.725 ± 0.862
3.894ValPhe: 3.894 ± 1.065
2.032ValGly: 2.032 ± 0.682
1.016ValHis: 1.016 ± 0.342
4.741ValIle: 4.741 ± 0.774
6.603ValLys: 6.603 ± 0.925
4.064ValLeu: 4.064 ± 0.761
1.355ValMet: 1.355 ± 0.463
5.418ValAsn: 5.418 ± 0.716
1.185ValPro: 1.185 ± 0.483
1.016ValGln: 1.016 ± 0.38
2.54ValArg: 2.54 ± 0.534
2.032ValSer: 2.032 ± 0.719
3.556ValThr: 3.556 ± 0.759
2.709ValVal: 2.709 ± 0.945
0.339ValTrp: 0.339 ± 0.216
2.37ValTyr: 2.37 ± 0.456
0.0ValXaa: 0.0 ± 0.0
Trp
0.169TrpAla: 0.169 ± 0.165
0.0TrpCys: 0.0 ± 0.0
1.185TrpAsp: 1.185 ± 0.526
1.355TrpGlu: 1.355 ± 0.5
0.677TrpPhe: 0.677 ± 0.35
1.016TrpGly: 1.016 ± 0.369
0.169TrpHis: 0.169 ± 0.165
1.355TrpIle: 1.355 ± 0.46
0.169TrpLys: 0.169 ± 0.144
1.185TrpLeu: 1.185 ± 0.357
0.169TrpMet: 0.169 ± 0.15
0.169TrpAsn: 0.169 ± 0.163
0.0TrpPro: 0.0 ± 0.0
0.508TrpGln: 0.508 ± 0.235
0.508TrpArg: 0.508 ± 0.312
0.169TrpSer: 0.169 ± 0.164
0.508TrpThr: 0.508 ± 0.256
0.677TrpVal: 0.677 ± 0.381
0.0TrpTrp: 0.0 ± 0.0
0.677TrpTyr: 0.677 ± 0.44
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.709TyrAla: 2.709 ± 0.691
1.016TyrCys: 1.016 ± 0.429
2.878TyrAsp: 2.878 ± 0.714
3.894TyrGlu: 3.894 ± 1.113
2.709TyrPhe: 2.709 ± 0.74
3.217TyrGly: 3.217 ± 0.807
1.355TyrHis: 1.355 ± 0.373
3.048TyrIle: 3.048 ± 0.566
5.418TyrLys: 5.418 ± 0.793
3.217TyrLeu: 3.217 ± 0.562
1.016TyrMet: 1.016 ± 0.448
3.217TyrAsn: 3.217 ± 0.86
2.54TyrPro: 2.54 ± 0.705
1.693TyrGln: 1.693 ± 0.585
1.693TyrArg: 1.693 ± 0.468
2.37TyrSer: 2.37 ± 0.471
3.048TyrThr: 3.048 ± 0.645
2.878TyrVal: 2.878 ± 0.53
0.847TyrTrp: 0.847 ± 0.401
4.91TyrTyr: 4.91 ± 0.966
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (5907 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski