Amino acid dipeptide frequency for Podoviridae sp. ctfa10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
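A minimal sketch of how such frequencies could be computed and compared with the uniform 2.5‰ baseline. It assumes one-letter sequences and counts overlapping adjacent pairs within each protein; the function names `dipeptide_permille` and `classify` are hypothetical, and the exact counting procedure used by the database is not stated on this page.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (pairs never span two different proteins).
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille baseline."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy input, not data from this proteome.
    for pair, freq in sorted(dipeptide_permille(["AAK", "KA"]).items()):
        print(pair, round(freq, 3), classify(freq))
```

With the table values, `classify(9.844)` (GluLeu) falls in the overrepresented group and `classify(0.24)` (AlaAla) in the underrepresented group.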

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
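As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1% = 10 per mille)."""
    return value_permille / 10.0


if __name__ == "__main__":
    glu_leu = 9.844  # GluLeu value taken from the table, in per mille
    print(permille_to_fraction(glu_leu))
    print(permille_to_percent(glu_leu))
```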

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.24 ± 0.231
AlaCys: 0.72 ± 0.486
AlaAsp: 2.641 ± 0.814
AlaGlu: 2.881 ± 0.871
AlaPhe: 2.881 ± 1.016
AlaGly: 1.921 ± 0.607
AlaHis: 0.72 ± 0.483
AlaIle: 5.282 ± 0.92
AlaLys: 1.921 ± 0.695
AlaLeu: 4.322 ± 1.095
AlaMet: 1.441 ± 0.531
AlaAsn: 3.842 ± 1.021
AlaPro: 0.48 ± 0.275
AlaGln: 1.921 ± 0.685
AlaArg: 1.921 ± 0.756
AlaSer: 2.401 ± 0.912
AlaThr: 3.361 ± 0.867
AlaVal: 2.401 ± 0.755
AlaTrp: 0.72 ± 0.46
AlaTyr: 1.921 ± 0.559
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.72 ± 0.367
CysCys: 0.24 ± 0.231
CysAsp: 1.921 ± 0.569
CysGlu: 0.72 ± 0.406
CysPhe: 0.72 ± 0.304
CysGly: 0.48 ± 0.297
CysHis: 0.0 ± 0.0
CysIle: 1.681 ± 0.496
CysLys: 0.96 ± 0.631
CysLeu: 1.2 ± 0.517
CysMet: 0.48 ± 0.294
CysAsn: 1.2 ± 0.578
CysPro: 0.72 ± 0.264
CysGln: 0.0 ± 0.0
CysArg: 0.48 ± 0.266
CysSer: 1.441 ± 0.288
CysThr: 0.96 ± 0.718
CysVal: 1.681 ± 0.579
CysTrp: 0.24 ± 0.231
CysTyr: 2.161 ± 0.829
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.921 ± 1.355
AspCys: 1.681 ± 0.713
AspAsp: 3.361 ± 1.509
AspGlu: 4.562 ± 0.773
AspPhe: 3.601 ± 0.805
AspGly: 2.161 ± 0.816
AspHis: 0.96 ± 0.286
AspIle: 3.601 ± 1.146
AspLys: 6.242 ± 0.958
AspLeu: 3.842 ± 1.052
AspMet: 1.681 ± 0.832
AspAsn: 3.601 ± 0.783
AspPro: 0.0 ± 0.0
AspGln: 0.96 ± 0.69
AspArg: 3.121 ± 0.982
AspSer: 3.361 ± 0.75
AspThr: 4.322 ± 0.977
AspVal: 3.361 ± 0.784
AspTrp: 1.441 ± 0.348
AspTyr: 5.282 ± 1.237
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.842 ± 0.715
GluCys: 0.72 ± 0.394
GluAsp: 2.641 ± 1.232
GluGlu: 4.082 ± 1.84
GluPhe: 4.562 ± 1.113
GluGly: 2.401 ± 0.56
GluHis: 0.96 ± 0.527
GluIle: 4.322 ± 0.834
GluLys: 4.562 ± 1.066
GluLeu: 9.844 ± 2.161
GluMet: 3.601 ± 0.824
GluAsn: 7.203 ± 1.282
GluPro: 0.96 ± 0.483
GluGln: 2.881 ± 0.849
GluArg: 3.121 ± 0.857
GluSer: 2.401 ± 0.699
GluThr: 4.802 ± 0.809
GluVal: 1.441 ± 0.524
GluTrp: 0.72 ± 0.322
GluTyr: 4.322 ± 1.038
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.121 ± 0.732
PheCys: 1.2 ± 0.466
PheAsp: 1.681 ± 0.648
PheGlu: 2.161 ± 0.509
PhePhe: 0.24 ± 0.219
PheGly: 3.842 ± 0.68
PheHis: 0.72 ± 0.451
PheIle: 3.601 ± 0.568
PheLys: 2.641 ± 0.702
PheLeu: 1.921 ± 0.616
PheMet: 1.2 ± 0.451
PheAsn: 3.842 ± 0.814
PhePro: 2.161 ± 0.734
PheGln: 0.96 ± 0.372
PheArg: 2.401 ± 1.071
PheSer: 2.401 ± 0.718
PheThr: 4.802 ± 0.948
PheVal: 2.881 ± 0.868
PheTrp: 0.24 ± 0.231
PheTyr: 3.361 ± 1.348
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.921 ± 0.504
GlyCys: 0.72 ± 0.264
GlyAsp: 3.601 ± 1.087
GlyGlu: 4.082 ± 1.139
GlyPhe: 3.361 ± 0.827
GlyGly: 3.842 ± 0.57
GlyHis: 0.0 ± 0.0
GlyIle: 5.042 ± 0.991
GlyLys: 4.562 ± 0.816
GlyLeu: 4.082 ± 1.084
GlyMet: 2.161 ± 0.787
GlyAsn: 4.562 ± 1.056
GlyPro: 0.0 ± 0.0
GlyGln: 0.96 ± 0.342
GlyArg: 2.161 ± 0.739
GlySer: 3.842 ± 1.111
GlyThr: 4.802 ± 1.373
GlyVal: 5.042 ± 1.108
GlyTrp: 0.96 ± 0.589
GlyTyr: 4.082 ± 0.847
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.72 ± 0.394
HisCys: 0.72 ± 0.317
HisAsp: 0.96 ± 0.474
HisGlu: 0.72 ± 0.462
HisPhe: 0.48 ± 0.295
HisGly: 0.72 ± 0.416
HisHis: 0.0 ± 0.0
HisIle: 0.72 ± 0.483
HisLys: 0.48 ± 0.383
HisLeu: 0.24 ± 0.231
HisMet: 0.24 ± 0.285
HisAsn: 0.72 ± 0.43
HisPro: 0.48 ± 0.327
HisGln: 0.24 ± 0.219
HisArg: 0.24 ± 0.231
HisSer: 1.441 ± 0.756
HisThr: 0.96 ± 0.722
HisVal: 0.24 ± 0.231
HisTrp: 0.48 ± 0.275
HisTyr: 0.48 ± 0.28
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.842 ± 0.88
IleCys: 1.441 ± 0.374
IleAsp: 4.082 ± 1.167
IleGlu: 5.522 ± 1.119
IlePhe: 1.921 ± 0.783
IleGly: 4.322 ± 0.874
IleHis: 0.72 ± 0.483
IleIle: 6.483 ± 1.739
IleLys: 4.562 ± 1.447
IleLeu: 2.401 ± 0.735
IleMet: 2.161 ± 0.856
IleAsn: 7.203 ± 1.2
IlePro: 2.881 ± 0.893
IleGln: 1.2 ± 0.415
IleArg: 3.842 ± 0.991
IleSer: 5.042 ± 0.953
IleThr: 5.522 ± 1.482
IleVal: 4.082 ± 0.873
IleTrp: 0.72 ± 0.314
IleTyr: 3.842 ± 1.159
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.842 ± 1.048
LysCys: 1.2 ± 0.443
LysAsp: 4.322 ± 1.187
LysGlu: 6.483 ± 1.533
LysPhe: 2.641 ± 0.879
LysGly: 4.802 ± 1.243
LysHis: 0.48 ± 0.28
LysIle: 3.842 ± 0.896
LysLys: 8.403 ± 1.47
LysLeu: 5.042 ± 1.448
LysMet: 2.641 ± 0.921
LysAsn: 7.443 ± 1.312
LysPro: 2.161 ± 0.715
LysGln: 2.881 ± 0.684
LysArg: 3.121 ± 1.388
LysSer: 4.082 ± 1.07
LysThr: 4.562 ± 0.872
LysVal: 3.361 ± 0.822
LysTrp: 1.441 ± 0.501
LysTyr: 4.322 ± 1.089
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.921 ± 0.655
LeuCys: 0.72 ± 0.474
LeuAsp: 5.042 ± 1.383
LeuGlu: 5.762 ± 1.649
LeuPhe: 3.121 ± 0.963
LeuGly: 4.322 ± 1.069
LeuHis: 0.48 ± 0.362
LeuIle: 2.881 ± 0.517
LeuLys: 7.683 ± 1.469
LeuLeu: 5.762 ± 1.156
LeuMet: 1.441 ± 0.472
LeuAsn: 4.562 ± 0.794
LeuPro: 1.921 ± 0.442
LeuGln: 2.401 ± 0.664
LeuArg: 2.641 ± 1.063
LeuSer: 6.723 ± 1.494
LeuThr: 6.242 ± 1.53
LeuVal: 3.361 ± 1.045
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.042 ± 0.972
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.681 ± 0.649
MetCys: 0.24 ± 0.234
MetAsp: 1.681 ± 0.505
MetGlu: 1.681 ± 0.464
MetPhe: 0.96 ± 0.529
MetGly: 2.641 ± 0.752
MetHis: 0.24 ± 0.227
MetIle: 2.641 ± 0.783
MetLys: 2.881 ± 0.675
MetLeu: 1.921 ± 0.941
MetMet: 0.24 ± 0.219
MetAsn: 2.881 ± 0.795
MetPro: 0.96 ± 0.435
MetGln: 0.48 ± 0.467
MetArg: 1.2 ± 0.468
MetSer: 2.161 ± 0.536
MetThr: 1.2 ± 0.557
MetVal: 1.441 ± 0.645
MetTrp: 0.72 ± 0.355
MetTyr: 2.641 ± 0.957
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.121 ± 0.628
AsnCys: 1.681 ± 0.595
AsnAsp: 4.322 ± 0.761
AsnGlu: 4.802 ± 1.254
AsnPhe: 3.842 ± 1.052
AsnGly: 5.042 ± 1.173
AsnHis: 0.96 ± 0.552
AsnIle: 6.483 ± 1.219
AsnLys: 6.723 ± 1.879
AsnLeu: 5.282 ± 1.148
AsnMet: 1.2 ± 0.448
AsnAsn: 4.562 ± 1.829
AsnPro: 3.121 ± 0.664
AsnGln: 2.161 ± 0.829
AsnArg: 3.121 ± 1.042
AsnSer: 4.562 ± 0.901
AsnThr: 4.802 ± 1.515
AsnVal: 5.042 ± 0.852
AsnTrp: 0.96 ± 0.433
AsnTyr: 4.322 ± 1.202
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.72 ± 0.384
ProCys: 0.24 ± 0.279
ProAsp: 1.921 ± 0.59
ProGlu: 1.681 ± 0.672
ProPhe: 1.681 ± 0.631
ProGly: 0.24 ± 0.227
ProHis: 0.0 ± 0.0
ProIle: 1.441 ± 0.655
ProLys: 1.681 ± 0.64
ProLeu: 2.161 ± 0.56
ProMet: 0.24 ± 0.219
ProAsn: 1.681 ± 0.725
ProPro: 0.48 ± 0.351
ProGln: 4.082 ± 3.959
ProArg: 0.0 ± 0.0
ProSer: 2.641 ± 0.996
ProThr: 0.72 ± 0.456
ProVal: 3.121 ± 0.846
ProTrp: 0.96 ± 0.324
ProTyr: 3.601 ± 0.659
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.401 ± 0.915
GlnCys: 0.48 ± 0.294
GlnAsp: 0.72 ± 0.498
GlnGlu: 3.121 ± 0.809
GlnPhe: 1.2 ± 0.649
GlnGly: 1.441 ± 0.427
GlnHis: 0.24 ± 0.234
GlnIle: 1.2 ± 0.556
GlnLys: 1.921 ± 0.627
GlnLeu: 2.881 ± 0.897
GlnMet: 0.96 ± 0.595
GlnAsn: 1.681 ± 0.702
GlnPro: 3.601 ± 3.723
GlnGln: 0.96 ± 0.455
GlnArg: 0.72 ± 0.694
GlnSer: 2.641 ± 1.111
GlnThr: 1.2 ± 0.388
GlnVal: 1.441 ± 0.754
GlnTrp: 1.681 ± 0.644
GlnTyr: 1.2 ± 0.529
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.48 ± 0.383
ArgCys: 0.48 ± 0.28
ArgAsp: 1.441 ± 0.705
ArgGlu: 2.881 ± 1.128
ArgPhe: 1.441 ± 0.706
ArgGly: 1.921 ± 0.52
ArgHis: 0.72 ± 0.537
ArgIle: 2.881 ± 0.744
ArgLys: 2.641 ± 0.852
ArgLeu: 2.641 ± 1.026
ArgMet: 1.441 ± 0.547
ArgAsn: 4.082 ± 0.799
ArgPro: 1.921 ± 0.774
ArgGln: 0.96 ± 0.5
ArgArg: 1.921 ± 0.822
ArgSer: 3.361 ± 0.583
ArgThr: 1.681 ± 0.704
ArgVal: 2.161 ± 0.763
ArgTrp: 0.72 ± 0.372
ArgTyr: 2.881 ± 1.286
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.401 ± 0.675
SerCys: 0.96 ± 0.372
SerAsp: 4.562 ± 0.995
SerGlu: 3.842 ± 0.856
SerPhe: 3.361 ± 0.857
SerGly: 6.242 ± 1.165
SerHis: 0.72 ± 0.45
SerIle: 5.762 ± 0.884
SerLys: 5.762 ± 2.026
SerLeu: 3.361 ± 1.189
SerMet: 3.121 ± 1.068
SerAsn: 3.121 ± 1.002
SerPro: 1.441 ± 0.602
SerGln: 2.641 ± 0.807
SerArg: 3.121 ± 0.752
SerSer: 2.641 ± 0.832
SerThr: 4.322 ± 0.874
SerVal: 3.601 ± 0.711
SerTrp: 0.96 ± 0.498
SerTyr: 3.121 ± 0.919
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.601 ± 0.943
ThrCys: 1.441 ± 1.162
ThrAsp: 4.082 ± 0.896
ThrGlu: 4.562 ± 1.086
ThrPhe: 2.401 ± 0.601
ThrGly: 5.762 ± 0.896
ThrHis: 1.2 ± 0.585
ThrIle: 4.562 ± 1.311
ThrLys: 3.842 ± 1.319
ThrLeu: 4.562 ± 1.131
ThrMet: 2.401 ± 0.759
ThrAsn: 4.802 ± 1.935
ThrPro: 2.641 ± 0.722
ThrGln: 1.681 ± 0.722
ThrArg: 2.881 ± 0.822
ThrSer: 3.842 ± 0.978
ThrThr: 5.282 ± 1.569
ThrVal: 4.322 ± 0.849
ThrTrp: 0.96 ± 0.579
ThrTyr: 1.681 ± 0.497
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.881 ± 0.825
ValCys: 0.48 ± 0.358
ValAsp: 3.361 ± 0.774
ValGlu: 3.842 ± 0.832
ValPhe: 3.361 ± 0.765
ValGly: 2.881 ± 0.893
ValHis: 1.2 ± 0.55
ValIle: 3.601 ± 1.105
ValLys: 4.562 ± 1.19
ValLeu: 5.282 ± 1.49
ValMet: 1.681 ± 0.602
ValAsn: 3.842 ± 0.964
ValPro: 1.681 ± 0.396
ValGln: 2.401 ± 0.564
ValArg: 0.96 ± 0.59
ValSer: 4.562 ± 0.607
ValThr: 3.361 ± 0.927
ValVal: 3.601 ± 0.799
ValTrp: 1.2 ± 0.388
ValTyr: 1.681 ± 0.569
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.48 ± 0.317
TrpCys: 0.24 ± 0.234
TrpAsp: 0.48 ± 0.298
TrpGlu: 1.2 ± 0.369
TrpPhe: 0.96 ± 0.663
TrpGly: 1.441 ± 0.464
TrpHis: 0.0 ± 0.0
TrpIle: 0.48 ± 0.276
TrpLys: 1.681 ± 0.629
TrpLeu: 0.96 ± 0.453
TrpMet: 0.72 ± 0.455
TrpAsn: 0.24 ± 0.231
TrpPro: 0.24 ± 0.227
TrpGln: 0.72 ± 0.346
TrpArg: 0.24 ± 0.231
TrpSer: 1.681 ± 0.537
TrpThr: 1.2 ± 0.543
TrpVal: 1.2 ± 0.323
TrpTrp: 0.72 ± 0.452
TrpTyr: 1.2 ± 0.797
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.082 ± 0.551
TyrCys: 2.401 ± 0.768
TyrAsp: 6.242 ± 0.693
TyrGlu: 4.802 ± 1.239
TyrPhe: 2.641 ± 0.581
TyrGly: 3.361 ± 0.623
TyrHis: 0.96 ± 0.43
TyrIle: 5.282 ± 0.707
TyrLys: 3.601 ± 1.21
TyrLeu: 4.322 ± 0.683
TyrMet: 1.2 ± 0.442
TyrAsn: 5.042 ± 1.316
TyrPro: 1.441 ± 0.464
TyrGln: 1.2 ± 0.335
TyrArg: 1.441 ± 0.383
TyrSer: 3.842 ± 0.882
TyrThr: 2.401 ± 0.854
TyrVal: 2.641 ± 0.719
TyrTrp: 0.24 ± 0.203
TyrTyr: 4.802 ± 0.975
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (4166 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
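A minimal sketch of what protein-level bootstrapping could look like: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names, the use of the sample standard deviation, and the pair-counting scheme are assumptions for illustration; the exact procedure behind the ± values is not described on this page.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not data from this proteome.
    toy = ["AKAKLL", "KKAALA", "LLAKAK", "AAAKKK"]
    print(pair_permille(toy, "AK"), bootstrap_error(toy, "AK"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so dipeptides concentrated in a single protein show up as a large error.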

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski