Amino acid dipeptide frequency for Hubei leech virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
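As a minimal sketch of how such frequencies could be computed in Python: the function name `dipeptide_permille`, the one-letter input alphabet, the mapping of every non-standard letter to `X` (Xaa), and the use of overlapping windows within each protein are all assumptions made for illustration, not a description of how Proteome-pI itself counts.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Hypothetical helper: pairs are counted with overlapping windows and
    never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (made up, not the Hubei leech virus 4 proteome).
freqs = dipeptide_permille(["MKTAYIAK", "MKBA"])
for dp, value in sorted(freqs.items()):
    print(dp, round(value, 3))
```

The toy input yields 10 dipeptides in total, so a pair seen twice (here `MK`) comes out at 200‰; on a real proteome the values are far smaller because the denominator is in the thousands or more.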

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
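The uniform expectation and the over/under comparison can be sketched in Python as follows. The `classify` helper and its plain threshold are assumptions: the page does not state whether the colouring uses exactly 1/400 as the cut-off or takes the error estimate into account. The four sample values are copied from the table.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5, i.e. 0.25 %.
expected_permille = 1000.0 / N_STANDARD ** 2
# With Xaa as a 21st residue there are 441 pairs (about 2.268 per mille each).
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_permille > expected:
        return "over"    # would be marked in red
    if observed_permille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values in per mille, taken from the table on this page.
sample = {"SerLeu": 10.05, "AlaAla": 5.025, "GluGlu": 2.513, "CysHis": 0.0}
labels = {dp: classify(value) for dp, value in sample.items()}

# Converting a per mille value to a plain fraction: multiply by 10^-3.
ser_leu_fraction = sample["SerLeu"] * 1e-3

print(expected_permille, labels, ser_leu_fraction)
```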

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency in ‰ ± the estimated error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.025 ± 0.404 | 1.256 ± 0.079 | 1.675 ± 1.096 | 1.675 ± 1.067 | 5.863 ± 0.85 | 4.606 ± 0.671 | 0.838 ± 0.187 | 2.931 ± 1.146 | 4.188 ± 0.216 | 3.769 ± 0.238 | 2.094 ± 0.108 | 5.444 ± 1.579 | 1.675 ± 0.375 | 3.35 ± 1.413 | 1.675 ± 1.096 | 8.375 ± 3.316 | 8.794 ± 1.608 | 3.769 ± 1.204 | 1.256 ± 1.362 | 2.513 ± 0.158 | 0.0 ± 0.0 |
| Cys | 0.838 ± 0.533 | 0.838 ± 0.187 | 0.419 ± 0.454 | 1.675 ± 1.067 | 0.838 ± 0.533 | 0.419 ± 0.267 | 0.0 ± 0.0 | 2.094 ± 0.613 | 1.256 ± 0.079 | 2.094 ± 0.108 | 0.419 ± 0.267 | 0.0 ± 0.0 | 1.675 ± 0.375 | 0.0 ± 0.0 | 0.419 ± 0.267 | 1.675 ± 0.346 | 2.094 ± 0.613 | 2.513 ± 0.879 | 0.0 ± 0.0 | 0.419 ± 0.267 | 0.0 ± 0.0 |
| Asp | 1.675 ± 1.067 | 1.256 ± 0.8 | 3.769 ± 0.959 | 4.606 ± 0.771 | 2.931 ± 0.425 | 2.094 ± 0.108 | 0.0 ± 0.0 | 6.281 ± 0.396 | 1.256 ± 0.079 | 2.931 ± 0.425 | 1.256 ± 0.8 | 2.931 ± 2.458 | 2.931 ± 2.458 | 1.256 ± 0.079 | 2.094 ± 1.333 | 2.931 ± 1.146 | 2.513 ± 2.004 | 5.025 ± 0.404 | 0.0 ± 0.0 | 2.094 ± 0.108 | 0.0 ± 0.0 |
| Glu | 3.35 ± 2.134 | 2.513 ± 0.562 | 3.769 ± 0.959 | 2.513 ± 0.158 | 2.094 ± 0.613 | 2.513 ± 0.879 | 0.838 ± 0.533 | 2.094 ± 0.829 | 4.188 ± 1.946 | 3.35 ± 0.692 | 1.675 ± 0.346 | 2.513 ± 0.158 | 1.675 ± 0.346 | 1.675 ± 1.067 | 2.094 ± 1.333 | 2.094 ± 0.108 | 1.675 ± 1.067 | 4.606 ± 0.771 | 1.256 ± 0.8 | 4.188 ± 1.225 | 0.0 ± 0.0 |
| Phe | 4.188 ± 1.658 | 1.256 ± 0.642 | 4.188 ± 1.946 | 2.513 ± 0.879 | 3.769 ± 0.959 | 5.025 ± 0.317 | 1.256 ± 0.642 | 2.931 ± 1.146 | 2.094 ± 0.108 | 5.025 ± 0.317 | 1.256 ± 0.8 | 4.188 ± 0.216 | 1.675 ± 0.346 | 1.256 ± 0.642 | 1.675 ± 0.346 | 4.188 ± 1.225 | 4.188 ± 0.504 | 2.931 ± 1.146 | 0.838 ± 0.533 | 1.675 ± 0.346 | 0.0 ± 0.0 |
| Gly | 4.188 ± 3.1 | 0.0 ± 0.0 | 5.444 ± 0.858 | 1.675 ± 0.375 | 1.675 ± 1.067 | 2.513 ± 0.158 | 2.931 ± 1.146 | 2.094 ± 0.829 | 4.188 ± 1.946 | 3.769 ± 0.238 | 0.838 ± 0.908 | 3.769 ± 0.483 | 1.675 ± 0.346 | 1.675 ± 0.346 | 0.838 ± 0.187 | 4.606 ± 0.671 | 5.444 ± 1.579 | 5.863 ± 0.591 | 1.256 ± 0.079 | 1.256 ± 0.642 | 0.0 ± 0.0 |
| His | 0.419 ± 0.267 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.256 ± 0.8 | 0.419 ± 0.454 | 1.256 ± 0.079 | 0.838 ± 0.533 | 0.838 ± 0.533 | 2.513 ± 0.562 | 1.675 ± 0.375 | 0.419 ± 0.267 | 0.419 ± 0.267 | 2.094 ± 1.55 | 0.419 ± 0.454 | 1.256 ± 0.079 | 1.675 ± 1.067 | 0.838 ± 0.187 | 0.419 ± 0.267 | 0.838 ± 0.187 | 0.419 ± 0.267 | 0.0 ± 0.0 |
| Ile | 7.119 ± 0.512 | 0.419 ± 0.267 | 2.094 ± 0.829 | 2.931 ± 0.425 | 3.35 ± 0.692 | 3.769 ± 1.925 | 1.675 ± 0.346 | 3.35 ± 0.75 | 3.769 ± 0.959 | 5.025 ± 1.125 | 0.419 ± 0.267 | 3.35 ± 0.692 | 1.256 ± 0.642 | 2.094 ± 0.108 | 2.513 ± 0.879 | 4.188 ± 0.216 | 4.606 ± 2.112 | 2.094 ± 0.613 | 0.838 ± 0.533 | 1.256 ± 0.8 | 0.0 ± 0.0 |
| Lys | 3.769 ± 0.238 | 1.675 ± 1.067 | 3.769 ± 0.959 | 3.35 ± 2.134 | 4.188 ± 0.504 | 0.838 ± 0.187 | 0.419 ± 0.267 | 3.769 ± 0.483 | 4.606 ± 2.934 | 5.025 ± 1.038 | 0.838 ± 0.533 | 4.606 ± 0.05 | 2.513 ± 1.6 | 1.675 ± 1.096 | 4.606 ± 2.934 | 2.931 ± 0.296 | 2.513 ± 0.879 | 1.675 ± 1.067 | 1.675 ± 1.067 | 1.256 ± 0.079 | 0.0 ± 0.0 |
| Leu | 5.025 ± 1.038 | 2.931 ± 0.425 | 3.35 ± 0.75 | 6.7 ± 0.058 | 5.444 ± 1.305 | 3.35 ± 2.192 | 2.513 ± 0.562 | 2.513 ± 0.562 | 6.7 ± 2.825 | 7.119 ± 2.371 | 2.094 ± 0.829 | 5.444 ± 1.579 | 5.863 ± 0.13 | 4.188 ± 1.225 | 2.513 ± 0.562 | 5.863 ± 1.571 | 5.863 ± 0.13 | 5.444 ± 1.305 | 0.419 ± 0.454 | 2.094 ± 0.613 | 0.0 ± 0.0 |
| Met | 0.838 ± 0.187 | 0.838 ± 0.533 | 1.256 ± 0.8 | 0.419 ± 0.267 | 2.094 ± 0.829 | 1.675 ± 0.346 | 0.419 ± 0.454 | 0.419 ± 0.267 | 0.838 ± 0.533 | 2.931 ± 1.867 | 1.675 ± 1.096 | 0.419 ± 0.267 | 0.838 ± 0.187 | 0.0 ± 0.0 | 1.675 ± 0.346 | 2.931 ± 1.737 | 1.256 ± 0.8 | 2.094 ± 0.108 | 0.838 ± 0.187 | 1.256 ± 0.642 | 0.0 ± 0.0 |
| Asn | 3.769 ± 1.925 | 1.256 ± 0.8 | 0.838 ± 0.533 | 2.513 ± 1.6 | 3.35 ± 1.471 | 3.769 ± 0.483 | 0.838 ± 0.533 | 5.444 ± 0.858 | 4.606 ± 0.771 | 2.094 ± 0.108 | 0.838 ± 0.414 | 2.513 ± 2.004 | 4.606 ± 0.05 | 0.838 ± 0.187 | 0.838 ± 0.187 | 4.188 ± 1.658 | 4.606 ± 2.112 | 5.444 ± 0.137 | 1.675 ± 0.375 | 3.35 ± 0.692 | 0.0 ± 0.0 |
| Pro | 2.094 ± 0.829 | 0.419 ± 0.267 | 1.256 ± 0.8 | 1.675 ± 0.346 | 2.513 ± 0.158 | 2.094 ± 0.829 | 0.838 ± 0.533 | 2.094 ± 0.108 | 1.256 ± 0.079 | 6.281 ± 0.325 | 2.513 ± 0.794 | 0.838 ± 0.187 | 1.675 ± 0.375 | 2.094 ± 0.829 | 2.094 ± 0.613 | 3.35 ± 0.75 | 3.769 ± 2.646 | 3.769 ± 1.204 | 1.675 ± 0.346 | 2.513 ± 0.562 | 0.0 ± 0.0 |
| Gln | 1.675 ± 1.096 | 0.419 ± 0.267 | 0.838 ± 0.533 | 1.256 ± 0.079 | 0.838 ± 0.533 | 2.094 ± 1.333 | 0.419 ± 0.267 | 1.675 ± 0.346 | 3.769 ± 1.679 | 4.188 ± 1.225 | 0.838 ± 0.533 | 1.256 ± 0.642 | 0.838 ± 0.908 | 1.256 ± 0.8 | 2.094 ± 0.829 | 3.769 ± 0.959 | 2.513 ± 1.283 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.838 ± 0.187 | 0.0 ± 0.0 |
| Arg | 5.444 ± 0.584 | 0.0 ± 0.0 | 0.838 ± 0.187 | 2.931 ± 1.867 | 1.675 ± 0.375 | 3.769 ± 1.204 | 0.419 ± 0.267 | 2.094 ± 0.613 | 2.931 ± 1.146 | 3.35 ± 0.692 | 2.094 ± 0.829 | 1.675 ± 1.067 | 1.256 ± 0.079 | 0.419 ± 0.267 | 2.513 ± 0.158 | 2.094 ± 0.613 | 1.675 ± 1.096 | 2.931 ± 0.296 | 0.0 ± 0.0 | 3.769 ± 0.238 | 0.0 ± 0.0 |
| Ser | 7.119 ± 1.954 | 0.419 ± 0.267 | 5.025 ± 1.038 | 2.931 ± 0.296 | 5.025 ± 0.317 | 4.188 ± 0.216 | 0.838 ± 0.187 | 4.606 ± 0.05 | 3.35 ± 1.413 | 10.05 ± 0.087 | 0.838 ± 0.533 | 4.188 ± 0.504 | 3.35 ± 0.75 | 3.769 ± 0.959 | 3.769 ± 1.925 | 8.375 ± 1.154 | 5.444 ± 2.3 | 5.444 ± 3.741 | 0.419 ± 0.267 | 2.094 ± 0.108 | 0.0 ± 0.0 |
| Thr | 3.769 ± 1.204 | 0.419 ± 0.454 | 3.35 ± 1.471 | 2.931 ± 1.146 | 4.188 ± 0.504 | 3.769 ± 1.204 | 1.256 ± 1.362 | 4.606 ± 2.112 | 2.513 ± 1.283 | 6.281 ± 1.046 | 2.094 ± 0.829 | 6.281 ± 1.766 | 5.025 ± 1.125 | 2.094 ± 0.108 | 3.35 ± 0.75 | 6.7 ± 1.5 | 5.025 ± 2.566 | 5.863 ± 0.13 | 0.838 ± 0.908 | 3.769 ± 1.204 | 0.0 ± 0.0 |
| Val | 5.863 ± 1.312 | 2.094 ± 0.613 | 4.188 ± 0.937 | 4.606 ± 1.492 | 2.513 ± 0.158 | 3.35 ± 0.75 | 0.838 ± 0.533 | 3.769 ± 1.204 | 1.256 ± 0.8 | 6.281 ± 0.325 | 0.838 ± 0.533 | 5.444 ± 1.305 | 3.35 ± 0.75 | 2.513 ± 0.879 | 3.769 ± 0.238 | 4.188 ± 0.216 | 7.956 ± 0.7 | 5.863 ± 0.13 | 0.838 ± 0.533 | 2.094 ± 1.55 | 0.0 ± 0.0 |
| Trp | 0.0 ± 0.0 | 1.256 ± 0.8 | 1.256 ± 0.079 | 0.838 ± 0.533 | 0.838 ± 0.533 | 0.419 ± 0.267 | 0.0 ± 0.0 | 0.419 ± 0.454 | 0.0 ± 0.0 | 2.094 ± 0.829 | 0.0 ± 0.0 | 0.838 ± 0.187 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.256 ± 0.079 | 3.35 ± 0.75 | 0.0 ± 0.0 | 1.675 ± 1.067 | 0.838 ± 0.187 | 0.838 ± 0.187 | 0.0 ± 0.0 |
| Tyr | 4.606 ± 0.05 | 0.419 ± 0.267 | 2.513 ± 1.283 | 1.675 ± 0.375 | 2.513 ± 0.879 | 4.188 ± 1.225 | 0.838 ± 0.908 | 2.094 ± 1.333 | 0.419 ± 0.267 | 2.094 ± 0.613 | 1.256 ± 0.8 | 1.256 ± 0.8 | 1.256 ± 0.079 | 0.0 ± 0.0 | 0.838 ± 0.533 | 3.769 ± 1.925 | 3.35 ± 2.912 | 4.188 ± 0.216 | 0.419 ± 0.267 | 2.094 ± 0.829 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 2 proteins (2389 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
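A protein-level bootstrap can be sketched in Python as below. This is an illustration under stated assumptions, not the database's actual procedure: the helper names `permille` and `bootstrap_error` are hypothetical, proteins are resampled with replacement to the original proteome size, and the reported error is taken to be the standard deviation over the 100 replicates (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(resample, dipeptide))
    # Assumption: the error is the standard deviation of the replicates.
    return statistics.pstdev(estimates)


# Toy two-protein "proteome" (made up for illustration).
toy = ["MKTAYIAKMK", "AAAAGGG"]
err = bootstrap_error(toy, "MK")
print(round(permille(toy, "MK"), 3), "±", round(err, 3))
```

With only two proteins, as in this proteome, each replicate can contain just three distinct compositions (the first protein twice, the second twice, or one of each), so the estimate is necessarily coarse.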

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski