Amino acid dipepetide frequency for Hubei diptera virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.749AlaAla: 2.749 ± 1.136
0.55AlaCys: 0.55 ± 0.175
1.649AlaAsp: 1.649 ± 1.489
1.924AlaGlu: 1.924 ± 0.585
3.024AlaPhe: 3.024 ± 0.228
2.199AlaGly: 2.199 ± 0.423
0.825AlaHis: 0.825 ± 0.731
2.749AlaIle: 2.749 ± 0.477
3.024AlaLys: 3.024 ± 0.874
4.673AlaLeu: 4.673 ± 1.797
1.1AlaMet: 1.1 ± 0.363
2.749AlaAsn: 2.749 ± 2.062
1.924AlaPro: 1.924 ± 0.561
2.474AlaGln: 2.474 ± 1.199
1.924AlaArg: 1.924 ± 0.577
2.749AlaSer: 2.749 ± 0.747
2.199AlaThr: 2.199 ± 2.237
1.924AlaVal: 1.924 ± 0.414
0.825AlaTrp: 0.825 ± 0.502
2.474AlaTyr: 2.474 ± 1.264
0.0AlaXaa: 0.0 ± 0.0
Cys
1.374CysAla: 1.374 ± 0.895
0.275CysCys: 0.275 ± 0.252
1.1CysAsp: 1.1 ± 0.646
1.1CysGlu: 1.1 ± 0.646
1.374CysPhe: 1.374 ± 0.373
0.275CysGly: 0.275 ± 0.167
0.55CysHis: 0.55 ± 0.504
2.749CysIle: 2.749 ± 1.148
2.199CysLys: 2.199 ± 0.962
3.024CysLeu: 3.024 ± 0.226
1.1CysMet: 1.1 ± 0.363
0.275CysAsn: 0.275 ± 0.252
1.1CysPro: 1.1 ± 0.646
0.55CysGln: 0.55 ± 0.504
0.825CysArg: 0.825 ± 0.757
2.749CysSer: 2.749 ± 1.444
1.1CysThr: 1.1 ± 1.009
1.374CysVal: 1.374 ± 0.643
0.275CysTrp: 0.275 ± 0.252
1.924CysTyr: 1.924 ± 0.732
0.0CysXaa: 0.0 ± 0.0
Asp
1.924AspAla: 1.924 ± 0.839
2.749AspCys: 2.749 ± 1.142
5.772AspAsp: 5.772 ± 0.391
4.123AspGlu: 4.123 ± 0.789
2.199AspPhe: 2.199 ± 1.298
0.55AspGly: 0.55 ± 0.175
2.474AspHis: 2.474 ± 1.505
4.123AspIle: 4.123 ± 1.154
5.223AspLys: 5.223 ± 2.711
6.322AspLeu: 6.322 ± 1.713
0.825AspMet: 0.825 ± 0.231
1.924AspAsn: 1.924 ± 0.732
1.649AspPro: 1.649 ± 1.423
1.374AspGln: 1.374 ± 0.568
0.825AspArg: 0.825 ± 0.731
3.848AspSer: 3.848 ± 1.464
1.924AspThr: 1.924 ± 0.585
4.123AspVal: 4.123 ± 0.998
1.649AspTrp: 1.649 ± 0.676
1.924AspTyr: 1.924 ± 1.352
0.0AspXaa: 0.0 ± 0.0
Glu
3.573GluAla: 3.573 ± 1.087
0.825GluCys: 0.825 ± 0.231
4.948GluAsp: 4.948 ± 0.615
4.948GluGlu: 4.948 ± 0.411
4.948GluPhe: 4.948 ± 1.34
2.749GluGly: 2.749 ± 0.747
0.825GluHis: 0.825 ± 0.502
4.123GluIle: 4.123 ± 0.798
6.047GluLys: 6.047 ± 1.582
4.948GluLeu: 4.948 ± 1.34
0.55GluMet: 0.55 ± 0.334
3.848GluAsn: 3.848 ± 1.051
1.649GluPro: 1.649 ± 0.524
2.199GluGln: 2.199 ± 0.98
2.749GluArg: 2.749 ± 1.334
5.772GluSer: 5.772 ± 1.448
6.597GluThr: 6.597 ± 0.122
4.948GluVal: 4.948 ± 0.515
0.55GluTrp: 0.55 ± 0.334
3.024GluTyr: 3.024 ± 0.226
0.0GluXaa: 0.0 ± 0.0
Phe
2.474PheAla: 2.474 ± 0.877
1.1PheCys: 1.1 ± 0.646
3.848PheAsp: 3.848 ± 0.173
2.199PheGlu: 2.199 ± 1.003
2.474PhePhe: 2.474 ± 0.692
2.749PheGly: 2.749 ± 0.873
1.924PheHis: 1.924 ± 0.839
4.123PheIle: 4.123 ± 0.418
1.924PheLys: 1.924 ± 0.839
3.848PheLeu: 3.848 ± 0.905
1.374PheMet: 1.374 ± 0.564
3.299PheAsn: 3.299 ± 1.209
2.199PhePro: 2.199 ± 1.338
1.1PheGln: 1.1 ± 0.363
3.024PheArg: 3.024 ± 0.823
5.223PheSer: 5.223 ± 0.778
2.749PheThr: 2.749 ± 1.033
4.398PheVal: 4.398 ± 1.659
0.275PheTrp: 0.275 ± 0.167
1.374PheTyr: 1.374 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
2.199GlyAla: 2.199 ± 0.423
0.55GlyCys: 0.55 ± 0.175
3.024GlyAsp: 3.024 ± 0.946
2.199GlyGlu: 2.199 ± 0.727
4.398GlyPhe: 4.398 ± 0.732
1.1GlyGly: 1.1 ± 0.363
0.825GlyHis: 0.825 ± 0.4
2.199GlyIle: 2.199 ± 0.727
3.299GlyLys: 3.299 ± 0.437
3.848GlyLeu: 3.848 ± 0.173
0.825GlyMet: 0.825 ± 0.502
2.199GlyAsn: 2.199 ± 0.628
1.1GlyPro: 1.1 ± 0.349
0.825GlyGln: 0.825 ± 0.502
1.649GlyArg: 1.649 ± 0.539
4.123GlySer: 4.123 ± 1.289
1.924GlyThr: 1.924 ± 0.732
1.924GlyVal: 1.924 ± 0.585
0.0GlyTrp: 0.0 ± 0.0
2.474GlyTyr: 2.474 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
0.825HisAla: 0.825 ± 0.502
1.374HisCys: 1.374 ± 0.564
0.275HisAsp: 0.275 ± 0.167
1.649HisGlu: 1.649 ± 0.539
0.55HisPhe: 0.55 ± 0.175
1.374HisGly: 1.374 ± 0.516
0.55HisHis: 0.55 ± 0.334
1.374HisIle: 1.374 ± 0.516
2.199HisLys: 2.199 ± 0.366
2.474HisLeu: 2.474 ± 0.391
0.55HisMet: 0.55 ± 0.175
0.55HisAsn: 0.55 ± 0.175
0.55HisPro: 0.55 ± 0.334
1.1HisGln: 1.1 ± 0.363
0.825HisArg: 0.825 ± 0.502
1.649HisSer: 1.649 ± 0.737
1.1HisThr: 1.1 ± 0.669
0.55HisVal: 0.55 ± 0.175
0.275HisTrp: 0.275 ± 0.252
1.374HisTyr: 1.374 ± 0.373
0.0HisXaa: 0.0 ± 0.0
Ile
3.299IleAla: 3.299 ± 2.153
1.374IleCys: 1.374 ± 0.564
3.848IleAsp: 3.848 ± 1.051
5.498IleGlu: 5.498 ± 0.392
2.474IlePhe: 2.474 ± 0.903
3.848IleGly: 3.848 ± 0.537
1.649IleHis: 1.649 ± 0.737
6.322IleIle: 6.322 ± 0.219
9.896IleLys: 9.896 ± 0.971
7.971IleLeu: 7.971 ± 1.606
2.749IleMet: 2.749 ± 0.747
3.848IleAsn: 3.848 ± 0.655
2.474IlePro: 2.474 ± 0.692
3.573IleGln: 3.573 ± 1.525
3.024IleArg: 3.024 ± 0.226
6.872IleSer: 6.872 ± 0.986
3.299IleThr: 3.299 ± 0.906
3.299IleVal: 3.299 ± 0.349
0.55IleTrp: 0.55 ± 0.175
1.1IleTyr: 1.1 ± 0.786
0.0IleXaa: 0.0 ± 0.0
Lys
4.673LysAla: 4.673 ± 1.522
1.924LysCys: 1.924 ± 1.255
5.498LysAsp: 5.498 ± 0.787
6.597LysGlu: 6.597 ± 1.606
3.299LysPhe: 3.299 ± 0.923
2.474LysGly: 2.474 ± 0.703
0.55LysHis: 0.55 ± 0.786
5.498LysIle: 5.498 ± 1.978
5.772LysLys: 5.772 ± 0.816
7.147LysLeu: 7.147 ± 1.355
2.199LysMet: 2.199 ± 0.698
4.673LysAsn: 4.673 ± 1.522
3.573LysPro: 3.573 ± 1.314
2.749LysGln: 2.749 ± 1.136
6.047LysArg: 6.047 ± 1.47
6.322LysSer: 6.322 ± 1.74
3.299LysThr: 3.299 ± 0.906
4.398LysVal: 4.398 ± 1.124
1.649LysTrp: 1.649 ± 0.539
2.199LysTyr: 2.199 ± 0.728
0.0LysXaa: 0.0 ± 0.0
Leu
4.123LeuAla: 4.123 ± 2.146
1.924LeuCys: 1.924 ± 0.732
4.123LeuAsp: 4.123 ± 1.179
5.223LeuGlu: 5.223 ± 1.671
3.848LeuPhe: 3.848 ± 0.655
4.673LeuGly: 4.673 ± 0.605
1.649LeuHis: 1.649 ± 0.676
6.322LeuIle: 6.322 ± 0.565
7.971LeuLys: 7.971 ± 2.28
9.896LeuLeu: 9.896 ± 1.633
3.848LeuMet: 3.848 ± 1.821
6.322LeuAsn: 6.322 ± 0.947
3.024LeuPro: 3.024 ± 1.636
2.749LeuGln: 2.749 ± 0.196
4.123LeuArg: 4.123 ± 1.694
10.445LeuSer: 10.445 ± 0.917
4.398LeuThr: 4.398 ± 1.635
4.398LeuVal: 4.398 ± 1.66
0.55LeuTrp: 0.55 ± 0.334
1.924LeuTyr: 1.924 ± 0.585
0.0LeuXaa: 0.0 ± 0.0
Met
1.649MetAla: 1.649 ± 1.423
0.825MetCys: 0.825 ± 0.231
1.374MetAsp: 1.374 ± 0.373
2.199MetGlu: 2.199 ± 0.596
1.1MetPhe: 1.1 ± 0.349
1.924MetGly: 1.924 ± 0.839
0.275MetHis: 0.275 ± 0.167
2.199MetIle: 2.199 ± 0.366
2.749MetLys: 2.749 ± 0.477
2.199MetLeu: 2.199 ± 0.366
1.374MetMet: 1.374 ± 0.516
1.1MetAsn: 1.1 ± 0.363
0.825MetPro: 0.825 ± 0.502
1.649MetGln: 1.649 ± 1.003
0.825MetArg: 0.825 ± 0.744
1.924MetSer: 1.924 ± 0.414
2.199MetThr: 2.199 ± 0.727
1.1MetVal: 1.1 ± 0.349
0.0MetTrp: 0.0 ± 0.0
1.1MetTyr: 1.1 ± 0.363
0.0MetXaa: 0.0 ± 0.0
Asn
0.825AsnAla: 0.825 ± 0.231
1.924AsnCys: 1.924 ± 0.535
1.374AsnAsp: 1.374 ± 0.373
3.848AsnGlu: 3.848 ± 0.827
2.474AsnPhe: 2.474 ± 0.877
2.199AsnGly: 2.199 ± 1.003
1.924AsnHis: 1.924 ± 0.585
4.948AsnIle: 4.948 ± 1.571
4.673AsnLys: 4.673 ± 0.339
6.872AsnLeu: 6.872 ± 4.988
0.825AsnMet: 0.825 ± 0.502
2.199AsnAsn: 2.199 ± 1.298
2.749AsnPro: 2.749 ± 1.334
0.275AsnGln: 0.275 ± 0.167
3.299AsnArg: 3.299 ± 0.978
4.123AsnSer: 4.123 ± 0.816
2.474AsnThr: 2.474 ± 1.199
2.749AsnVal: 2.749 ± 1.142
0.825AsnTrp: 0.825 ± 0.502
2.474AsnTyr: 2.474 ± 0.692
0.0AsnXaa: 0.0 ± 0.0
Pro
0.825ProAla: 0.825 ± 0.744
0.55ProCys: 0.55 ± 0.504
3.024ProAsp: 3.024 ± 0.985
4.398ProGlu: 4.398 ± 1.256
3.024ProPhe: 3.024 ± 0.946
1.649ProGly: 1.649 ± 0.524
0.825ProHis: 0.825 ± 0.231
1.924ProIle: 1.924 ± 0.585
1.649ProLys: 1.649 ± 0.461
1.649ProLeu: 1.649 ± 1.003
0.825ProMet: 0.825 ± 0.938
0.825ProAsn: 0.825 ± 0.4
1.374ProPro: 1.374 ± 0.516
0.825ProGln: 0.825 ± 1.649
1.374ProArg: 1.374 ± 0.568
3.024ProSer: 3.024 ± 1.152
1.374ProThr: 1.374 ± 0.895
2.474ProVal: 2.474 ± 1.427
0.0ProTrp: 0.0 ± 0.0
1.374ProTyr: 1.374 ± 0.373
0.0ProXaa: 0.0 ± 0.0
Gln
1.374GlnAla: 1.374 ± 0.373
0.55GlnCys: 0.55 ± 0.504
1.374GlnAsp: 1.374 ± 0.516
3.024GlnGlu: 3.024 ± 0.226
1.649GlnPhe: 1.649 ± 0.539
1.374GlnGly: 1.374 ± 0.568
1.1GlnHis: 1.1 ± 0.669
4.123GlnIle: 4.123 ± 0.998
1.924GlnLys: 1.924 ± 0.577
1.924GlnLeu: 1.924 ± 0.839
1.649GlnMet: 1.649 ± 2.357
3.024GlnAsn: 3.024 ± 0.874
0.55GlnPro: 0.55 ± 0.786
1.374GlnGln: 1.374 ± 0.836
0.825GlnArg: 0.825 ± 0.231
2.749GlnSer: 2.749 ± 0.812
0.55GlnThr: 0.55 ± 0.334
2.474GlnVal: 2.474 ± 2.575
0.0GlnTrp: 0.0 ± 0.0
0.275GlnTyr: 0.275 ± 0.167
0.0GlnXaa: 0.0 ± 0.0
Arg
3.024ArgAla: 3.024 ± 1.262
0.825ArgCys: 0.825 ± 0.757
1.924ArgAsp: 1.924 ± 1.723
3.299ArgGlu: 3.299 ± 1.293
1.374ArgPhe: 1.374 ± 0.516
2.749ArgGly: 2.749 ± 1.136
0.275ArgHis: 0.275 ± 0.167
5.223ArgIle: 5.223 ± 0.778
3.573ArgLys: 3.573 ± 0.918
4.123ArgLeu: 4.123 ± 1.842
0.825ArgMet: 0.825 ± 0.231
2.474ArgAsn: 2.474 ± 0.391
1.374ArgPro: 1.374 ± 0.9
1.649ArgGln: 1.649 ± 0.539
2.474ArgArg: 2.474 ± 0.692
3.299ArgSer: 3.299 ± 0.906
2.749ArgThr: 2.749 ± 0.196
3.848ArgVal: 3.848 ± 1.17
0.0ArgTrp: 0.0 ± 0.0
1.649ArgTyr: 1.649 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
3.299SerAla: 3.299 ± 1.129
5.223SerCys: 5.223 ± 2.062
4.673SerAsp: 4.673 ± 1.391
6.322SerGlu: 6.322 ± 1.74
4.398SerPhe: 4.398 ± 0.611
3.024SerGly: 3.024 ± 1.075
2.474SerHis: 2.474 ± 0.903
6.047SerIle: 6.047 ± 1.86
6.597SerLys: 6.597 ± 2.156
8.521SerLeu: 8.521 ± 2.347
1.924SerMet: 1.924 ± 0.585
4.398SerAsn: 4.398 ± 1.794
1.374SerPro: 1.374 ± 0.373
2.474SerGln: 2.474 ± 0.924
3.299SerArg: 3.299 ± 1.129
7.971SerSer: 7.971 ± 0.72
7.422SerThr: 7.422 ± 1.084
5.772SerVal: 5.772 ± 0.704
1.1SerTrp: 1.1 ± 0.669
4.123SerTyr: 4.123 ± 1.704
0.0SerXaa: 0.0 ± 0.0
Thr
2.474ThrAla: 2.474 ± 3.116
1.1ThrCys: 1.1 ± 0.646
3.024ThrAsp: 3.024 ± 0.226
3.299ThrGlu: 3.299 ± 1.09
3.024ThrPhe: 3.024 ± 0.226
2.474ThrGly: 2.474 ± 1.101
0.825ThrHis: 0.825 ± 0.4
4.948ThrIle: 4.948 ± 0.816
3.299ThrLys: 3.299 ± 1.295
4.398ThrLeu: 4.398 ± 0.611
2.199ThrMet: 2.199 ± 0.833
2.749ThrAsn: 2.749 ± 0.873
1.924ThrPro: 1.924 ± 0.585
1.1ThrGln: 1.1 ± 0.363
3.024ThrArg: 3.024 ± 0.946
5.498ThrSer: 5.498 ± 1.313
4.948ThrThr: 4.948 ± 1.806
2.474ThrVal: 2.474 ± 0.703
0.0ThrTrp: 0.0 ± 0.0
1.924ThrTyr: 1.924 ± 0.414
0.0ThrXaa: 0.0 ± 0.0
Val
1.924ValAla: 1.924 ± 0.561
0.0ValCys: 0.0 ± 0.0
3.299ValAsp: 3.299 ± 1.786
4.948ValGlu: 4.948 ± 0.568
3.573ValPhe: 3.573 ± 1.278
2.199ValGly: 2.199 ± 0.423
0.55ValHis: 0.55 ± 0.786
4.673ValIle: 4.673 ± 1.594
4.673ValLys: 4.673 ± 0.573
4.398ValLeu: 4.398 ± 0.732
2.474ValMet: 2.474 ± 0.976
4.673ValAsn: 4.673 ± 0.701
0.825ValPro: 0.825 ± 0.502
2.199ValGln: 2.199 ± 1.292
4.398ValArg: 4.398 ± 1.238
6.597ValSer: 6.597 ± 2.89
1.924ValThr: 1.924 ± 0.577
2.749ValVal: 2.749 ± 0.873
0.275ValTrp: 0.275 ± 0.167
1.924ValTyr: 1.924 ± 0.895
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.825TrpGlu: 0.825 ± 0.231
0.825TrpPhe: 0.825 ± 0.231
0.0TrpGly: 0.0 ± 0.0
0.275TrpHis: 0.275 ± 0.167
1.1TrpIle: 1.1 ± 0.641
0.825TrpLys: 0.825 ± 0.502
1.1TrpLeu: 1.1 ± 0.363
0.275TrpMet: 0.275 ± 0.167
0.55TrpAsn: 0.55 ± 0.334
0.275TrpPro: 0.275 ± 0.252
0.0TrpGln: 0.0 ± 0.0
0.825TrpArg: 0.825 ± 0.231
1.374TrpSer: 1.374 ± 0.836
0.275TrpThr: 0.275 ± 0.167
1.374TrpVal: 1.374 ± 0.516
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.1TyrAla: 1.1 ± 0.713
1.649TyrCys: 1.649 ± 0.461
1.374TyrAsp: 1.374 ± 1.51
2.199TyrGlu: 2.199 ± 1.003
1.649TyrPhe: 1.649 ± 0.524
1.1TyrGly: 1.1 ± 0.786
0.825TyrHis: 0.825 ± 0.4
2.199TyrIle: 2.199 ± 0.628
3.024TyrLys: 3.024 ± 1.075
2.199TyrLeu: 2.199 ± 0.423
0.825TyrMet: 0.825 ± 0.502
1.374TyrAsn: 1.374 ± 0.516
2.474TyrPro: 2.474 ± 2.15
1.649TyrGln: 1.649 ± 1.489
1.374TyrArg: 1.374 ± 0.373
4.123TyrSer: 4.123 ± 0.418
2.474TyrThr: 2.474 ± 0.284
1.924TyrVal: 1.924 ± 0.732
1.1TyrTrp: 1.1 ± 0.349
1.924TyrTyr: 1.924 ± 0.585
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3639 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski