Amino acid dipeptide frequency for Hubei diptera virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
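The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names and the toy sequences are hypothetical, dipeptides are counted as overlapping pairs that never span two proteins, and frequencies are normalised by the total number of dipeptides. That normalisation appears consistent with the table, where the smallest non-zero value, 0.368‰, is approximately 1/2717 (2719 residues minus one per protein).

```python
from collections import Counter
from itertools import product

# Three-letter codes in the order used by the table columns.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
NAMES = list(ONE_TO_THREE.values()) + ["Xaa"]


def three_letter(residue):
    """Map a one-letter code to its three-letter code; anything else is Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_counts(sequences):
    """Count overlapping dipeptides within each protein sequence."""
    counts = Counter()
    for seq in sequences:
        codes = [three_letter(r) for r in seq]
        # Adjacent pairs only; pairs never span two different proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    return counts


def dipeptide_permille(sequences):
    """Return all 441 dipeptide frequencies in per mille (assumed normalisation)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(NAMES, repeat=2)
    }


# Hypothetical toy input, not the real Hubei diptera virus 1 proteome.
toy = ["MKLLA", "AAXK"]
freqs = dipeptide_permille(toy)
```

With real data, `sequences` would hold the proteome's protein sequences in one-letter code; the resulting dictionary has one entry per cell of the 21 × 21 table.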

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; on this scale the uniform expectation of 0.25% corresponds to 2.5‰.
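As a small worked example of the unit conversion, the sketch below turns table values from per mille into fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical, and the page does not state the exact criterion behind its red and blue marking, so the simple above/below comparison here is an assumption.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille (0.25%).
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a per mille value relative to the expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: LeuLys and AlaAla.
leu_lys = 9.198
ala_ala = 2.208
labels = {"LeuLys": classify(leu_lys), "AlaAla": classify(ala_ala)}
```

Under this assumed rule, LeuLys (9.198‰) counts as overrepresented and AlaAla (2.208‰) as underrepresented.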

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.208 ± 0.009
AlaCys: 1.104 ± 0.004
AlaAsp: 2.575 ± 0.372
AlaGlu: 2.943 ± 1.818
AlaPhe: 1.84 ± 0.354
AlaGly: 2.575 ± 0.372
AlaHis: 0.736 ± 0.184
AlaIle: 2.943 ± 0.349
AlaLys: 3.311 ± 0.528
AlaLeu: 5.887 ± 0.157
AlaMet: 1.472 ± 0.643
AlaAsn: 3.679 ± 1.46
AlaPro: 2.575 ± 0.17
AlaGln: 3.311 ± 1.097
AlaArg: 1.104 ± 0.004
AlaSer: 4.047 ± 1.823
AlaThr: 2.943 ± 0.734
AlaVal: 2.208 ± 0.551
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.208 ± 0.533
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.736 ± 0.358
CysCys: 0.0 ± 0.0
CysAsp: 0.368 ± 0.179
CysGlu: 1.84 ± 0.354
CysPhe: 1.104 ± 0.004
CysGly: 1.472 ± 0.175
CysHis: 0.0 ± 0.0
CysIle: 1.104 ± 0.004
CysLys: 1.104 ± 0.004
CysLeu: 0.368 ± 0.179
CysMet: 0.368 ± 0.363
CysAsn: 1.104 ± 0.004
CysPro: 1.104 ± 0.546
CysGln: 0.0 ± 0.0
CysArg: 0.736 ± 0.358
CysSer: 0.736 ± 0.358
CysThr: 0.0 ± 0.0
CysVal: 0.368 ± 0.363
CysTrp: 0.0 ± 0.0
CysTyr: 1.104 ± 0.537
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.84 ± 0.896
AspCys: 1.472 ± 0.716
AspAsp: 2.943 ± 0.349
AspGlu: 4.047 ± 0.887
AspPhe: 3.679 ± 1.249
AspGly: 1.84 ± 0.896
AspHis: 0.736 ± 0.358
AspIle: 5.519 ± 0.519
AspLys: 2.575 ± 0.712
AspLeu: 4.415 ± 0.524
AspMet: 0.368 ± 0.363
AspAsn: 5.151 ± 0.34
AspPro: 3.311 ± 1.097
AspGln: 1.104 ± 0.004
AspArg: 0.368 ± 0.179
AspSer: 2.575 ± 0.17
AspThr: 2.575 ± 0.372
AspVal: 2.943 ± 0.349
AspTrp: 0.368 ± 0.179
AspTyr: 1.472 ± 0.175
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.887 ± 2.011
GluCys: 0.368 ± 0.179
GluAsp: 2.943 ± 0.891
GluGlu: 3.679 ± 1.791
GluPhe: 2.943 ± 1.433
GluGly: 4.047 ± 0.345
GluHis: 2.208 ± 0.533
GluIle: 4.047 ± 0.197
GluLys: 5.519 ± 0.519
GluLeu: 3.311 ± 1.07
GluMet: 2.575 ± 0.17
GluAsn: 3.679 ± 1.249
GluPro: 0.736 ± 0.358
GluGln: 3.679 ± 1.791
GluArg: 1.104 ± 0.004
GluSer: 3.679 ± 0.708
GluThr: 2.943 ± 0.193
GluVal: 3.311 ± 1.639
GluTrp: 0.368 ± 0.363
GluTyr: 0.368 ± 0.179
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.472 ± 0.367
PheCys: 0.0 ± 0.0
PheAsp: 1.104 ± 0.537
PheGlu: 2.208 ± 1.075
PhePhe: 0.736 ± 0.184
PheGly: 2.575 ± 0.914
PheHis: 1.104 ± 1.088
PheIle: 2.943 ± 0.891
PheLys: 4.047 ± 0.345
PheLeu: 1.84 ± 0.354
PheMet: 1.104 ± 0.004
PheAsn: 3.679 ± 1.249
PhePro: 1.472 ± 0.909
PheGln: 1.104 ± 1.088
PheArg: 1.104 ± 0.004
PheSer: 5.887 ± 1.24
PheThr: 1.84 ± 0.188
PheVal: 5.151 ± 0.34
PheTrp: 1.472 ± 0.175
PheTyr: 1.84 ± 0.896
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.472 ± 0.367
GlyCys: 0.736 ± 0.184
GlyAsp: 2.943 ± 0.891
GlyGlu: 3.311 ± 0.013
GlyPhe: 2.943 ± 0.734
GlyGly: 2.575 ± 0.914
GlyHis: 0.368 ± 0.179
GlyIle: 5.151 ± 0.202
GlyLys: 4.415 ± 1.608
GlyLeu: 3.679 ± 0.376
GlyMet: 2.208 ± 0.009
GlyAsn: 2.208 ± 1.075
GlyPro: 1.104 ± 0.537
GlyGln: 3.311 ± 1.639
GlyArg: 1.104 ± 0.546
GlySer: 6.255 ± 3.457
GlyThr: 2.943 ± 0.734
GlyVal: 3.311 ± 1.639
GlyTrp: 0.368 ± 0.363
GlyTyr: 3.311 ± 0.528
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.368 ± 0.363
HisCys: 0.368 ± 0.363
HisAsp: 0.368 ± 0.179
HisGlu: 1.84 ± 0.354
HisPhe: 1.472 ± 0.716
HisGly: 0.736 ± 0.358
HisHis: 0.0 ± 0.0
HisIle: 2.575 ± 0.17
HisLys: 2.575 ± 0.17
HisLeu: 1.472 ± 0.716
HisMet: 1.104 ± 0.537
HisAsn: 1.104 ± 0.537
HisPro: 0.736 ± 0.184
HisGln: 0.368 ± 0.179
HisArg: 1.472 ± 0.716
HisSer: 1.104 ± 0.537
HisThr: 1.104 ± 1.088
HisVal: 1.472 ± 0.367
HisTrp: 0.368 ± 0.179
HisTyr: 0.736 ± 0.184
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.519 ± 1.106
IleCys: 1.472 ± 0.175
IleAsp: 8.462 ± 0.869
IleGlu: 5.519 ± 1.603
IlePhe: 3.679 ± 0.708
IleGly: 4.783 ± 0.161
IleHis: 1.472 ± 0.175
IleIle: 5.519 ± 1.061
IleLys: 4.415 ± 1.066
IleLeu: 4.783 ± 1.787
IleMet: 1.84 ± 0.354
IleAsn: 5.887 ± 0.157
IlePro: 4.047 ± 1.281
IleGln: 3.311 ± 1.07
IleArg: 1.472 ± 0.716
IleSer: 8.462 ± 0.215
IleThr: 5.519 ± 1.648
IleVal: 4.047 ± 0.887
IleTrp: 0.736 ± 0.725
IleTyr: 5.519 ± 0.022
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.104 ± 0.004
LysCys: 1.104 ± 0.537
LysAsp: 3.311 ± 1.07
LysGlu: 3.311 ± 0.528
LysPhe: 2.943 ± 0.891
LysGly: 2.575 ± 0.712
LysHis: 3.679 ± 1.249
LysIle: 7.726 ± 1.052
LysLys: 5.519 ± 1.603
LysLeu: 8.462 ± 0.869
LysMet: 2.575 ± 0.17
LysAsn: 4.783 ± 1.787
LysPro: 2.208 ± 0.551
LysGln: 1.84 ± 0.896
LysArg: 2.208 ± 0.533
LysSer: 2.943 ± 0.891
LysThr: 4.783 ± 1.245
LysVal: 5.887 ± 1.24
LysTrp: 0.368 ± 0.179
LysTyr: 3.311 ± 0.528
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.679 ± 0.376
LeuCys: 1.104 ± 0.004
LeuAsp: 4.415 ± 1.066
LeuGlu: 1.84 ± 0.354
LeuPhe: 4.047 ± 0.345
LeuGly: 4.047 ± 0.197
LeuHis: 3.679 ± 1.249
LeuIle: 6.255 ± 1.961
LeuLys: 9.198 ± 2.311
LeuLeu: 7.358 ± 1.415
LeuMet: 1.104 ± 0.537
LeuAsn: 5.887 ± 0.157
LeuPro: 3.679 ± 2.002
LeuGln: 3.679 ± 0.166
LeuArg: 4.047 ± 0.739
LeuSer: 5.151 ± 1.424
LeuThr: 4.783 ± 0.381
LeuVal: 4.415 ± 1.102
LeuTrp: 1.84 ± 0.188
LeuTyr: 4.047 ± 0.887
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.311 ± 0.013
MetCys: 1.104 ± 0.537
MetAsp: 1.104 ± 0.004
MetGlu: 1.104 ± 0.004
MetPhe: 1.104 ± 0.004
MetGly: 1.84 ± 0.73
MetHis: 1.472 ± 0.175
MetIle: 3.679 ± 0.918
MetLys: 2.208 ± 1.075
MetLeu: 1.84 ± 0.354
MetMet: 0.736 ± 0.358
MetAsn: 1.472 ± 0.175
MetPro: 0.368 ± 0.179
MetGln: 0.736 ± 0.358
MetArg: 1.104 ± 0.004
MetSer: 2.208 ± 0.551
MetThr: 2.208 ± 0.551
MetVal: 0.368 ± 0.179
MetTrp: 0.0 ± 0.0
MetTyr: 1.472 ± 0.175
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.151 ± 0.743
AsnCys: 1.472 ± 0.367
AsnAsp: 5.151 ± 0.882
AsnGlu: 5.151 ± 0.34
AsnPhe: 1.472 ± 0.175
AsnGly: 3.679 ± 0.376
AsnHis: 0.736 ± 0.358
AsnIle: 7.726 ± 1.594
AsnLys: 4.783 ± 1.787
AsnLeu: 5.151 ± 0.202
AsnMet: 1.84 ± 0.354
AsnAsn: 4.415 ± 1.608
AsnPro: 3.311 ± 0.013
AsnGln: 1.104 ± 0.004
AsnArg: 1.84 ± 0.354
AsnSer: 5.887 ± 1.24
AsnThr: 5.151 ± 1.285
AsnVal: 1.472 ± 1.451
AsnTrp: 1.472 ± 0.175
AsnTyr: 5.887 ± 0.157
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.104 ± 0.537
ProCys: 0.368 ± 0.363
ProAsp: 1.104 ± 0.004
ProGlu: 3.311 ± 0.528
ProPhe: 1.472 ± 0.716
ProGly: 1.84 ± 0.188
ProHis: 0.368 ± 0.363
ProIle: 3.311 ± 0.528
ProLys: 1.104 ± 0.004
ProLeu: 5.519 ± 0.564
ProMet: 1.472 ± 0.567
ProAsn: 1.104 ± 0.004
ProPro: 0.368 ± 0.363
ProGln: 1.472 ± 0.909
ProArg: 0.368 ± 0.363
ProSer: 4.047 ± 1.281
ProThr: 4.415 ± 1.644
ProVal: 2.575 ± 1.997
ProTrp: 0.368 ± 0.363
ProTyr: 4.783 ± 2.548
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.84 ± 1.272
GlnCys: 0.736 ± 0.358
GlnAsp: 1.472 ± 0.175
GlnGlu: 1.104 ± 0.537
GlnPhe: 1.84 ± 0.73
GlnGly: 2.943 ± 0.193
GlnHis: 0.0 ± 0.0
GlnIle: 3.679 ± 0.166
GlnLys: 3.679 ± 0.708
GlnLeu: 2.943 ± 0.349
GlnMet: 0.736 ± 0.184
GlnAsn: 0.368 ± 0.363
GlnPro: 1.472 ± 0.716
GlnGln: 1.84 ± 0.188
GlnArg: 0.736 ± 0.358
GlnSer: 2.575 ± 0.17
GlnThr: 1.84 ± 1.272
GlnVal: 5.887 ± 0.157
GlnTrp: 1.472 ± 0.716
GlnTyr: 1.104 ± 0.546
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.104 ± 0.004
ArgCys: 0.368 ± 0.179
ArgAsp: 0.368 ± 0.179
ArgGlu: 1.104 ± 0.537
ArgPhe: 1.104 ± 0.004
ArgGly: 2.208 ± 0.551
ArgHis: 0.736 ± 0.184
ArgIle: 1.84 ± 0.188
ArgLys: 2.575 ± 1.254
ArgLeu: 3.679 ± 0.376
ArgMet: 0.736 ± 0.358
ArgAsn: 3.311 ± 0.528
ArgPro: 1.472 ± 0.175
ArgGln: 1.84 ± 0.188
ArgArg: 1.104 ± 0.004
ArgSer: 1.472 ± 0.367
ArgThr: 2.208 ± 0.009
ArgVal: 1.84 ± 0.354
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.104 ± 0.537
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.783 ± 0.161
SerCys: 0.736 ± 0.725
SerAsp: 2.943 ± 0.193
SerGlu: 4.783 ± 0.161
SerPhe: 2.943 ± 0.193
SerGly: 5.519 ± 1.648
SerHis: 0.368 ± 0.363
SerIle: 6.99 ± 0.39
SerLys: 4.047 ± 0.345
SerLeu: 5.887 ± 1.782
SerMet: 3.679 ± 0.918
SerAsn: 8.83 ± 2.745
SerPro: 4.047 ± 0.739
SerGln: 2.575 ± 0.17
SerArg: 2.575 ± 0.712
SerSer: 5.887 ± 3.636
SerThr: 5.151 ± 3.453
SerVal: 2.575 ± 0.17
SerTrp: 0.368 ± 0.363
SerTyr: 3.311 ± 1.097
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.943 ± 0.193
ThrCys: 0.0 ± 0.0
ThrAsp: 1.84 ± 0.188
ThrGlu: 3.311 ± 0.555
ThrPhe: 4.047 ± 2.364
ThrGly: 2.943 ± 1.818
ThrHis: 0.736 ± 0.184
ThrIle: 4.783 ± 0.703
ThrLys: 2.575 ± 0.17
ThrLeu: 4.783 ± 0.923
ThrMet: 2.208 ± 0.551
ThrAsn: 6.623 ± 0.027
ThrPro: 3.679 ± 1.46
ThrGln: 3.679 ± 0.166
ThrArg: 1.84 ± 0.73
ThrSer: 4.783 ± 2.548
ThrThr: 4.783 ± 2.006
ThrVal: 3.311 ± 2.181
ThrTrp: 0.368 ± 0.179
ThrTyr: 3.311 ± 1.07
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.208 ± 1.635
ValCys: 1.104 ± 0.537
ValAsp: 1.84 ± 1.272
ValGlu: 3.311 ± 0.555
ValPhe: 1.84 ± 0.354
ValGly: 3.311 ± 1.097
ValHis: 0.736 ± 0.358
ValIle: 6.255 ± 0.206
ValLys: 4.047 ± 0.345
ValLeu: 5.519 ± 0.564
ValMet: 1.472 ± 0.175
ValAsn: 4.783 ± 0.381
ValPro: 3.311 ± 2.723
ValGln: 1.472 ± 0.175
ValArg: 2.943 ± 0.349
ValSer: 4.783 ± 2.548
ValThr: 3.311 ± 0.555
ValVal: 3.679 ± 0.918
ValTrp: 0.736 ± 0.184
ValTyr: 1.104 ± 0.537
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.736 ± 0.184
TrpCys: 0.0 ± 0.0
TrpAsp: 1.472 ± 0.716
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.104 ± 0.004
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.472 ± 0.367
TrpLys: 0.368 ± 0.179
TrpLeu: 1.104 ± 0.537
TrpMet: 1.104 ± 0.004
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.368 ± 0.363
TrpArg: 0.736 ± 0.184
TrpSer: 1.472 ± 1.451
TrpThr: 0.736 ± 0.358
TrpVal: 0.368 ± 0.179
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.736 ± 0.358
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.208 ± 1.075
TyrCys: 0.0 ± 0.0
TyrAsp: 2.575 ± 0.712
TyrGlu: 3.679 ± 0.708
TyrPhe: 0.368 ± 0.363
TyrGly: 2.208 ± 0.009
TyrHis: 2.208 ± 0.533
TyrIle: 2.943 ± 0.734
TyrLys: 2.575 ± 0.712
TyrLeu: 6.255 ± 0.206
TyrMet: 0.368 ± 0.179
TyrAsn: 4.783 ± 0.703
TyrPro: 1.84 ± 0.354
TyrGln: 1.472 ± 0.175
TyrArg: 2.208 ± 0.533
TyrSer: 4.047 ± 1.823
TyrThr: 3.311 ± 1.07
TyrVal: 2.575 ± 1.455
TyrTrp: 1.104 ± 0.537
TyrTyr: 2.943 ± 0.193
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2719 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
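The page does not describe the bootstrap procedure in detail. One common reading of "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each replicate, and report the spread of those estimates. The sketch below follows that reading; the function names, the use of the population standard deviation, and the fixed seed are assumptions made for illustration, not the database's actual implementation.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes, e.g. "LK") in per mille."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (assumed procedure).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With only two proteins, as in this proteome, each replicate is one of just a few possible combinations, so errors estimated this way are coarse; identical proteins would give an error of exactly zero.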

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski