Amino acid dipepetide frequency for Wuhan Louse Fly Virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.634AlaAla: 9.634 ± 6.662
0.771AlaCys: 0.771 ± 0.757
2.697AlaAsp: 2.697 ± 0.154
8.478AlaGlu: 8.478 ± 3.844
2.312AlaPhe: 2.312 ± 1.093
4.624AlaGly: 4.624 ± 0.057
2.697AlaHis: 2.697 ± 0.154
4.624AlaIle: 4.624 ± 0.057
3.854AlaLys: 3.854 ± 0.421
10.405AlaLeu: 10.405 ± 1.811
3.083AlaMet: 3.083 ± 1.448
3.083AlaAsn: 3.083 ± 0.336
5.78AlaPro: 5.78 ± 0.49
3.083AlaGln: 3.083 ± 4.151
5.01AlaArg: 5.01 ± 2.118
4.239AlaSer: 4.239 ± 0.239
3.083AlaThr: 3.083 ± 0.786
5.395AlaVal: 5.395 ± 0.814
2.697AlaTrp: 2.697 ± 1.275
4.239AlaTyr: 4.239 ± 0.239
0.0AlaXaa: 0.0 ± 0.0
Cys
1.156CysAla: 1.156 ± 0.575
0.0CysCys: 0.0 ± 0.0
1.927CysAsp: 1.927 ± 0.911
1.541CysGlu: 1.541 ± 0.729
0.385CysPhe: 0.385 ± 0.182
1.156CysGly: 1.156 ± 0.575
0.771CysHis: 0.771 ± 0.364
0.0CysIle: 0.0 ± 0.0
1.156CysLys: 1.156 ± 0.547
3.468CysLeu: 3.468 ± 1.64
0.385CysMet: 0.385 ± 0.182
0.771CysAsn: 0.771 ± 0.757
0.0CysPro: 0.0 ± 0.0
0.385CysGln: 0.385 ± 0.182
1.156CysArg: 1.156 ± 0.575
1.156CysSer: 1.156 ± 0.575
0.771CysThr: 0.771 ± 0.757
0.771CysVal: 0.771 ± 0.364
0.771CysTrp: 0.771 ± 0.757
0.771CysTyr: 0.771 ± 0.364
0.0CysXaa: 0.0 ± 0.0
Asp
4.239AspAla: 4.239 ± 0.882
0.385AspCys: 0.385 ± 0.182
2.697AspAsp: 2.697 ± 1.275
3.468AspGlu: 3.468 ± 2.847
1.541AspPhe: 1.541 ± 0.393
2.697AspGly: 2.697 ± 0.968
1.927AspHis: 1.927 ± 0.911
1.541AspIle: 1.541 ± 0.729
0.771AspLys: 0.771 ± 0.364
6.551AspLeu: 6.551 ± 1.389
1.156AspMet: 1.156 ± 0.575
1.541AspAsn: 1.541 ± 1.515
3.083AspPro: 3.083 ± 0.336
0.771AspGln: 0.771 ± 0.364
4.624AspArg: 4.624 ± 0.057
1.927AspSer: 1.927 ± 0.911
3.083AspThr: 3.083 ± 0.786
1.927AspVal: 1.927 ± 0.211
0.385AspTrp: 0.385 ± 0.182
3.468AspTyr: 3.468 ± 1.725
0.0AspXaa: 0.0 ± 0.0
Glu
3.083GluAla: 3.083 ± 0.786
1.541GluCys: 1.541 ± 0.393
2.312GluAsp: 2.312 ± 1.093
4.624GluGlu: 4.624 ± 0.057
2.697GluPhe: 2.697 ± 0.154
2.697GluGly: 2.697 ± 0.154
2.697GluHis: 2.697 ± 1.275
0.771GluIle: 0.771 ± 0.364
1.541GluLys: 1.541 ± 0.393
7.707GluLeu: 7.707 ± 1.401
2.697GluMet: 2.697 ± 0.968
0.385GluAsn: 0.385 ± 0.182
1.927GluPro: 1.927 ± 0.911
4.239GluGln: 4.239 ± 0.239
2.697GluArg: 2.697 ± 2.09
4.624GluSer: 4.624 ± 2.186
4.624GluThr: 4.624 ± 0.057
5.78GluVal: 5.78 ± 1.611
1.156GluTrp: 1.156 ± 0.575
3.854GluTyr: 3.854 ± 2.665
0.0GluXaa: 0.0 ± 0.0
Phe
1.541PheAla: 1.541 ± 0.729
1.156PheCys: 1.156 ± 0.547
1.156PheAsp: 1.156 ± 0.547
1.541PheGlu: 1.541 ± 0.393
0.771PhePhe: 0.771 ± 0.364
2.697PheGly: 2.697 ± 0.968
1.541PheHis: 1.541 ± 0.393
2.312PheIle: 2.312 ± 1.093
1.541PheLys: 1.541 ± 0.393
3.854PheLeu: 3.854 ± 0.7
0.0PheMet: 0.0 ± 0.0
0.771PheAsn: 0.771 ± 0.364
2.697PhePro: 2.697 ± 2.09
0.771PheGln: 0.771 ± 0.364
2.312PheArg: 2.312 ± 1.093
1.156PheSer: 1.156 ± 0.547
1.927PheThr: 1.927 ± 0.211
2.697PheVal: 2.697 ± 1.275
0.0PheTrp: 0.0 ± 0.0
1.156PheTyr: 1.156 ± 0.547
0.0PheXaa: 0.0 ± 0.0
Gly
6.551GlyAla: 6.551 ± 0.268
0.385GlyCys: 0.385 ± 0.182
1.927GlyAsp: 1.927 ± 1.332
3.083GlyGlu: 3.083 ± 1.458
2.312GlyPhe: 2.312 ± 0.029
3.468GlyGly: 3.468 ± 1.725
2.312GlyHis: 2.312 ± 0.029
3.854GlyIle: 3.854 ± 1.543
2.697GlyLys: 2.697 ± 0.154
5.78GlyLeu: 5.78 ± 0.632
0.385GlyMet: 0.385 ± 0.94
1.927GlyAsn: 1.927 ± 0.211
1.541GlyPro: 1.541 ± 2.636
2.312GlyGln: 2.312 ± 1.093
3.083GlyArg: 3.083 ± 0.336
1.156GlySer: 1.156 ± 0.547
2.312GlyThr: 2.312 ± 0.029
3.468GlyVal: 3.468 ± 0.518
1.927GlyTrp: 1.927 ± 0.211
1.156GlyTyr: 1.156 ± 0.575
0.0GlyXaa: 0.0 ± 0.0
His
2.312HisAla: 2.312 ± 1.093
0.0HisCys: 0.0 ± 0.0
1.156HisAsp: 1.156 ± 0.547
1.156HisGlu: 1.156 ± 0.547
1.156HisPhe: 1.156 ± 0.547
2.312HisGly: 2.312 ± 1.093
1.156HisHis: 1.156 ± 0.547
2.697HisIle: 2.697 ± 0.154
1.927HisLys: 1.927 ± 0.211
1.927HisLeu: 1.927 ± 0.211
1.541HisMet: 1.541 ± 0.729
2.697HisAsn: 2.697 ± 0.154
2.312HisPro: 2.312 ± 1.093
1.927HisGln: 1.927 ± 0.911
1.541HisArg: 1.541 ± 0.393
2.312HisSer: 2.312 ± 1.093
1.156HisThr: 1.156 ± 0.547
1.541HisVal: 1.541 ± 0.393
0.0HisTrp: 0.0 ± 0.0
2.697HisTyr: 2.697 ± 0.154
0.0HisXaa: 0.0 ± 0.0
Ile
3.468IleAla: 3.468 ± 2.847
0.0IleCys: 0.0 ± 0.0
3.468IleAsp: 3.468 ± 1.725
5.01IleGlu: 5.01 ± 1.247
0.771IlePhe: 0.771 ± 0.364
3.854IleGly: 3.854 ± 0.7
2.312IleHis: 2.312 ± 1.093
3.083IleIle: 3.083 ± 1.458
2.312IleLys: 2.312 ± 1.093
5.78IleLeu: 5.78 ± 1.611
0.385IleMet: 0.385 ± 0.182
0.385IleAsn: 0.385 ± 0.182
2.697IlePro: 2.697 ± 0.154
2.697IleGln: 2.697 ± 1.275
3.854IleArg: 3.854 ± 1.822
1.927IleSer: 1.927 ± 0.911
3.854IleThr: 3.854 ± 0.421
4.624IleVal: 4.624 ± 1.065
1.156IleTrp: 1.156 ± 1.697
3.468IleTyr: 3.468 ± 1.64
0.0IleXaa: 0.0 ± 0.0
Lys
1.156LysAla: 1.156 ± 0.547
0.771LysCys: 0.771 ± 0.364
1.927LysAsp: 1.927 ± 0.211
2.312LysGlu: 2.312 ± 0.029
1.156LysPhe: 1.156 ± 0.547
0.771LysGly: 0.771 ± 0.364
1.156LysHis: 1.156 ± 0.547
3.468LysIle: 3.468 ± 0.518
1.156LysLys: 1.156 ± 0.575
3.083LysLeu: 3.083 ± 1.458
0.385LysMet: 0.385 ± 0.94
0.385LysAsn: 0.385 ± 0.182
1.927LysPro: 1.927 ± 0.911
0.771LysGln: 0.771 ± 0.364
1.541LysArg: 1.541 ± 0.393
2.697LysSer: 2.697 ± 1.275
2.312LysThr: 2.312 ± 1.15
3.854LysVal: 3.854 ± 1.822
0.771LysTrp: 0.771 ± 0.364
0.771LysTyr: 0.771 ± 0.364
0.0LysXaa: 0.0 ± 0.0
Leu
10.405LeuAla: 10.405 ± 1.811
4.239LeuCys: 4.239 ± 0.239
7.322LeuAsp: 7.322 ± 0.097
4.624LeuGlu: 4.624 ± 0.057
4.239LeuPhe: 4.239 ± 0.239
1.541LeuGly: 1.541 ± 0.729
5.395LeuHis: 5.395 ± 0.307
5.01LeuIle: 5.01 ± 0.125
1.927LeuLys: 1.927 ± 0.911
10.405LeuLeu: 10.405 ± 0.432
2.312LeuMet: 2.312 ± 0.029
3.468LeuAsn: 3.468 ± 0.518
7.707LeuPro: 7.707 ± 0.843
4.239LeuGln: 4.239 ± 0.239
9.249LeuArg: 9.249 ± 1.236
4.624LeuSer: 4.624 ± 2.186
9.249LeuThr: 9.249 ± 2.129
6.936LeuVal: 6.936 ± 0.086
1.541LeuTrp: 1.541 ± 0.729
2.697LeuTyr: 2.697 ± 1.275
0.0LeuXaa: 0.0 ± 0.0
Met
1.541MetAla: 1.541 ± 0.393
1.541MetCys: 1.541 ± 0.729
0.385MetAsp: 0.385 ± 0.182
0.771MetGlu: 0.771 ± 0.364
1.156MetPhe: 1.156 ± 0.575
0.771MetGly: 0.771 ± 0.364
0.0MetHis: 0.0 ± 0.0
1.541MetIle: 1.541 ± 0.393
0.771MetLys: 0.771 ± 0.364
1.927MetLeu: 1.927 ± 0.911
1.541MetMet: 1.541 ± 1.515
0.771MetAsn: 0.771 ± 0.757
0.385MetPro: 0.385 ± 0.182
0.771MetGln: 0.771 ± 0.757
3.854MetArg: 3.854 ± 3.787
1.156MetSer: 1.156 ± 0.547
2.697MetThr: 2.697 ± 0.154
2.312MetVal: 2.312 ± 1.15
0.385MetTrp: 0.385 ± 0.182
0.771MetTyr: 0.771 ± 0.364
0.0MetXaa: 0.0 ± 0.0
Asn
2.697AsnAla: 2.697 ± 0.968
0.385AsnCys: 0.385 ± 0.94
0.385AsnAsp: 0.385 ± 0.94
1.156AsnGlu: 1.156 ± 0.547
0.771AsnPhe: 0.771 ± 0.364
0.771AsnGly: 0.771 ± 0.757
0.385AsnHis: 0.385 ± 0.182
2.697AsnIle: 2.697 ± 1.275
0.385AsnLys: 0.385 ± 0.182
3.083AsnLeu: 3.083 ± 0.336
1.156AsnMet: 1.156 ± 0.547
0.771AsnAsn: 0.771 ± 0.364
2.312AsnPro: 2.312 ± 0.029
1.541AsnGln: 1.541 ± 0.393
1.927AsnArg: 1.927 ± 1.332
1.156AsnSer: 1.156 ± 0.547
1.927AsnThr: 1.927 ± 0.911
2.312AsnVal: 2.312 ± 1.15
0.771AsnTrp: 0.771 ± 0.757
0.771AsnTyr: 0.771 ± 0.757
0.0AsnXaa: 0.0 ± 0.0
Pro
4.624ProAla: 4.624 ± 3.422
0.771ProCys: 0.771 ± 0.364
3.468ProAsp: 3.468 ± 2.847
2.312ProGlu: 2.312 ± 1.093
1.156ProPhe: 1.156 ± 0.547
3.468ProGly: 3.468 ± 0.604
1.156ProHis: 1.156 ± 0.547
3.468ProIle: 3.468 ± 0.518
1.156ProLys: 1.156 ± 0.547
5.78ProLeu: 5.78 ± 0.632
1.156ProMet: 1.156 ± 0.547
1.156ProAsn: 1.156 ± 0.575
1.541ProPro: 1.541 ± 0.729
3.083ProGln: 3.083 ± 0.336
4.239ProArg: 4.239 ± 0.882
3.468ProSer: 3.468 ± 1.64
3.083ProThr: 3.083 ± 0.336
3.468ProVal: 3.468 ± 1.64
1.156ProTrp: 1.156 ± 0.575
3.083ProTyr: 3.083 ± 1.458
0.0ProXaa: 0.0 ± 0.0
Gln
5.78GlnAla: 5.78 ± 3.997
0.0GlnCys: 0.0 ± 0.0
3.083GlnAsp: 3.083 ± 1.908
3.083GlnGlu: 3.083 ± 0.786
0.771GlnPhe: 0.771 ± 0.364
1.156GlnGly: 1.156 ± 0.547
2.312GlnHis: 2.312 ± 1.093
0.771GlnIle: 0.771 ± 0.364
1.541GlnLys: 1.541 ± 0.729
5.01GlnLeu: 5.01 ± 0.997
0.771GlnMet: 0.771 ± 0.364
0.385GlnAsn: 0.385 ± 0.182
2.697GlnPro: 2.697 ± 0.154
2.312GlnGln: 2.312 ± 1.15
3.083GlnArg: 3.083 ± 0.786
1.541GlnSer: 1.541 ± 0.729
1.156GlnThr: 1.156 ± 0.547
2.697GlnVal: 2.697 ± 0.154
0.385GlnTrp: 0.385 ± 0.182
1.927GlnTyr: 1.927 ± 0.911
0.0GlnXaa: 0.0 ± 0.0
Arg
6.166ArgAla: 6.166 ± 0.672
2.697ArgCys: 2.697 ± 0.154
1.156ArgAsp: 1.156 ± 0.547
5.78ArgGlu: 5.78 ± 0.632
3.468ArgPhe: 3.468 ± 0.518
5.01ArgGly: 5.01 ± 2.118
1.156ArgHis: 1.156 ± 0.547
4.624ArgIle: 4.624 ± 1.065
3.083ArgLys: 3.083 ± 0.336
6.166ArgLeu: 6.166 ± 1.572
1.541ArgMet: 1.541 ± 2.636
1.927ArgAsn: 1.927 ± 1.332
1.927ArgPro: 1.927 ± 1.332
1.541ArgGln: 1.541 ± 1.515
4.239ArgArg: 4.239 ± 0.882
3.854ArgSer: 3.854 ± 1.543
5.01ArgThr: 5.01 ± 0.125
7.322ArgVal: 7.322 ± 0.097
2.697ArgTrp: 2.697 ± 0.154
2.697ArgTyr: 2.697 ± 0.154
0.0ArgXaa: 0.0 ± 0.0
Ser
3.468SerAla: 3.468 ± 0.518
1.156SerCys: 1.156 ± 0.547
2.697SerAsp: 2.697 ± 0.154
1.927SerGlu: 1.927 ± 0.911
1.541SerPhe: 1.541 ± 0.393
3.083SerGly: 3.083 ± 0.786
0.385SerHis: 0.385 ± 0.182
3.468SerIle: 3.468 ± 1.64
4.239SerLys: 4.239 ± 2.004
7.707SerLeu: 7.707 ± 0.279
1.541SerMet: 1.541 ± 0.729
0.771SerAsn: 0.771 ± 0.364
2.697SerPro: 2.697 ± 1.275
1.156SerGln: 1.156 ± 0.547
3.468SerArg: 3.468 ± 1.725
3.468SerSer: 3.468 ± 0.518
2.312SerThr: 2.312 ± 1.093
5.395SerVal: 5.395 ± 2.551
1.156SerTrp: 1.156 ± 0.547
2.697SerTyr: 2.697 ± 1.275
0.0SerXaa: 0.0 ± 0.0
Thr
7.322ThrAla: 7.322 ± 1.218
0.771ThrCys: 0.771 ± 0.757
2.697ThrAsp: 2.697 ± 0.968
3.468ThrGlu: 3.468 ± 1.64
1.156ThrPhe: 1.156 ± 0.575
2.697ThrGly: 2.697 ± 1.275
3.083ThrHis: 3.083 ± 0.336
4.239ThrIle: 4.239 ± 1.361
0.385ThrLys: 0.385 ± 0.182
6.166ThrLeu: 6.166 ± 0.672
1.541ThrMet: 1.541 ± 0.729
1.927ThrAsn: 1.927 ± 1.332
6.551ThrPro: 6.551 ± 0.854
2.312ThrGln: 2.312 ± 1.15
5.78ThrArg: 5.78 ± 0.49
3.468ThrSer: 3.468 ± 0.518
5.395ThrThr: 5.395 ± 1.429
1.541ThrVal: 1.541 ± 0.729
1.156ThrTrp: 1.156 ± 1.697
2.312ThrTyr: 2.312 ± 1.15
0.0ThrXaa: 0.0 ± 0.0
Val
7.707ValAla: 7.707 ± 5.33
0.771ValCys: 0.771 ± 0.364
4.624ValAsp: 4.624 ± 0.057
3.468ValGlu: 3.468 ± 0.518
3.083ValPhe: 3.083 ± 0.336
4.239ValGly: 4.239 ± 0.239
1.541ValHis: 1.541 ± 0.393
4.239ValIle: 4.239 ± 0.882
1.156ValLys: 1.156 ± 0.547
5.395ValLeu: 5.395 ± 2.551
1.541ValMet: 1.541 ± 0.321
1.156ValAsn: 1.156 ± 0.547
3.083ValPro: 3.083 ± 1.458
4.239ValGln: 4.239 ± 0.882
4.624ValArg: 4.624 ± 2.186
5.01ValSer: 5.01 ± 0.997
5.01ValThr: 5.01 ± 0.125
3.083ValVal: 3.083 ± 0.786
1.927ValTrp: 1.927 ± 0.211
3.083ValTyr: 3.083 ± 0.336
0.0ValXaa: 0.0 ± 0.0
Trp
2.697TrpAla: 2.697 ± 0.968
1.156TrpCys: 1.156 ± 0.547
0.385TrpAsp: 0.385 ± 0.182
1.927TrpGlu: 1.927 ± 0.211
0.0TrpPhe: 0.0 ± 0.0
1.541TrpGly: 1.541 ± 0.393
0.771TrpHis: 0.771 ± 0.364
0.385TrpIle: 0.385 ± 0.182
0.0TrpLys: 0.0 ± 0.0
1.156TrpLeu: 1.156 ± 0.547
0.0TrpMet: 0.0 ± 0.0
0.771TrpAsn: 0.771 ± 0.364
0.385TrpPro: 0.385 ± 0.182
0.771TrpGln: 0.771 ± 0.364
0.771TrpArg: 0.771 ± 0.757
3.468TrpSer: 3.468 ± 0.518
1.156TrpThr: 1.156 ± 1.697
1.927TrpVal: 1.927 ± 1.332
0.771TrpTrp: 0.771 ± 0.364
0.771TrpTyr: 0.771 ± 0.757
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.239TyrAla: 4.239 ± 2.004
0.0TyrCys: 0.0 ± 0.0
2.312TyrAsp: 2.312 ± 0.029
1.927TyrGlu: 1.927 ± 0.911
1.156TyrPhe: 1.156 ± 0.547
3.083TyrGly: 3.083 ± 1.908
0.771TyrHis: 0.771 ± 0.364
2.312TyrIle: 2.312 ± 1.093
0.771TyrLys: 0.771 ± 0.364
5.78TyrLeu: 5.78 ± 0.632
1.541TyrMet: 1.541 ± 0.729
2.312TyrAsn: 2.312 ± 0.029
1.541TyrPro: 1.541 ± 0.729
1.541TyrGln: 1.541 ± 0.393
5.01TyrArg: 5.01 ± 0.125
1.927TyrSer: 1.927 ± 0.911
3.854TyrThr: 3.854 ± 0.421
2.312TyrVal: 2.312 ± 3.394
0.0TyrTrp: 0.0 ± 0.0
2.697TyrTyr: 2.697 ± 0.968
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2596 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski