Amino acid dipepetide frequency for Pacific coast tick phlebovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.911AlaAla: 6.911 ± 5.421
1.22AlaCys: 1.22 ± 0.52
2.033AlaAsp: 2.033 ± 0.329
2.846AlaGlu: 2.846 ± 0.017
2.033AlaPhe: 2.033 ± 0.329
3.659AlaGly: 3.659 ± 2.027
2.439AlaHis: 2.439 ± 1.351
3.659AlaIle: 3.659 ± 0.831
4.878AlaLys: 4.878 ± 1.507
6.504AlaLeu: 6.504 ± 2.009
2.033AlaMet: 2.033 ± 1.524
2.439AlaAsn: 2.439 ± 1.039
0.813AlaPro: 0.813 ± 0.346
3.252AlaGln: 3.252 ± 2.2
4.065AlaArg: 4.065 ± 1.732
5.691AlaSer: 5.691 ± 0.034
4.472AlaThr: 4.472 ± 0.485
4.472AlaVal: 4.472 ± 4.07
0.813AlaTrp: 0.813 ± 0.346
1.626AlaTyr: 1.626 ± 0.502
0.0AlaXaa: 0.0 ± 0.0
Cys
0.813CysAla: 0.813 ± 0.849
0.407CysCys: 0.407 ± 0.173
1.22CysAsp: 1.22 ± 0.52
0.813CysGlu: 0.813 ± 0.346
0.407CysPhe: 0.407 ± 0.173
0.813CysGly: 0.813 ± 0.346
0.407CysHis: 0.407 ± 0.173
0.813CysIle: 0.813 ± 0.346
1.626CysLys: 1.626 ± 0.693
1.626CysLeu: 1.626 ± 1.697
1.22CysMet: 1.22 ± 0.52
0.407CysAsn: 0.407 ± 0.173
1.626CysPro: 1.626 ± 0.693
0.813CysGln: 0.813 ± 0.346
0.0CysArg: 0.0 ± 0.0
0.813CysSer: 0.813 ± 0.346
1.22CysThr: 1.22 ± 0.52
0.0CysVal: 0.0 ± 0.0
0.407CysTrp: 0.407 ± 0.173
0.813CysTyr: 0.813 ± 0.346
0.0CysXaa: 0.0 ± 0.0
Asp
2.846AspAla: 2.846 ± 1.212
0.407AspCys: 0.407 ± 0.173
4.878AspAsp: 4.878 ± 2.078
4.065AspGlu: 4.065 ± 1.853
1.626AspPhe: 1.626 ± 0.693
4.065AspGly: 4.065 ± 0.537
2.033AspHis: 2.033 ± 0.866
6.504AspIle: 6.504 ± 1.576
2.033AspLys: 2.033 ± 0.866
3.659AspLeu: 3.659 ± 1.559
1.626AspMet: 1.626 ± 0.502
1.626AspAsn: 1.626 ± 0.502
4.472AspPro: 4.472 ± 1.68
2.033AspGln: 2.033 ± 0.329
3.659AspArg: 3.659 ± 3.222
4.065AspSer: 4.065 ± 0.658
1.626AspThr: 1.626 ± 0.693
6.098AspVal: 6.098 ± 1.403
0.813AspTrp: 0.813 ± 0.346
0.813AspTyr: 0.813 ± 0.346
0.0AspXaa: 0.0 ± 0.0
Glu
6.504GluAla: 6.504 ± 0.814
1.22GluCys: 1.22 ± 0.52
4.065GluAsp: 4.065 ± 1.853
8.943GluGlu: 8.943 ± 0.97
2.846GluPhe: 2.846 ± 1.212
3.252GluGly: 3.252 ± 2.2
1.626GluHis: 1.626 ± 0.502
3.252GluIle: 3.252 ± 1.385
3.659GluLys: 3.659 ± 0.831
6.098GluLeu: 6.098 ± 2.182
2.033GluMet: 2.033 ± 0.855
1.626GluAsn: 1.626 ± 0.693
2.846GluPro: 2.846 ± 1.212
2.033GluGln: 2.033 ± 0.866
1.22GluArg: 1.22 ± 0.52
5.285GluSer: 5.285 ± 1.056
8.537GluThr: 8.537 ± 2.442
5.285GluVal: 5.285 ± 1.334
0.0GluTrp: 0.0 ± 0.0
2.846GluTyr: 2.846 ± 0.017
0.0GluXaa: 0.0 ± 0.0
Phe
0.813PheAla: 0.813 ± 0.849
2.033PheCys: 2.033 ± 0.866
1.626PheAsp: 1.626 ± 0.502
2.033PheGlu: 2.033 ± 0.866
2.439PhePhe: 2.439 ± 1.039
2.033PheGly: 2.033 ± 0.866
2.033PheHis: 2.033 ± 0.866
1.22PheIle: 1.22 ± 0.52
3.659PheLys: 3.659 ± 1.559
4.472PheLeu: 4.472 ± 0.485
1.22PheMet: 1.22 ± 0.52
2.033PheAsn: 2.033 ± 0.329
2.033PhePro: 2.033 ± 0.866
1.22PheGln: 1.22 ± 0.52
1.22PheArg: 1.22 ± 0.52
2.439PheSer: 2.439 ± 0.156
2.846PheThr: 2.846 ± 2.373
1.626PheVal: 1.626 ± 0.693
0.813PheTrp: 0.813 ± 0.346
1.22PheTyr: 1.22 ± 0.52
0.0PheXaa: 0.0 ± 0.0
Gly
4.065GlyAla: 4.065 ± 0.658
1.22GlyCys: 1.22 ± 0.52
1.22GlyAsp: 1.22 ± 1.871
2.846GlyGlu: 2.846 ± 0.017
3.252GlyPhe: 3.252 ± 0.19
3.659GlyGly: 3.659 ± 0.831
1.22GlyHis: 1.22 ± 0.676
2.439GlyIle: 2.439 ± 1.039
6.098GlyLys: 6.098 ± 1.403
3.659GlyLeu: 3.659 ± 2.027
1.22GlyMet: 1.22 ± 0.46
1.22GlyAsn: 1.22 ± 0.676
2.846GlyPro: 2.846 ± 3.568
1.626GlyGln: 1.626 ± 0.502
6.504GlyArg: 6.504 ± 0.381
2.033GlySer: 2.033 ± 0.866
2.846GlyThr: 2.846 ± 1.178
2.846GlyVal: 2.846 ± 1.178
0.407GlyTrp: 0.407 ± 0.173
1.626GlyTyr: 1.626 ± 0.502
0.0GlyXaa: 0.0 ± 0.0
His
0.407HisAla: 0.407 ± 1.022
1.626HisCys: 1.626 ± 0.502
2.033HisAsp: 2.033 ± 0.329
1.626HisGlu: 1.626 ± 0.693
0.813HisPhe: 0.813 ± 0.346
4.878HisGly: 4.878 ± 0.883
2.439HisHis: 2.439 ± 1.039
1.626HisIle: 1.626 ± 0.693
1.22HisLys: 1.22 ± 0.676
3.659HisLeu: 3.659 ± 1.559
1.22HisMet: 1.22 ± 0.52
0.407HisAsn: 0.407 ± 0.173
1.626HisPro: 1.626 ± 0.693
1.22HisGln: 1.22 ± 0.676
2.033HisArg: 2.033 ± 0.329
1.626HisSer: 1.626 ± 0.693
2.033HisThr: 2.033 ± 0.866
1.22HisVal: 1.22 ± 0.52
0.0HisTrp: 0.0 ± 0.0
1.22HisTyr: 1.22 ± 0.52
0.0HisXaa: 0.0 ± 0.0
Ile
2.846IleAla: 2.846 ± 1.212
0.0IleCys: 0.0 ± 0.0
1.626IleAsp: 1.626 ± 0.693
2.439IleGlu: 2.439 ± 1.039
2.439IlePhe: 2.439 ± 0.156
2.439IleGly: 2.439 ± 0.156
1.626IleHis: 1.626 ± 0.693
2.439IleIle: 2.439 ± 1.351
3.252IleLys: 3.252 ± 1.385
5.285IleLeu: 5.285 ± 1.056
1.626IleMet: 1.626 ± 0.693
1.22IleAsn: 1.22 ± 0.52
2.439IlePro: 2.439 ± 0.156
0.813IleGln: 0.813 ± 0.849
3.252IleArg: 3.252 ± 1.385
6.098IleSer: 6.098 ± 1.403
3.252IleThr: 3.252 ± 1.005
3.252IleVal: 3.252 ± 0.19
0.407IleTrp: 0.407 ± 0.173
1.626IleTyr: 1.626 ± 0.693
0.0IleXaa: 0.0 ± 0.0
Lys
4.472LysAla: 4.472 ± 0.71
0.813LysCys: 0.813 ± 0.849
4.878LysAsp: 4.878 ± 0.883
4.065LysGlu: 4.065 ± 0.537
2.033LysPhe: 2.033 ± 0.329
2.846LysGly: 2.846 ± 1.178
2.033LysHis: 2.033 ± 0.329
2.846LysIle: 2.846 ± 0.017
5.285LysLys: 5.285 ± 0.139
6.098LysLeu: 6.098 ± 2.598
4.472LysMet: 4.472 ± 0.485
3.252LysAsn: 3.252 ± 1.385
2.846LysPro: 2.846 ± 0.017
2.439LysGln: 2.439 ± 1.039
3.252LysArg: 3.252 ± 3.395
1.626LysSer: 1.626 ± 0.693
4.065LysThr: 4.065 ± 1.732
4.065LysVal: 4.065 ± 1.732
0.813LysTrp: 0.813 ± 0.346
1.626LysTyr: 1.626 ± 0.693
0.0LysXaa: 0.0 ± 0.0
Leu
5.691LeuAla: 5.691 ± 4.746
1.626LeuCys: 1.626 ± 0.693
6.098LeuAsp: 6.098 ± 2.598
6.504LeuGlu: 6.504 ± 0.814
4.065LeuPhe: 4.065 ± 0.658
4.065LeuGly: 4.065 ± 0.537
2.846LeuHis: 2.846 ± 1.212
2.439LeuIle: 2.439 ± 0.156
7.724LeuLys: 7.724 ± 5.075
8.943LeuLeu: 8.943 ± 1.42
2.846LeuMet: 2.846 ± 1.212
2.033LeuAsn: 2.033 ± 0.866
3.659LeuPro: 3.659 ± 2.027
3.659LeuGln: 3.659 ± 0.364
5.691LeuArg: 5.691 ± 0.034
7.724LeuSer: 7.724 ± 0.295
6.504LeuThr: 6.504 ± 2.009
6.098LeuVal: 6.098 ± 1.403
0.813LeuTrp: 0.813 ± 0.346
2.439LeuTyr: 2.439 ± 0.156
0.0LeuXaa: 0.0 ± 0.0
Met
2.846MetAla: 2.846 ± 0.017
0.0MetCys: 0.0 ± 0.0
1.626MetAsp: 1.626 ± 0.693
4.472MetGlu: 4.472 ± 0.485
1.626MetPhe: 1.626 ± 0.693
2.033MetGly: 2.033 ± 0.866
1.22MetHis: 1.22 ± 0.52
1.626MetIle: 1.626 ± 0.502
1.626MetLys: 1.626 ± 0.693
1.626MetLeu: 1.626 ± 0.502
2.033MetMet: 2.033 ± 0.329
1.22MetAsn: 1.22 ± 0.52
1.22MetPro: 1.22 ± 0.676
1.22MetGln: 1.22 ± 0.52
1.626MetArg: 1.626 ± 0.693
0.0MetSer: 0.0 ± 0.0
1.626MetThr: 1.626 ± 0.502
3.659MetVal: 3.659 ± 0.831
0.813MetTrp: 0.813 ± 0.346
0.407MetTyr: 0.407 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
2.439AsnAla: 2.439 ± 0.156
0.407AsnCys: 0.407 ± 0.173
2.846AsnAsp: 2.846 ± 1.178
3.252AsnGlu: 3.252 ± 0.19
1.626AsnPhe: 1.626 ± 0.693
1.626AsnGly: 1.626 ± 0.693
0.813AsnHis: 0.813 ± 0.346
2.846AsnIle: 2.846 ± 1.212
1.626AsnLys: 1.626 ± 0.693
3.659AsnLeu: 3.659 ± 0.831
0.0AsnMet: 0.0 ± 0.0
1.626AsnAsn: 1.626 ± 0.693
2.439AsnPro: 2.439 ± 2.546
1.22AsnGln: 1.22 ± 0.52
0.407AsnArg: 0.407 ± 0.173
0.813AsnSer: 0.813 ± 0.346
3.252AsnThr: 3.252 ± 1.385
1.626AsnVal: 1.626 ± 0.693
0.407AsnTrp: 0.407 ± 1.022
0.407AsnTyr: 0.407 ± 0.173
0.0AsnXaa: 0.0 ± 0.0
Pro
2.439ProAla: 2.439 ± 2.546
1.22ProCys: 1.22 ± 0.52
2.846ProAsp: 2.846 ± 0.017
6.504ProGlu: 6.504 ± 0.814
3.252ProPhe: 3.252 ± 1.385
2.033ProGly: 2.033 ± 1.524
0.813ProHis: 0.813 ± 2.044
0.407ProIle: 0.407 ± 0.173
0.813ProLys: 0.813 ± 0.346
4.065ProLeu: 4.065 ± 0.537
0.813ProMet: 0.813 ± 0.849
1.22ProAsn: 1.22 ± 0.52
3.252ProPro: 3.252 ± 0.19
2.033ProGln: 2.033 ± 3.914
2.846ProArg: 2.846 ± 2.373
4.878ProSer: 4.878 ± 2.078
2.846ProThr: 2.846 ± 0.017
2.846ProVal: 2.846 ± 1.178
1.22ProTrp: 1.22 ± 0.52
1.22ProTyr: 1.22 ± 0.52
0.0ProXaa: 0.0 ± 0.0
Gln
3.659GlnAla: 3.659 ± 3.222
0.0GlnCys: 0.0 ± 0.0
1.22GlnAsp: 1.22 ± 1.871
2.439GlnGlu: 2.439 ± 1.351
0.0GlnPhe: 0.0 ± 0.0
1.22GlnGly: 1.22 ± 1.871
1.22GlnHis: 1.22 ± 0.52
2.439GlnIle: 2.439 ± 1.039
2.439GlnLys: 2.439 ± 1.039
3.252GlnLeu: 3.252 ± 3.395
2.439GlnMet: 2.439 ± 1.039
0.407GlnAsn: 0.407 ± 0.173
1.22GlnPro: 1.22 ± 0.676
0.813GlnGln: 0.813 ± 0.346
3.252GlnArg: 3.252 ± 1.385
1.626GlnSer: 1.626 ± 0.502
1.626GlnThr: 1.626 ± 0.693
2.846GlnVal: 2.846 ± 0.017
0.407GlnTrp: 0.407 ± 0.173
0.407GlnTyr: 0.407 ± 0.173
0.0GlnXaa: 0.0 ± 0.0
Arg
3.252ArgAla: 3.252 ± 1.385
0.407ArgCys: 0.407 ± 1.022
3.659ArgAsp: 3.659 ± 0.364
5.285ArgGlu: 5.285 ± 0.139
1.626ArgPhe: 1.626 ± 0.502
2.846ArgGly: 2.846 ± 2.373
0.813ArgHis: 0.813 ± 0.346
1.626ArgIle: 1.626 ± 0.502
4.065ArgLys: 4.065 ± 0.658
3.252ArgLeu: 3.252 ± 1.005
1.22ArgMet: 1.22 ± 0.52
1.22ArgAsn: 1.22 ± 0.52
2.846ArgPro: 2.846 ± 0.017
3.252ArgGln: 3.252 ± 1.005
4.065ArgArg: 4.065 ± 0.658
6.504ArgSer: 6.504 ± 0.381
4.472ArgThr: 4.472 ± 0.71
8.537ArgVal: 8.537 ± 1.143
0.813ArgTrp: 0.813 ± 0.346
2.439ArgTyr: 2.439 ± 0.156
0.0ArgXaa: 0.0 ± 0.0
Ser
5.691SerAla: 5.691 ± 0.034
0.407SerCys: 0.407 ± 0.173
6.504SerAsp: 6.504 ± 0.381
4.472SerGlu: 4.472 ± 0.71
4.472SerPhe: 4.472 ± 0.485
1.626SerGly: 1.626 ± 0.693
3.252SerHis: 3.252 ± 1.385
3.659SerIle: 3.659 ± 0.364
3.659SerLys: 3.659 ± 1.559
8.943SerLeu: 8.943 ± 1.42
0.813SerMet: 0.813 ± 0.346
2.439SerAsn: 2.439 ± 1.351
3.252SerPro: 3.252 ± 0.19
0.407SerGln: 0.407 ± 0.173
3.659SerArg: 3.659 ± 2.027
6.098SerSer: 6.098 ± 0.987
2.439SerThr: 2.439 ± 1.039
6.504SerVal: 6.504 ± 2.771
1.22SerTrp: 1.22 ± 0.52
2.846SerTyr: 2.846 ± 1.212
0.0SerXaa: 0.0 ± 0.0
Thr
2.033ThrAla: 2.033 ± 0.329
0.407ThrCys: 0.407 ± 0.173
4.065ThrAsp: 4.065 ± 0.537
4.472ThrGlu: 4.472 ± 0.71
1.626ThrPhe: 1.626 ± 0.693
3.252ThrGly: 3.252 ± 0.19
1.626ThrHis: 1.626 ± 0.693
4.472ThrIle: 4.472 ± 0.71
2.439ThrLys: 2.439 ± 1.039
8.943ThrLeu: 8.943 ± 0.225
1.626ThrMet: 1.626 ± 0.502
2.846ThrAsn: 2.846 ± 0.017
3.252ThrPro: 3.252 ± 1.005
2.846ThrGln: 2.846 ± 0.017
5.285ThrArg: 5.285 ± 1.056
4.065ThrSer: 4.065 ± 1.732
4.878ThrThr: 4.878 ± 0.312
2.439ThrVal: 2.439 ± 0.156
1.22ThrTrp: 1.22 ± 0.52
0.813ThrTyr: 0.813 ± 2.044
0.0ThrXaa: 0.0 ± 0.0
Val
5.285ValAla: 5.285 ± 1.334
1.626ValCys: 1.626 ± 0.693
3.659ValAsp: 3.659 ± 1.559
4.878ValGlu: 4.878 ± 1.507
2.033ValPhe: 2.033 ± 0.866
4.065ValGly: 4.065 ± 1.853
2.033ValHis: 2.033 ± 0.866
2.033ValIle: 2.033 ± 0.866
5.691ValLys: 5.691 ± 2.425
3.659ValLeu: 3.659 ± 2.027
2.846ValMet: 2.846 ± 1.212
3.659ValAsn: 3.659 ± 2.027
2.439ValPro: 2.439 ± 0.156
1.22ValGln: 1.22 ± 0.676
6.504ValArg: 6.504 ± 2.009
8.943ValSer: 8.943 ± 0.225
2.033ValThr: 2.033 ± 0.866
6.098ValVal: 6.098 ± 2.182
1.22ValTrp: 1.22 ± 0.52
2.439ValTyr: 2.439 ± 1.039
0.0ValXaa: 0.0 ± 0.0
Trp
2.033TrpAla: 2.033 ± 0.329
0.813TrpCys: 0.813 ± 0.346
0.407TrpAsp: 0.407 ± 0.173
1.22TrpGlu: 1.22 ± 0.52
0.813TrpPhe: 0.813 ± 0.346
1.22TrpGly: 1.22 ± 0.52
0.0TrpHis: 0.0 ± 0.0
0.407TrpIle: 0.407 ± 0.173
0.407TrpLys: 0.407 ± 0.173
0.813TrpLeu: 0.813 ± 0.346
0.407TrpMet: 0.407 ± 0.173
0.407TrpAsn: 0.407 ± 0.173
0.407TrpPro: 0.407 ± 0.173
0.407TrpGln: 0.407 ± 0.173
0.813TrpArg: 0.813 ± 0.346
0.407TrpSer: 0.407 ± 0.173
0.813TrpThr: 0.813 ± 0.346
0.407TrpVal: 0.407 ± 0.173
0.407TrpTrp: 0.407 ± 0.173
0.813TrpTyr: 0.813 ± 0.346
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.813TyrAla: 0.813 ± 0.849
0.407TyrCys: 0.407 ± 0.173
2.439TyrAsp: 2.439 ± 1.039
0.407TyrGlu: 0.407 ± 0.173
0.0TyrPhe: 0.0 ± 0.0
1.22TyrGly: 1.22 ± 0.676
2.439TyrHis: 2.439 ± 1.039
1.626TyrIle: 1.626 ± 0.693
1.626TyrLys: 1.626 ± 0.693
3.252TyrLeu: 3.252 ± 0.19
0.407TyrMet: 0.407 ± 0.173
2.033TyrAsn: 2.033 ± 0.329
2.033TyrPro: 2.033 ± 0.329
0.407TyrGln: 0.407 ± 1.022
2.846TyrArg: 2.846 ± 1.212
1.626TyrSer: 1.626 ± 0.693
1.22TyrThr: 1.22 ± 0.52
2.439TyrVal: 2.439 ± 0.156
0.407TyrTrp: 0.407 ± 0.173
0.407TyrTyr: 0.407 ± 0.173
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2461 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski