Amino acid dipeptide frequency for Yongjia Tick Virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
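As an illustration of the comparison described above, the following minimal sketch counts dipeptides and compares each frequency with the uniform expectation of 2.5‰. The function names are hypothetical, and the sketch assumes one-letter sequences, overlapping windows counted within each protein, and only the 20 standard residues; the exact counting convention used by the database is not stated here and may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return the per-mille frequency of each of the 400 standard dipeptides.

    Assumption: windows do not span protein boundaries, and pairs that
    contain a non-standard letter are ignored.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 inside a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    pairs = [a + b for a, b in product(STANDARD, repeat=2)]
    total = sum(counts[p] for p in pairs)
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


def representation(freq_per_mille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome.
    freqs = dipeptide_per_mille(["AAAC", "CA"])
    print(freqs["AA"], representation(freqs["AA"]))
```

With real proteins, the same classification would correspond to the red (over) and blue (under) marking of the table cells, under the assumptions listed above.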

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 6.773‰ corresponds to 0.006773, or 0.6773%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.773 ± 3.803
AlaCys: 0.797 ± 0.392
AlaAsp: 2.39 ± 0.581
AlaGlu: 6.375 ± 1.774
AlaPhe: 2.789 ± 0.619
AlaGly: 4.781 ± 3.254
AlaHis: 1.594 ± 1.922
AlaIle: 3.187 ± 0.711
AlaLys: 2.39 ± 1.176
AlaLeu: 5.976 ± 1.875
AlaMet: 1.992 ± 0.98
AlaAsn: 0.0 ± 0.0
AlaPro: 2.789 ± 0.619
AlaGln: 1.195 ± 0.813
AlaArg: 4.781 ± 1.162
AlaSer: 6.375 ± 1.422
AlaThr: 5.179 ± 4.641
AlaVal: 3.187 ± 1.381
AlaTrp: 0.797 ± 2.063
AlaTyr: 1.992 ± 0.98
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.398 ± 0.196
CysCys: 0.0 ± 0.0
CysAsp: 0.398 ± 0.196
CysGlu: 0.797 ± 0.392
CysPhe: 0.797 ± 0.961
CysGly: 0.797 ± 2.063
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.195 ± 0.588
CysLeu: 2.39 ± 1.651
CysMet: 0.0 ± 0.0
CysAsn: 0.398 ± 0.196
CysPro: 0.797 ± 2.063
CysGln: 0.797 ± 0.392
CysArg: 1.992 ± 0.607
CysSer: 1.992 ± 1.73
CysThr: 1.594 ± 1.827
CysVal: 0.797 ± 4.395
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.382 ± 1.172
AspCys: 0.0 ± 0.0
AspAsp: 4.382 ± 1.155
AspGlu: 3.984 ± 1.559
AspPhe: 1.195 ± 0.588
AspGly: 3.187 ± 0.711
AspHis: 1.992 ± 0.607
AspIle: 3.187 ± 0.711
AspLys: 3.187 ± 1.657
AspLeu: 4.382 ± 1.228
AspMet: 0.797 ± 0.392
AspAsn: 0.797 ± 0.392
AspPro: 3.187 ± 1.657
AspGln: 0.797 ± 0.392
AspArg: 3.984 ± 1.961
AspSer: 2.789 ± 1.372
AspThr: 2.39 ± 0.581
AspVal: 4.382 ± 1.155
AspTrp: 1.195 ± 0.813
AspTyr: 2.789 ± 1.372
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.179 ± 1.184
GluCys: 1.195 ± 0.588
GluAsp: 4.781 ± 2.353
GluGlu: 8.765 ± 3.209
GluPhe: 2.39 ± 1.176
GluGly: 3.187 ± 1.557
GluHis: 1.594 ± 0.784
GluIle: 5.179 ± 1.184
GluLys: 5.976 ± 1.875
GluLeu: 6.375 ± 1.774
GluMet: 3.187 ± 1.381
GluAsn: 1.992 ± 1.73
GluPro: 2.789 ± 1.372
GluGln: 1.594 ± 0.69
GluArg: 3.187 ± 1.569
GluSer: 3.984 ± 0.991
GluThr: 4.382 ± 1.955
GluVal: 4.382 ± 3.555
GluTrp: 1.195 ± 0.588
GluTyr: 3.984 ± 1.961
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.39 ± 1.627
PheCys: 0.797 ± 0.392
PheAsp: 1.992 ± 0.98
PheGlu: 1.992 ± 1.77
PhePhe: 3.586 ± 1.765
PheGly: 1.992 ± 0.98
PheHis: 0.797 ± 0.392
PheIle: 2.789 ± 1.593
PheLys: 3.984 ± 0.991
PheLeu: 3.586 ± 0.84
PheMet: 0.797 ± 0.392
PheAsn: 1.992 ± 0.607
PhePro: 3.586 ± 0.84
PheGln: 1.594 ± 0.69
PheArg: 1.992 ± 0.98
PheSer: 1.992 ± 0.98
PheThr: 2.789 ± 0.619
PheVal: 1.594 ± 2.34
PheTrp: 0.398 ± 0.196
PheTyr: 1.992 ± 0.98
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.992 ± 0.607
GlyCys: 0.0 ± 0.0
GlyAsp: 2.789 ± 1.82
GlyGlu: 1.594 ± 0.784
GlyPhe: 4.382 ± 4.151
GlyGly: 3.187 ± 1.569
GlyHis: 0.797 ± 0.392
GlyIle: 3.586 ± 1.765
GlyLys: 1.992 ± 1.77
GlyLeu: 3.984 ± 0.991
GlyMet: 4.382 ± 3.441
GlyAsn: 2.39 ± 0.581
GlyPro: 1.992 ± 1.73
GlyGln: 0.797 ± 0.392
GlyArg: 3.984 ± 1.358
GlySer: 5.179 ± 1.507
GlyThr: 3.586 ± 0.84
GlyVal: 3.586 ± 0.84
GlyTrp: 0.398 ± 0.196
GlyTyr: 1.992 ± 0.607
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.195 ± 0.588
HisCys: 0.797 ± 0.392
HisAsp: 1.195 ± 0.588
HisGlu: 1.992 ± 0.98
HisPhe: 1.195 ± 0.588
HisGly: 1.992 ± 0.98
HisHis: 2.39 ± 1.627
HisIle: 1.992 ± 0.607
HisLys: 1.195 ± 0.588
HisLeu: 1.992 ± 2.163
HisMet: 0.797 ± 0.961
HisAsn: 0.797 ± 0.392
HisPro: 1.594 ± 0.784
HisGln: 0.0 ± 0.0
HisArg: 2.39 ± 2.883
HisSer: 2.789 ± 0.619
HisThr: 0.797 ± 0.392
HisVal: 1.992 ± 0.98
HisTrp: 0.0 ± 0.0
HisTyr: 1.594 ± 0.784
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.976 ± 1.821
IleCys: 1.992 ± 0.607
IleAsp: 1.992 ± 0.98
IleGlu: 3.586 ± 0.84
IlePhe: 0.797 ± 0.392
IleGly: 1.992 ± 0.98
IleHis: 2.789 ± 1.372
IleIle: 3.187 ± 0.711
IleLys: 4.781 ± 2.071
IleLeu: 3.187 ± 1.569
IleMet: 1.992 ± 0.98
IleAsn: 1.195 ± 0.813
IlePro: 1.594 ± 0.784
IleGln: 3.586 ± 0.84
IleArg: 3.984 ± 0.991
IleSer: 4.382 ± 1.596
IleThr: 4.382 ± 2.184
IleVal: 2.39 ± 0.581
IleTrp: 0.398 ± 0.196
IleTyr: 1.195 ± 0.588
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.586 ± 0.84
LysCys: 0.398 ± 0.196
LysAsp: 2.39 ± 0.581
LysGlu: 3.187 ± 0.711
LysPhe: 3.187 ± 0.711
LysGly: 2.789 ± 1.82
LysHis: 2.39 ± 0.581
LysIle: 4.382 ± 2.184
LysLys: 6.375 ± 1.774
LysLeu: 5.976 ± 1.875
LysMet: 1.992 ± 0.98
LysAsn: 1.992 ± 0.98
LysPro: 2.39 ± 0.581
LysGln: 2.39 ± 1.651
LysArg: 2.789 ± 1.372
LysSer: 3.586 ± 1.765
LysThr: 3.187 ± 1.381
LysVal: 5.179 ± 1.033
LysTrp: 1.594 ± 0.784
LysTyr: 2.39 ± 0.581
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.375 ± 2.063
LeuCys: 1.992 ± 6.795
LeuAsp: 4.382 ± 1.228
LeuGlu: 5.578 ± 1.834
LeuPhe: 4.382 ± 2.157
LeuGly: 5.578 ± 1.383
LeuHis: 3.187 ± 0.711
LeuIle: 3.984 ± 1.214
LeuLys: 5.578 ± 3.166
LeuLeu: 7.57 ± 2.857
LeuMet: 3.586 ± 0.84
LeuAsn: 1.992 ± 0.607
LeuPro: 5.578 ± 1.383
LeuGln: 1.992 ± 0.607
LeuArg: 7.171 ± 0.667
LeuSer: 8.765 ± 1.934
LeuThr: 7.57 ± 1.519
LeuVal: 5.976 ± 3.476
LeuTrp: 0.797 ± 0.392
LeuTyr: 2.789 ± 0.619
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.992 ± 0.607
MetCys: 0.797 ± 2.063
MetAsp: 3.187 ± 1.657
MetGlu: 3.187 ± 0.711
MetPhe: 2.39 ± 0.581
MetGly: 0.797 ± 0.961
MetHis: 2.39 ± 0.581
MetIle: 1.992 ± 0.98
MetLys: 1.195 ± 0.813
MetLeu: 1.992 ± 0.98
MetMet: 0.797 ± 2.063
MetAsn: 0.797 ± 0.392
MetPro: 0.0 ± 0.0
MetGln: 1.594 ± 0.784
MetArg: 1.594 ± 1.827
MetSer: 2.789 ± 1.372
MetThr: 3.187 ± 0.711
MetVal: 1.195 ± 0.588
MetTrp: 0.0 ± 0.0
MetTyr: 0.797 ± 0.392
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.195 ± 0.588
AsnCys: 0.398 ± 0.196
AsnAsp: 2.39 ± 1.989
AsnGlu: 1.594 ± 1.827
AsnPhe: 1.594 ± 0.69
AsnGly: 1.992 ± 0.607
AsnHis: 0.398 ± 0.196
AsnIle: 0.797 ± 0.392
AsnLys: 2.39 ± 1.176
AsnLeu: 3.187 ± 1.569
AsnMet: 1.992 ± 0.98
AsnAsn: 1.594 ± 1.827
AsnPro: 1.195 ± 0.588
AsnGln: 1.992 ± 0.98
AsnArg: 0.797 ± 0.392
AsnSer: 5.179 ± 1.507
AsnThr: 0.398 ± 0.196
AsnVal: 1.195 ± 0.813
AsnTrp: 0.398 ± 1.123
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.195 ± 0.813
ProCys: 0.0 ± 0.0
ProAsp: 3.586 ± 0.84
ProGlu: 4.781 ± 1.162
ProPhe: 1.992 ± 0.98
ProGly: 2.789 ± 1.372
ProHis: 0.797 ± 2.063
ProIle: 2.39 ± 1.627
ProLys: 1.992 ± 0.98
ProLeu: 5.976 ± 10.478
ProMet: 1.195 ± 1.938
ProAsn: 0.398 ± 0.196
ProPro: 3.586 ± 3.553
ProGln: 1.195 ± 0.588
ProArg: 0.797 ± 0.392
ProSer: 4.382 ± 1.228
ProThr: 3.187 ± 1.657
ProVal: 2.789 ± 0.619
ProTrp: 0.797 ± 0.392
ProTyr: 1.594 ± 0.784
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.594 ± 0.69
GlnCys: 0.0 ± 0.0
GlnAsp: 3.187 ± 2.581
GlnGlu: 2.789 ± 1.372
GlnPhe: 0.398 ± 0.196
GlnGly: 1.594 ± 0.69
GlnHis: 0.797 ± 0.392
GlnIle: 1.992 ± 0.98
GlnLys: 1.195 ± 0.588
GlnLeu: 3.984 ± 6.616
GlnMet: 0.398 ± 0.196
GlnAsn: 1.195 ± 0.588
GlnPro: 0.797 ± 0.961
GlnGln: 2.39 ± 0.581
GlnArg: 0.398 ± 0.196
GlnSer: 1.195 ± 0.813
GlnThr: 3.187 ± 1.557
GlnVal: 3.984 ± 1.214
GlnTrp: 1.195 ± 0.588
GlnTyr: 0.398 ± 0.196
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.578 ± 1.383
ArgCys: 1.195 ± 0.588
ArgAsp: 3.187 ± 1.569
ArgGlu: 4.382 ± 2.157
ArgPhe: 1.195 ± 0.813
ArgGly: 1.992 ± 1.77
ArgHis: 1.195 ± 0.588
ArgIle: 4.382 ± 1.155
ArgLys: 3.187 ± 0.711
ArgLeu: 5.578 ± 0.98
ArgMet: 2.789 ± 1.372
ArgAsn: 3.187 ± 0.711
ArgPro: 1.992 ± 0.98
ArgGln: 2.789 ± 2.728
ArgArg: 4.781 ± 1.656
ArgSer: 2.789 ± 1.593
ArgThr: 3.586 ± 1.502
ArgVal: 3.187 ± 0.711
ArgTrp: 1.594 ± 0.784
ArgTyr: 1.195 ± 0.588
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.781 ± 1.763
SerCys: 0.797 ± 0.392
SerAsp: 3.586 ± 1.765
SerGlu: 4.382 ± 1.155
SerPhe: 3.586 ± 1.285
SerGly: 3.187 ± 1.569
SerHis: 2.789 ± 0.619
SerIle: 4.382 ± 2.157
SerLys: 2.789 ± 1.372
SerLeu: 12.351 ± 3.133
SerMet: 1.992 ± 0.98
SerAsn: 3.187 ± 0.711
SerPro: 5.179 ± 1.033
SerGln: 3.586 ± 1.502
SerArg: 5.578 ± 0.98
SerSer: 8.367 ± 2.821
SerThr: 4.781 ± 1.328
SerVal: 4.781 ± 1.656
SerTrp: 0.797 ± 0.392
SerTyr: 1.594 ± 0.784
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.984 ± 2.307
ThrCys: 0.797 ± 0.392
ThrAsp: 3.187 ± 1.569
ThrGlu: 4.382 ± 4.151
ThrPhe: 2.39 ± 0.581
ThrGly: 4.382 ± 1.172
ThrHis: 1.195 ± 0.588
ThrIle: 2.789 ± 1.372
ThrLys: 5.578 ± 1.238
ThrLeu: 4.781 ± 4.499
ThrMet: 0.398 ± 2.197
ThrAsn: 4.382 ± 1.596
ThrPro: 3.187 ± 3.993
ThrGln: 2.39 ± 1.627
ThrArg: 3.586 ± 1.546
ThrSer: 6.375 ± 3.137
ThrThr: 5.179 ± 1.184
ThrVal: 4.781 ± 2.885
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.797 ± 0.961
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.781 ± 1.162
ValCys: 2.39 ± 6.188
ValAsp: 2.789 ± 0.619
ValGlu: 9.163 ± 2.482
ValPhe: 1.992 ± 0.98
ValGly: 4.382 ± 4.151
ValHis: 0.797 ± 0.392
ValIle: 3.187 ± 1.381
ValLys: 5.578 ± 1.69
ValLeu: 5.976 ± 0.965
ValMet: 1.594 ± 0.784
ValAsn: 1.195 ± 0.588
ValPro: 1.594 ± 0.69
ValGln: 0.797 ± 2.063
ValArg: 3.586 ± 0.84
ValSer: 3.586 ± 2.341
ValThr: 1.992 ± 1.77
ValVal: 3.187 ± 0.711
ValTrp: 1.594 ± 1.827
ValTyr: 2.39 ± 1.989
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.797 ± 2.063
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.797 ± 0.392
TrpPhe: 1.195 ± 0.588
TrpGly: 0.398 ± 0.196
TrpHis: 0.0 ± 0.0
TrpIle: 0.797 ± 0.392
TrpLys: 0.0 ± 0.0
TrpLeu: 2.39 ± 0.581
TrpMet: 0.797 ± 0.392
TrpAsn: 0.398 ± 0.196
TrpPro: 0.398 ± 2.197
TrpGln: 0.0 ± 0.0
TrpArg: 0.398 ± 0.196
TrpSer: 1.992 ± 0.607
TrpThr: 1.195 ± 0.588
TrpVal: 1.594 ± 0.784
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.398 ± 0.196
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.195 ± 0.588
TyrCys: 0.797 ± 0.392
TyrAsp: 1.195 ± 0.588
TyrGlu: 1.992 ± 0.98
TyrPhe: 1.195 ± 0.588
TyrGly: 1.992 ± 0.98
TyrHis: 0.797 ± 0.392
TyrIle: 1.195 ± 0.588
TyrLys: 1.594 ± 0.69
TyrLeu: 3.187 ± 0.711
TyrMet: 0.797 ± 0.392
TyrAsn: 0.797 ± 0.392
TyrPro: 1.195 ± 2.52
TyrGln: 1.195 ± 0.813
TyrArg: 1.992 ± 0.607
TyrSer: 3.984 ± 1.961
TyrThr: 1.992 ± 0.98
TyrVal: 2.39 ± 1.176
TyrTrp: 0.398 ± 0.196
TyrTyr: 0.797 ± 0.392
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2511 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
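A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation across replicates. The function names, the fixed seed, and the choice of standard deviation as the error measure are assumptions and are not confirmed by the page.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping windows.

    Assumption: windows are counted within each protein separately.
    """
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    values = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(per_mille(sample, dipeptide))
    return statistics.stdev(values)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome.
    toy = ["AAAA", "CCCC", "ACAC"]
    print(per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

With only a few proteins, each replicate can differ a lot from the full set, so under this scheme the estimated error may be large relative to the frequency itself. A dipeptide that never occurs yields 0.0 ± 0.0.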

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski