Amino acid dipepetide frequency for Shahe arthropod virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.26AlaAla: 7.26 ± 0.877
0.363AlaCys: 0.363 ± 0.357
6.171AlaAsp: 6.171 ± 0.763
3.993AlaGlu: 3.993 ± 0.511
5.445AlaPhe: 5.445 ± 0.231
7.623AlaGly: 7.623 ± 0.474
1.452AlaHis: 1.452 ± 0.28
4.356AlaIle: 4.356 ± 1.409
3.63AlaLys: 3.63 ± 0.985
6.171AlaLeu: 6.171 ± 0.194
2.904AlaMet: 2.904 ± 0.56
2.541AlaAsn: 2.541 ± 0.791
6.534AlaPro: 6.534 ± 2.44
2.541AlaGln: 2.541 ± 1.36
3.63AlaArg: 3.63 ± 0.154
6.897AlaSer: 6.897 ± 1.089
6.171AlaThr: 6.171 ± 1.332
4.356AlaVal: 4.356 ± 0.299
0.726AlaTrp: 0.726 ± 0.145
3.267AlaTyr: 3.267 ± 1.505
0.0AlaXaa: 0.0 ± 0.0
Cys
1.452CysAla: 1.452 ± 0.289
0.363CysCys: 0.363 ± 0.212
1.452CysAsp: 1.452 ± 0.289
0.363CysGlu: 0.363 ± 0.212
1.089CysPhe: 1.089 ± 0.637
0.363CysGly: 0.363 ± 0.212
0.363CysHis: 0.363 ± 0.212
0.363CysIle: 0.363 ± 0.212
1.089CysLys: 1.089 ± 0.068
0.726CysLeu: 0.726 ± 0.425
0.0CysMet: 0.0 ± 0.0
1.089CysAsn: 1.089 ± 0.637
0.363CysPro: 0.363 ± 0.357
0.726CysGln: 0.726 ± 0.145
0.363CysArg: 0.363 ± 0.212
0.363CysSer: 0.363 ± 0.357
0.726CysThr: 0.726 ± 0.425
0.363CysVal: 0.363 ± 0.212
1.089CysTrp: 1.089 ± 0.068
1.089CysTyr: 1.089 ± 0.068
0.0CysXaa: 0.0 ± 0.0
Asp
3.63AspAla: 3.63 ± 1.292
1.089AspCys: 1.089 ± 0.068
4.719AspAsp: 4.719 ± 0.086
2.904AspGlu: 2.904 ± 0.579
3.267AspPhe: 3.267 ± 0.203
3.267AspGly: 3.267 ± 0.366
0.363AspHis: 0.363 ± 0.212
5.445AspIle: 5.445 ± 0.908
1.452AspLys: 1.452 ± 0.849
3.993AspLeu: 3.993 ± 1.766
1.452AspMet: 1.452 ± 0.28
2.541AspAsn: 2.541 ± 0.222
3.267AspPro: 3.267 ± 0.366
2.541AspGln: 2.541 ± 0.917
2.904AspArg: 2.904 ± 0.579
3.63AspSer: 3.63 ± 0.415
2.178AspThr: 2.178 ± 1.003
3.63AspVal: 3.63 ± 1.554
1.089AspTrp: 1.089 ± 0.637
2.904AspTyr: 2.904 ± 0.56
0.0AspXaa: 0.0 ± 0.0
Glu
5.445GluAla: 5.445 ± 1.477
0.363GluCys: 0.363 ± 0.357
3.63GluAsp: 3.63 ± 0.985
3.267GluGlu: 3.267 ± 0.772
2.541GluPhe: 2.541 ± 0.222
1.452GluGly: 1.452 ± 0.28
0.363GluHis: 0.363 ± 0.212
2.541GluIle: 2.541 ± 0.222
2.904GluLys: 2.904 ± 1.129
3.993GluLeu: 3.993 ± 1.197
2.904GluMet: 2.904 ± 0.009
1.452GluAsn: 1.452 ± 0.289
3.993GluPro: 3.993 ± 1.08
1.089GluGln: 1.089 ± 0.068
2.178GluArg: 2.178 ± 1.003
3.993GluSer: 3.993 ± 1.766
3.267GluThr: 3.267 ± 0.203
2.541GluVal: 2.541 ± 0.222
1.815GluTrp: 1.815 ± 0.077
1.089GluTyr: 1.089 ± 0.068
0.0GluXaa: 0.0 ± 0.0
Phe
2.904PheAla: 2.904 ± 1.148
0.363PheCys: 0.363 ± 0.212
2.904PheAsp: 2.904 ± 0.56
3.993PheGlu: 3.993 ± 1.197
1.452PhePhe: 1.452 ± 0.28
4.356PheGly: 4.356 ± 0.299
1.452PheHis: 1.452 ± 0.28
1.815PheIle: 1.815 ± 0.492
2.541PheLys: 2.541 ± 0.348
5.082PheLeu: 5.082 ± 0.695
1.089PheMet: 1.089 ± 0.129
1.815PheAsn: 1.815 ± 0.492
2.178PhePro: 2.178 ± 1.003
2.904PheGln: 2.904 ± 0.009
2.178PheArg: 2.178 ± 0.705
4.719PheSer: 4.719 ± 1.794
4.356PheThr: 4.356 ± 0.271
3.267PheVal: 3.267 ± 0.935
0.363PheTrp: 0.363 ± 0.212
1.452PheTyr: 1.452 ± 0.849
0.0PheXaa: 0.0 ± 0.0
Gly
6.897GlyAla: 6.897 ± 0.52
0.726GlyCys: 0.726 ± 0.145
2.904GlyAsp: 2.904 ± 0.56
3.267GlyGlu: 3.267 ± 0.935
2.541GlyPhe: 2.541 ± 0.222
4.356GlyGly: 4.356 ± 0.299
1.089GlyHis: 1.089 ± 0.068
3.267GlyIle: 3.267 ± 0.203
3.63GlyLys: 3.63 ± 0.723
3.267GlyLeu: 3.267 ± 0.366
1.815GlyMet: 1.815 ± 0.492
1.815GlyAsn: 1.815 ± 1.215
1.089GlyPro: 1.089 ± 0.068
2.178GlyGln: 2.178 ± 0.135
3.993GlyArg: 3.993 ± 0.628
2.904GlySer: 2.904 ± 1.148
7.985GlyThr: 7.985 ± 1.591
3.267GlyVal: 3.267 ± 0.772
0.0GlyTrp: 0.0 ± 0.0
2.904GlyTyr: 2.904 ± 0.009
0.0GlyXaa: 0.0 ± 0.0
His
1.452HisAla: 1.452 ± 0.858
0.363HisCys: 0.363 ± 0.212
0.363HisAsp: 0.363 ± 0.212
0.363HisGlu: 0.363 ± 0.212
0.726HisPhe: 0.726 ± 0.425
1.452HisGly: 1.452 ± 0.28
0.363HisHis: 0.363 ± 0.212
1.089HisIle: 1.089 ± 0.637
0.0HisLys: 0.0 ± 0.0
2.904HisLeu: 2.904 ± 0.56
0.363HisMet: 0.363 ± 0.212
1.452HisAsn: 1.452 ± 0.858
0.726HisPro: 0.726 ± 0.145
1.452HisGln: 1.452 ± 0.849
1.452HisArg: 1.452 ± 0.849
1.452HisSer: 1.452 ± 0.849
1.452HisThr: 1.452 ± 0.289
0.726HisVal: 0.726 ± 0.145
0.363HisTrp: 0.363 ± 0.212
0.726HisTyr: 0.726 ± 0.714
0.0HisXaa: 0.0 ± 0.0
Ile
4.356IleAla: 4.356 ± 0.84
1.452IleCys: 1.452 ± 0.849
2.178IleAsp: 2.178 ± 0.135
1.089IleGlu: 1.089 ± 0.068
2.541IlePhe: 2.541 ± 1.486
3.63IleGly: 3.63 ± 0.154
1.815IleHis: 1.815 ± 0.492
1.452IleIle: 1.452 ± 0.849
2.904IleLys: 2.904 ± 0.56
5.445IleLeu: 5.445 ± 1.477
2.904IleMet: 2.904 ± 0.009
2.541IleAsn: 2.541 ± 0.348
3.267IlePro: 3.267 ± 0.772
1.815IleGln: 1.815 ± 0.492
1.815IleArg: 1.815 ± 1.785
5.082IleSer: 5.082 ± 0.695
6.897IleThr: 6.897 ± 1.089
3.267IleVal: 3.267 ± 0.366
0.0IleTrp: 0.0 ± 0.0
0.726IleTyr: 0.726 ± 0.425
0.0IleXaa: 0.0 ± 0.0
Lys
2.178LysAla: 2.178 ± 1.274
0.726LysCys: 0.726 ± 0.145
2.904LysAsp: 2.904 ± 1.129
1.815LysGlu: 1.815 ± 1.061
1.089LysPhe: 1.089 ± 0.068
1.815LysGly: 1.815 ± 1.061
2.178LysHis: 2.178 ± 0.135
4.356LysIle: 4.356 ± 0.84
2.541LysLys: 2.541 ± 1.486
6.897LysLeu: 6.897 ± 0.049
1.452LysMet: 1.452 ± 0.28
1.089LysAsn: 1.089 ± 0.068
2.541LysPro: 2.541 ± 0.791
2.904LysGln: 2.904 ± 1.148
3.267LysArg: 3.267 ± 1.911
3.993LysSer: 3.993 ± 0.628
1.452LysThr: 1.452 ± 0.849
2.541LysVal: 2.541 ± 0.348
1.089LysTrp: 1.089 ± 0.502
1.452LysTyr: 1.452 ± 0.28
0.0LysXaa: 0.0 ± 0.0
Leu
7.623LeuAla: 7.623 ± 0.096
2.541LeuCys: 2.541 ± 0.348
3.63LeuAsp: 3.63 ± 0.985
5.082LeuGlu: 5.082 ± 0.126
3.267LeuPhe: 3.267 ± 1.911
3.993LeuGly: 3.993 ± 0.511
1.089LeuHis: 1.089 ± 0.068
5.082LeuIle: 5.082 ± 0.126
4.719LeuLys: 4.719 ± 1.052
8.711LeuLeu: 8.711 ± 0.541
1.089LeuMet: 1.089 ± 0.478
1.452LeuAsn: 1.452 ± 0.28
3.63LeuPro: 3.63 ± 0.154
1.815LeuGln: 1.815 ± 0.077
3.993LeuArg: 3.993 ± 0.058
7.26LeuSer: 7.26 ± 2.016
8.348LeuThr: 8.348 ± 2.037
7.26LeuVal: 7.26 ± 0.261
1.452LeuTrp: 1.452 ± 0.289
2.541LeuTyr: 2.541 ± 0.222
0.0LeuXaa: 0.0 ± 0.0
Met
3.267MetAla: 3.267 ± 1.341
1.089MetCys: 1.089 ± 0.637
1.452MetAsp: 1.452 ± 0.289
1.452MetGlu: 1.452 ± 0.289
1.089MetPhe: 1.089 ± 0.068
2.178MetGly: 2.178 ± 1.274
1.452MetHis: 1.452 ± 0.28
0.363MetIle: 0.363 ± 0.212
1.452MetLys: 1.452 ± 0.849
1.089MetLeu: 1.089 ± 0.068
0.363MetMet: 0.363 ± 0.212
1.452MetAsn: 1.452 ± 0.858
0.0MetPro: 0.0 ± 0.0
2.178MetGln: 2.178 ± 0.135
0.726MetArg: 0.726 ± 0.145
2.541MetSer: 2.541 ± 1.486
2.541MetThr: 2.541 ± 0.791
2.178MetVal: 2.178 ± 0.705
0.0MetTrp: 0.0 ± 0.0
0.726MetTyr: 0.726 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
3.993AsnAla: 3.993 ± 0.058
0.363AsnCys: 0.363 ± 0.357
2.178AsnAsp: 2.178 ± 0.705
2.541AsnGlu: 2.541 ± 1.929
2.178AsnPhe: 2.178 ± 1.003
2.904AsnGly: 2.904 ± 0.579
1.815AsnHis: 1.815 ± 0.492
1.815AsnIle: 1.815 ± 0.492
1.089AsnLys: 1.089 ± 0.068
4.356AsnLeu: 4.356 ± 0.271
0.363AsnMet: 0.363 ± 0.212
1.452AsnAsn: 1.452 ± 0.28
1.452AsnPro: 1.452 ± 0.289
3.267AsnGln: 3.267 ± 2.074
1.089AsnArg: 1.089 ± 0.068
2.904AsnSer: 2.904 ± 0.009
2.178AsnThr: 2.178 ± 0.434
4.719AsnVal: 4.719 ± 0.655
1.089AsnTrp: 1.089 ± 0.502
1.452AsnTyr: 1.452 ± 0.289
0.0AsnXaa: 0.0 ± 0.0
Pro
3.993ProAla: 3.993 ± 1.08
0.726ProCys: 0.726 ± 0.145
1.815ProAsp: 1.815 ± 1.785
1.089ProGlu: 1.089 ± 0.637
3.993ProPhe: 3.993 ± 0.511
3.267ProGly: 3.267 ± 0.935
0.0ProHis: 0.0 ± 0.0
2.541ProIle: 2.541 ± 0.348
0.726ProLys: 0.726 ± 0.145
7.26ProLeu: 7.26 ± 0.308
3.993ProMet: 3.993 ± 1.197
2.904ProAsn: 2.904 ± 0.579
3.267ProPro: 3.267 ± 2.074
1.452ProGln: 1.452 ± 0.858
1.089ProArg: 1.089 ± 1.071
3.267ProSer: 3.267 ± 0.366
4.356ProThr: 4.356 ± 2.006
3.267ProVal: 3.267 ± 0.366
0.726ProTrp: 0.726 ± 0.425
2.178ProTyr: 2.178 ± 1.572
0.0ProXaa: 0.0 ± 0.0
Gln
5.082GlnAla: 5.082 ± 0.126
0.726GlnCys: 0.726 ± 0.425
2.541GlnAsp: 2.541 ± 0.348
3.63GlnGlu: 3.63 ± 1.554
1.815GlnPhe: 1.815 ± 0.492
1.089GlnGly: 1.089 ± 0.068
0.363GlnHis: 0.363 ± 0.212
1.815GlnIle: 1.815 ± 0.077
2.904GlnLys: 2.904 ± 0.579
1.815GlnLeu: 1.815 ± 0.646
1.089GlnMet: 1.089 ± 0.068
1.452GlnAsn: 1.452 ± 0.289
1.815GlnPro: 1.815 ± 0.077
1.815GlnGln: 1.815 ± 1.061
2.541GlnArg: 2.541 ± 0.348
3.63GlnSer: 3.63 ± 1.292
1.089GlnThr: 1.089 ± 0.068
2.904GlnVal: 2.904 ± 1.148
0.0GlnTrp: 0.0 ± 0.0
1.815GlnTyr: 1.815 ± 0.646
0.0GlnXaa: 0.0 ± 0.0
Arg
2.178ArgAla: 2.178 ± 0.434
0.726ArgCys: 0.726 ± 0.145
2.541ArgAsp: 2.541 ± 0.348
3.267ArgGlu: 3.267 ± 1.911
2.178ArgPhe: 2.178 ± 0.434
2.178ArgGly: 2.178 ± 0.434
0.363ArgHis: 0.363 ± 0.212
2.541ArgIle: 2.541 ± 0.348
2.178ArgLys: 2.178 ± 0.135
4.719ArgLeu: 4.719 ± 0.483
1.089ArgMet: 1.089 ± 0.637
2.541ArgAsn: 2.541 ± 0.222
3.267ArgPro: 3.267 ± 0.935
2.541ArgGln: 2.541 ± 0.348
3.993ArgArg: 3.993 ± 1.197
3.63ArgSer: 3.63 ± 0.415
2.904ArgThr: 2.904 ± 0.56
5.445ArgVal: 5.445 ± 0.8
0.0ArgTrp: 0.0 ± 0.0
1.452ArgTyr: 1.452 ± 0.28
0.0ArgXaa: 0.0 ± 0.0
Ser
8.711SerAla: 8.711 ± 0.597
0.363SerCys: 0.363 ± 0.212
3.267SerAsp: 3.267 ± 0.203
2.904SerGlu: 2.904 ± 1.698
3.993SerPhe: 3.993 ± 0.058
7.26SerGly: 7.26 ± 2.016
1.815SerHis: 1.815 ± 0.077
4.719SerIle: 4.719 ± 0.086
2.904SerLys: 2.904 ± 1.148
3.267SerLeu: 3.267 ± 0.935
2.178SerMet: 2.178 ± 0.135
5.082SerAsn: 5.082 ± 1.012
3.63SerPro: 3.63 ± 0.154
2.178SerGln: 2.178 ± 0.705
3.993SerArg: 3.993 ± 0.628
7.623SerSer: 7.623 ± 1.803
6.897SerThr: 6.897 ± 2.797
3.993SerVal: 3.993 ± 0.511
0.726SerTrp: 0.726 ± 0.145
2.904SerTyr: 2.904 ± 1.148
0.0SerXaa: 0.0 ± 0.0
Thr
6.171ThrAla: 6.171 ± 0.945
0.363ThrCys: 0.363 ± 0.212
2.904ThrAsp: 2.904 ± 0.579
4.356ThrGlu: 4.356 ± 0.299
3.993ThrPhe: 3.993 ± 0.628
3.63ThrGly: 3.63 ± 0.723
2.178ThrHis: 2.178 ± 0.135
3.63ThrIle: 3.63 ± 0.415
5.445ThrLys: 5.445 ± 2.615
6.897ThrLeu: 6.897 ± 0.049
1.089ThrMet: 1.089 ± 0.068
4.719ThrAsn: 4.719 ± 1.225
5.808ThrPro: 5.808 ± 1.157
1.815ThrGln: 1.815 ± 0.077
4.356ThrArg: 4.356 ± 0.84
7.26ThrSer: 7.26 ± 3.154
5.808ThrThr: 5.808 ± 0.019
3.63ThrVal: 3.63 ± 0.154
0.0ThrTrp: 0.0 ± 0.0
1.452ThrTyr: 1.452 ± 0.28
0.0ThrXaa: 0.0 ± 0.0
Val
7.26ValAla: 7.26 ± 0.308
0.726ValCys: 0.726 ± 0.425
3.267ValAsp: 3.267 ± 0.935
2.904ValGlu: 2.904 ± 0.579
4.719ValPhe: 4.719 ± 1.225
2.541ValGly: 2.541 ± 0.791
0.726ValHis: 0.726 ± 0.714
2.904ValIle: 2.904 ± 0.009
3.993ValLys: 3.993 ± 1.197
4.719ValLeu: 4.719 ± 0.655
1.089ValMet: 1.089 ± 0.068
3.63ValAsn: 3.63 ± 0.985
3.267ValPro: 3.267 ± 0.935
1.815ValGln: 1.815 ± 0.077
3.993ValArg: 3.993 ± 1.766
3.993ValSer: 3.993 ± 1.08
4.356ValThr: 4.356 ± 0.271
3.63ValVal: 3.63 ± 0.723
0.726ValTrp: 0.726 ± 0.145
2.541ValTyr: 2.541 ± 0.348
0.0ValXaa: 0.0 ± 0.0
Trp
0.363TrpAla: 0.363 ± 0.357
0.0TrpCys: 0.0 ± 0.0
1.452TrpAsp: 1.452 ± 0.28
1.089TrpGlu: 1.089 ± 0.502
0.363TrpPhe: 0.363 ± 0.212
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
2.178TrpIle: 2.178 ± 0.705
0.0TrpLys: 0.0 ± 0.0
1.452TrpLeu: 1.452 ± 0.858
0.0TrpMet: 0.0 ± 0.0
0.363TrpAsn: 0.363 ± 0.357
0.726TrpPro: 0.726 ± 0.145
0.363TrpGln: 0.363 ± 0.212
0.363TrpArg: 0.363 ± 0.357
0.726TrpSer: 0.726 ± 0.145
1.089TrpThr: 1.089 ± 0.637
0.363TrpVal: 0.363 ± 0.212
0.0TrpTrp: 0.0 ± 0.0
0.726TrpTyr: 0.726 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.815TyrAla: 1.815 ± 0.077
0.363TyrCys: 0.363 ± 0.212
3.993TyrAsp: 3.993 ± 0.628
1.452TyrGlu: 1.452 ± 0.849
2.904TyrPhe: 2.904 ± 1.148
2.541TyrGly: 2.541 ± 0.791
0.0TyrHis: 0.0 ± 0.0
2.904TyrIle: 2.904 ± 1.717
2.178TyrLys: 2.178 ± 0.135
2.178TyrLeu: 2.178 ± 0.705
0.0TyrMet: 0.0 ± 0.0
1.815TyrAsn: 1.815 ± 0.646
1.089TyrPro: 1.089 ± 0.068
2.541TyrGln: 2.541 ± 0.917
1.815TyrArg: 1.815 ± 0.646
2.541TyrSer: 2.541 ± 0.791
1.452TyrThr: 1.452 ± 0.28
1.452TyrVal: 1.452 ± 0.858
0.363TyrTrp: 0.363 ± 0.357
0.726TyrTyr: 0.726 ± 0.425
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2756 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski