Amino acid dipepetide frequency for Hubei orthoptera virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.194AlaAla: 5.194 ± 0.221
1.222AlaCys: 1.222 ± 0.705
2.444AlaAsp: 2.444 ± 0.81
3.361AlaGlu: 3.361 ± 0.274
3.361AlaPhe: 3.361 ± 1.946
4.277AlaGly: 4.277 ± 1.417
2.139AlaHis: 2.139 ± 1.234
7.333AlaIle: 7.333 ± 0.21
5.194AlaLys: 5.194 ± 0.333
7.638AlaLeu: 7.638 ± 1.076
0.917AlaMet: 0.917 ± 0.026
3.361AlaAsn: 3.361 ± 0.274
3.972AlaPro: 3.972 ± 2.148
3.361AlaGln: 3.361 ± 1.391
3.666AlaArg: 3.666 ± 0.105
5.194AlaSer: 5.194 ± 0.221
4.583AlaThr: 4.583 ± 0.131
2.75AlaVal: 2.75 ± 0.079
0.917AlaTrp: 0.917 ± 0.581
2.139AlaTyr: 2.139 ± 1.234
0.0AlaXaa: 0.0 ± 0.0
Cys
1.222CysAla: 1.222 ± 0.15
0.0CysCys: 0.0 ± 0.0
1.222CysAsp: 1.222 ± 0.705
0.611CysGlu: 0.611 ± 0.352
0.0CysPhe: 0.0 ± 0.0
1.222CysGly: 1.222 ± 0.15
0.0CysHis: 0.0 ± 0.0
0.611CysIle: 0.611 ± 0.202
0.611CysLys: 0.611 ± 0.352
0.611CysLeu: 0.611 ± 0.352
0.306CysMet: 0.306 ± 0.379
0.0CysAsn: 0.0 ± 0.0
0.306CysPro: 0.306 ± 0.176
0.611CysGln: 0.611 ± 0.352
0.306CysArg: 0.306 ± 0.176
0.917CysSer: 0.917 ± 0.529
1.528CysThr: 1.528 ± 0.326
0.917CysVal: 0.917 ± 0.529
0.0CysTrp: 0.0 ± 0.0
0.611CysTyr: 0.611 ± 0.352
0.0CysXaa: 0.0 ± 0.0
Asp
3.666AspAla: 3.666 ± 1.56
0.306AspCys: 0.306 ± 0.176
3.972AspAsp: 3.972 ± 0.071
2.75AspGlu: 2.75 ± 1.031
4.888AspPhe: 4.888 ± 1.064
2.75AspGly: 2.75 ± 0.079
0.611AspHis: 0.611 ± 0.202
3.055AspIle: 3.055 ± 0.457
5.194AspLys: 5.194 ± 0.333
4.277AspLeu: 4.277 ± 0.862
0.917AspMet: 0.917 ± 0.026
2.139AspAsn: 2.139 ± 0.124
3.666AspPro: 3.666 ± 1.215
2.139AspGln: 2.139 ± 0.124
3.972AspArg: 3.972 ± 0.626
3.361AspSer: 3.361 ± 0.281
1.528AspThr: 1.528 ± 0.229
6.416AspVal: 6.416 ± 0.183
0.917AspTrp: 0.917 ± 0.026
1.833AspTyr: 1.833 ± 0.502
0.0AspXaa: 0.0 ± 0.0
Glu
4.583GluAla: 4.583 ± 0.424
0.0GluCys: 0.0 ± 0.0
1.222GluAsp: 1.222 ± 0.15
3.666GluGlu: 3.666 ± 0.105
3.055GluPhe: 3.055 ± 0.653
4.277GluGly: 4.277 ± 0.307
0.917GluHis: 0.917 ± 0.026
3.972GluIle: 3.972 ± 1.181
4.277GluLys: 4.277 ± 0.248
5.194GluLeu: 5.194 ± 0.776
0.917GluMet: 0.917 ± 0.026
3.361GluAsn: 3.361 ± 0.281
3.666GluPro: 3.666 ± 1.769
1.222GluGln: 1.222 ± 0.705
2.75GluArg: 2.75 ± 0.079
1.528GluSer: 1.528 ± 0.881
2.444GluThr: 2.444 ± 0.255
3.361GluVal: 3.361 ± 0.274
0.0GluTrp: 0.0 ± 0.0
2.139GluTyr: 2.139 ± 1.234
0.0GluXaa: 0.0 ± 0.0
Phe
2.75PheAla: 2.75 ± 0.476
0.917PheCys: 0.917 ± 0.529
3.666PheAsp: 3.666 ± 0.105
3.972PheGlu: 3.972 ± 1.181
2.75PhePhe: 2.75 ± 0.079
3.666PheGly: 3.666 ± 0.105
2.444PheHis: 2.444 ± 0.3
2.444PheIle: 2.444 ± 0.855
2.139PheLys: 2.139 ± 1.234
3.972PheLeu: 3.972 ± 1.181
1.528PheMet: 1.528 ± 0.229
3.361PheAsn: 3.361 ± 0.281
1.222PhePro: 1.222 ± 0.405
4.583PheGln: 4.583 ± 2.089
3.972PheArg: 3.972 ± 0.483
3.972PheSer: 3.972 ± 0.071
4.583PheThr: 4.583 ± 0.131
4.888PheVal: 4.888 ± 0.045
0.611PheTrp: 0.611 ± 0.202
2.444PheTyr: 2.444 ± 0.255
0.0PheXaa: 0.0 ± 0.0
Gly
4.583GlyAla: 4.583 ± 1.796
0.611GlyCys: 0.611 ± 0.352
3.055GlyAsp: 3.055 ± 0.457
2.139GlyGlu: 2.139 ± 1.541
1.833GlyPhe: 1.833 ± 0.052
4.277GlyGly: 4.277 ± 2.527
1.833GlyHis: 1.833 ± 1.057
3.361GlyIle: 3.361 ± 0.836
3.666GlyLys: 3.666 ± 1.56
3.666GlyLeu: 3.666 ± 1.769
2.139GlyMet: 2.139 ± 0.986
4.277GlyAsn: 4.277 ± 0.307
2.139GlyPro: 2.139 ± 0.124
0.917GlyGln: 0.917 ± 0.026
2.139GlyArg: 2.139 ± 1.234
4.277GlySer: 4.277 ± 0.862
4.277GlyThr: 4.277 ± 1.417
3.666GlyVal: 3.666 ± 1.005
0.917GlyTrp: 0.917 ± 0.581
3.666GlyTyr: 3.666 ± 0.66
0.0GlyXaa: 0.0 ± 0.0
His
0.306HisAla: 0.306 ± 0.176
0.0HisCys: 0.0 ± 0.0
0.611HisAsp: 0.611 ± 0.352
0.611HisGlu: 0.611 ± 0.352
3.361HisPhe: 3.361 ± 1.938
0.611HisGly: 0.611 ± 0.352
0.306HisHis: 0.306 ± 0.176
1.833HisIle: 1.833 ± 0.052
0.917HisLys: 0.917 ± 0.529
2.139HisLeu: 2.139 ± 1.234
0.306HisMet: 0.306 ± 0.176
1.222HisAsn: 1.222 ± 0.705
0.917HisPro: 0.917 ± 0.581
0.611HisGln: 0.611 ± 0.202
1.222HisArg: 1.222 ± 0.705
1.528HisSer: 1.528 ± 0.229
1.222HisThr: 1.222 ± 0.15
0.917HisVal: 0.917 ± 0.026
0.0HisTrp: 0.0 ± 0.0
1.222HisTyr: 1.222 ± 0.705
0.0HisXaa: 0.0 ± 0.0
Ile
4.277IleAla: 4.277 ± 0.248
1.222IleCys: 1.222 ± 0.705
3.361IleAsp: 3.361 ± 0.836
2.139IleGlu: 2.139 ± 1.234
2.75IlePhe: 2.75 ± 0.476
4.888IleGly: 4.888 ± 0.045
0.306IleHis: 0.306 ± 0.176
3.972IleIle: 3.972 ± 1.736
4.277IleLys: 4.277 ± 1.912
5.194IleLeu: 5.194 ± 0.776
1.833IleMet: 1.833 ± 0.052
3.666IleAsn: 3.666 ± 0.66
4.277IlePro: 4.277 ± 0.307
2.75IleGln: 2.75 ± 0.079
3.361IleArg: 3.361 ± 0.281
6.111IleSer: 6.111 ± 0.75
2.139IleThr: 2.139 ± 0.679
3.666IleVal: 3.666 ± 0.45
0.611IleTrp: 0.611 ± 0.352
3.055IleTyr: 3.055 ± 0.098
0.0IleXaa: 0.0 ± 0.0
Lys
3.361LysAla: 3.361 ± 0.281
0.0LysCys: 0.0 ± 0.0
2.75LysAsp: 2.75 ± 0.476
3.361LysGlu: 3.361 ± 1.384
4.277LysPhe: 4.277 ± 1.912
2.444LysGly: 2.444 ± 0.3
1.833LysHis: 1.833 ± 1.057
5.194LysIle: 5.194 ± 2.441
3.361LysLys: 3.361 ± 0.829
7.027LysLeu: 7.027 ± 2.943
1.528LysMet: 1.528 ± 0.326
3.361LysAsn: 3.361 ± 0.274
1.833LysPro: 1.833 ± 0.052
3.972LysGln: 3.972 ± 1.181
2.444LysArg: 2.444 ± 0.3
3.055LysSer: 3.055 ± 0.653
3.666LysThr: 3.666 ± 0.45
3.666LysVal: 3.666 ± 1.005
0.611LysTrp: 0.611 ± 0.757
2.444LysTyr: 2.444 ± 0.855
0.0LysXaa: 0.0 ± 0.0
Leu
8.249LeuAla: 8.249 ± 0.319
1.222LeuCys: 1.222 ± 0.15
5.194LeuAsp: 5.194 ± 0.221
4.277LeuGlu: 4.277 ± 0.248
4.277LeuPhe: 4.277 ± 1.912
5.5LeuGly: 5.5 ± 0.712
0.917LeuHis: 0.917 ± 0.529
4.888LeuIle: 4.888 ± 0.045
5.194LeuLys: 5.194 ± 0.776
6.416LeuLeu: 6.416 ± 0.372
1.222LeuMet: 1.222 ± 0.53
7.027LeuAsn: 7.027 ± 1.279
6.111LeuPro: 6.111 ± 2.024
2.75LeuGln: 2.75 ± 0.079
3.666LeuArg: 3.666 ± 1.215
3.666LeuSer: 3.666 ± 0.45
4.277LeuThr: 4.277 ± 0.307
2.139LeuVal: 2.139 ± 1.234
0.917LeuTrp: 0.917 ± 0.529
3.055LeuTyr: 3.055 ± 0.098
0.0LeuXaa: 0.0 ± 0.0
Met
3.361MetAla: 3.361 ± 0.281
0.0MetCys: 0.0 ± 0.0
1.528MetAsp: 1.528 ± 0.326
2.444MetGlu: 2.444 ± 1.365
0.306MetPhe: 0.306 ± 0.176
1.528MetGly: 1.528 ± 0.326
0.0MetHis: 0.0 ± 0.0
1.833MetIle: 1.833 ± 0.502
0.917MetLys: 0.917 ± 0.529
1.222MetLeu: 1.222 ± 0.705
0.0MetMet: 0.0 ± 0.0
0.917MetAsn: 0.917 ± 0.581
1.833MetPro: 1.833 ± 0.052
0.611MetGln: 0.611 ± 0.757
1.528MetArg: 1.528 ± 0.783
1.222MetSer: 1.222 ± 0.405
1.222MetThr: 1.222 ± 0.405
0.611MetVal: 0.611 ± 0.202
0.306MetTrp: 0.306 ± 0.176
0.611MetTyr: 0.611 ± 0.352
0.0MetXaa: 0.0 ± 0.0
Asn
4.583AsnAla: 4.583 ± 1.796
0.917AsnCys: 0.917 ± 0.529
3.972AsnAsp: 3.972 ± 0.483
3.361AsnGlu: 3.361 ± 0.274
4.277AsnPhe: 4.277 ± 0.248
3.055AsnGly: 3.055 ± 0.457
1.833AsnHis: 1.833 ± 0.607
3.361AsnIle: 3.361 ± 1.384
1.833AsnLys: 1.833 ± 0.052
4.888AsnLeu: 4.888 ± 1.155
1.528AsnMet: 1.528 ± 0.229
3.361AsnAsn: 3.361 ± 0.281
3.666AsnPro: 3.666 ± 0.66
1.222AsnGln: 1.222 ± 0.15
2.75AsnArg: 2.75 ± 0.633
3.972AsnSer: 3.972 ± 1.181
2.75AsnThr: 2.75 ± 1.188
3.361AsnVal: 3.361 ± 0.274
0.0AsnTrp: 0.0 ± 0.0
2.75AsnTyr: 2.75 ± 0.079
0.0AsnXaa: 0.0 ± 0.0
Pro
5.805ProAla: 5.805 ± 1.646
0.917ProCys: 0.917 ± 0.026
3.972ProAsp: 3.972 ± 0.483
1.528ProGlu: 1.528 ± 1.338
3.666ProPhe: 3.666 ± 0.66
2.444ProGly: 2.444 ± 0.81
0.611ProHis: 0.611 ± 0.352
2.139ProIle: 2.139 ± 0.431
3.055ProLys: 3.055 ± 1.207
5.805ProLeu: 5.805 ± 0.536
1.222ProMet: 1.222 ± 0.96
3.666ProAsn: 3.666 ± 1.005
3.361ProPro: 3.361 ± 3.055
3.361ProGln: 3.361 ± 2.5
1.528ProArg: 1.528 ± 0.783
4.583ProSer: 4.583 ± 3.46
3.972ProThr: 3.972 ± 2.703
2.139ProVal: 2.139 ± 0.431
0.611ProTrp: 0.611 ± 0.202
1.222ProTyr: 1.222 ± 0.15
0.0ProXaa: 0.0 ± 0.0
Gln
2.75GlnAla: 2.75 ± 0.079
0.0GlnCys: 0.0 ± 0.0
2.75GlnAsp: 2.75 ± 1.031
2.139GlnGlu: 2.139 ± 0.124
4.583GlnPhe: 4.583 ± 0.424
1.222GlnGly: 1.222 ± 0.15
0.306GlnHis: 0.306 ± 0.176
1.833GlnIle: 1.833 ± 0.502
3.361GlnLys: 3.361 ± 1.384
3.361GlnLeu: 3.361 ± 1.938
1.222GlnMet: 1.222 ± 0.114
3.055GlnAsn: 3.055 ± 1.012
3.666GlnPro: 3.666 ± 2.324
2.444GlnGln: 2.444 ± 1.919
2.444GlnArg: 2.444 ± 1.919
2.444GlnSer: 2.444 ± 0.255
3.055GlnThr: 3.055 ± 1.567
2.139GlnVal: 2.139 ± 0.431
0.917GlnTrp: 0.917 ± 0.026
1.528GlnTyr: 1.528 ± 0.783
0.0GlnXaa: 0.0 ± 0.0
Arg
1.222ArgAla: 1.222 ± 0.15
0.306ArgCys: 0.306 ± 0.176
3.361ArgAsp: 3.361 ± 1.391
3.361ArgGlu: 3.361 ± 0.274
3.972ArgPhe: 3.972 ± 0.483
3.055ArgGly: 3.055 ± 2.122
0.917ArgHis: 0.917 ± 0.529
3.972ArgIle: 3.972 ± 0.626
4.888ArgLys: 4.888 ± 1.71
4.888ArgLeu: 4.888 ± 2.174
0.306ArgMet: 0.306 ± 0.176
2.75ArgAsn: 2.75 ± 1.188
1.222ArgPro: 1.222 ± 0.405
2.444ArgGln: 2.444 ± 0.81
2.75ArgArg: 2.75 ± 0.079
2.444ArgSer: 2.444 ± 0.255
3.055ArgThr: 3.055 ± 0.457
2.75ArgVal: 2.75 ± 1.586
1.222ArgTrp: 1.222 ± 0.15
0.917ArgTyr: 0.917 ± 0.529
0.0ArgXaa: 0.0 ± 0.0
Ser
4.888SerAla: 4.888 ± 1.064
1.222SerCys: 1.222 ± 0.15
4.583SerAsp: 4.583 ± 0.424
3.055SerGlu: 3.055 ± 0.457
4.583SerPhe: 4.583 ± 1.534
3.361SerGly: 3.361 ± 0.274
0.917SerHis: 0.917 ± 0.529
5.5SerIle: 5.5 ± 0.712
5.194SerLys: 5.194 ± 1.886
5.5SerLeu: 5.5 ± 2.377
1.833SerMet: 1.833 ± 1.057
2.75SerAsn: 2.75 ± 0.633
3.972SerPro: 3.972 ± 0.483
1.833SerGln: 1.833 ± 0.052
2.444SerArg: 2.444 ± 0.3
4.888SerSer: 4.888 ± 0.51
2.444SerThr: 2.444 ± 1.919
3.972SerVal: 3.972 ± 0.483
0.917SerTrp: 0.917 ± 0.581
1.833SerTyr: 1.833 ± 0.502
0.0SerXaa: 0.0 ± 0.0
Thr
4.277ThrAla: 4.277 ± 0.862
1.528ThrCys: 1.528 ± 0.881
2.75ThrAsp: 2.75 ± 0.079
2.444ThrGlu: 2.444 ± 0.81
3.361ThrPhe: 3.361 ± 0.281
3.666ThrGly: 3.666 ± 1.769
1.222ThrHis: 1.222 ± 0.705
2.75ThrIle: 2.75 ± 0.079
0.917ThrLys: 0.917 ± 0.026
3.361ThrLeu: 3.361 ± 0.836
0.917ThrMet: 0.917 ± 0.026
3.055ThrAsn: 3.055 ± 0.098
4.277ThrPro: 4.277 ± 1.417
3.055ThrGln: 3.055 ± 0.457
2.75ThrArg: 2.75 ± 0.633
5.194ThrSer: 5.194 ± 0.333
3.666ThrThr: 3.666 ± 2.879
4.888ThrVal: 4.888 ± 0.51
0.306ThrTrp: 0.306 ± 0.379
1.833ThrTyr: 1.833 ± 0.052
0.0ThrXaa: 0.0 ± 0.0
Val
3.361ValAla: 3.361 ± 0.281
0.611ValCys: 0.611 ± 0.202
3.972ValAsp: 3.972 ± 0.626
5.805ValGlu: 5.805 ± 2.239
2.75ValPhe: 2.75 ± 0.476
2.75ValGly: 2.75 ± 1.031
1.528ValHis: 1.528 ± 0.326
3.972ValIle: 3.972 ± 0.071
1.222ValLys: 1.222 ± 0.705
3.361ValLeu: 3.361 ± 0.274
1.528ValMet: 1.528 ± 0.229
3.666ValAsn: 3.666 ± 0.105
1.833ValPro: 1.833 ± 0.052
3.972ValGln: 3.972 ± 0.483
4.277ValArg: 4.277 ± 0.862
4.277ValSer: 4.277 ± 0.862
3.361ValThr: 3.361 ± 1.384
3.972ValVal: 3.972 ± 0.626
0.306ValTrp: 0.306 ± 0.176
3.361ValTyr: 3.361 ± 0.836
0.0ValXaa: 0.0 ± 0.0
Trp
1.222TrpAla: 1.222 ± 0.15
0.0TrpCys: 0.0 ± 0.0
1.222TrpAsp: 1.222 ± 0.15
0.306TrpGlu: 0.306 ± 0.176
0.611TrpPhe: 0.611 ± 0.202
0.0TrpGly: 0.0 ± 0.0
0.306TrpHis: 0.306 ± 0.176
1.222TrpIle: 1.222 ± 0.705
0.611TrpLys: 0.611 ± 0.352
0.917TrpLeu: 0.917 ± 0.581
0.917TrpMet: 0.917 ± 1.136
0.306TrpAsn: 0.306 ± 0.379
1.222TrpPro: 1.222 ± 0.96
0.611TrpGln: 0.611 ± 0.202
0.306TrpArg: 0.306 ± 0.176
0.611TrpSer: 0.611 ± 0.352
0.306TrpThr: 0.306 ± 0.176
0.306TrpVal: 0.306 ± 0.379
0.306TrpTrp: 0.306 ± 0.176
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.361TyrAla: 3.361 ± 0.274
0.611TyrCys: 0.611 ± 0.202
2.75TyrAsp: 2.75 ± 0.633
1.222TyrGlu: 1.222 ± 0.15
1.222TyrPhe: 1.222 ± 0.705
2.444TyrGly: 2.444 ± 0.81
0.917TyrHis: 0.917 ± 0.026
0.611TyrIle: 0.611 ± 0.352
3.361TyrLys: 3.361 ± 0.829
1.833TyrLeu: 1.833 ± 0.502
0.611TyrMet: 0.611 ± 0.352
2.139TyrAsn: 2.139 ± 0.679
2.444TyrPro: 2.444 ± 0.3
2.75TyrGln: 2.75 ± 0.079
1.528TyrArg: 1.528 ± 0.881
2.444TyrSer: 2.444 ± 0.255
2.139TyrThr: 2.139 ± 0.679
3.361TyrVal: 3.361 ± 0.281
0.917TyrTrp: 0.917 ± 0.529
2.444TyrTyr: 2.444 ± 0.3
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3274 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski