Amino acid dipepetide frequency for Beihai hermit crab virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.061AlaAla: 7.061 ± 1.059
2.601AlaCys: 2.601 ± 0.002
3.716AlaAsp: 3.716 ± 0.571
4.831AlaGlu: 4.831 ± 1.144
2.973AlaPhe: 2.973 ± 0.481
4.831AlaGly: 4.831 ± 0.866
1.486AlaHis: 1.486 ± 0.576
4.459AlaIle: 4.459 ± 0.387
3.716AlaLys: 3.716 ± 0.769
4.831AlaLeu: 4.831 ± 1.144
2.23AlaMet: 2.23 ± 0.641
4.831AlaAsn: 4.831 ± 3.546
2.23AlaPro: 2.23 ± 0.477
3.344AlaGln: 3.344 ± 0.38
4.088AlaArg: 4.088 ± 0.762
7.061AlaSer: 7.061 ± 1.059
5.574AlaThr: 5.574 ± 1.154
5.203AlaVal: 5.203 ± 2.005
0.743AlaTrp: 0.743 ± 0.288
2.23AlaTyr: 2.23 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
1.486CysAla: 1.486 ± 0.094
0.743CysCys: 0.743 ± 0.382
1.858CysAsp: 1.858 ± 0.955
1.115CysGlu: 1.115 ± 0.767
0.743CysPhe: 0.743 ± 0.382
2.23CysGly: 2.23 ± 0.193
0.372CysHis: 0.372 ± 0.191
0.372CysIle: 0.372 ± 0.191
1.115CysLys: 1.115 ± 0.097
2.23CysLeu: 2.23 ± 0.477
0.372CysMet: 0.372 ± 0.191
0.743CysAsn: 0.743 ± 0.382
0.743CysPro: 0.743 ± 0.288
0.372CysGln: 0.372 ± 0.191
0.372CysArg: 0.372 ± 0.191
1.115CysSer: 1.115 ± 0.097
0.743CysThr: 0.743 ± 0.382
1.486CysVal: 1.486 ± 0.094
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.317AspAla: 6.317 ± 1.239
1.115AspCys: 1.115 ± 0.097
4.831AspAsp: 4.831 ± 1.814
3.716AspGlu: 3.716 ± 0.571
2.601AspPhe: 2.601 ± 0.672
2.23AspGly: 2.23 ± 0.477
2.23AspHis: 2.23 ± 1.147
3.344AspIle: 3.344 ± 0.29
4.459AspLys: 4.459 ± 1.623
5.203AspLeu: 5.203 ± 0.665
1.115AspMet: 1.115 ± 0.685
1.486AspAsn: 1.486 ± 0.094
2.601AspPro: 2.601 ± 1.342
3.344AspGln: 3.344 ± 0.38
0.743AspArg: 0.743 ± 0.382
4.459AspSer: 4.459 ± 0.953
2.601AspThr: 2.601 ± 0.002
4.459AspVal: 4.459 ± 0.387
2.23AspTrp: 2.23 ± 0.193
2.23AspTyr: 2.23 ± 0.863
0.0AspXaa: 0.0 ± 0.0
Glu
4.459GluAla: 4.459 ± 0.953
0.743GluCys: 0.743 ± 0.382
4.459GluAsp: 4.459 ± 0.387
4.088GluGlu: 4.088 ± 1.432
3.716GluPhe: 3.716 ± 0.099
2.23GluGly: 2.23 ± 0.477
1.486GluHis: 1.486 ± 0.764
4.459GluIle: 4.459 ± 0.953
3.344GluLys: 3.344 ± 1.05
4.459GluLeu: 4.459 ± 1.727
1.115GluMet: 1.115 ± 0.573
2.973GluAsn: 2.973 ± 0.189
1.115GluPro: 1.115 ± 0.097
2.601GluGln: 2.601 ± 0.002
1.858GluArg: 1.858 ± 0.955
3.344GluSer: 3.344 ± 0.29
2.23GluThr: 2.23 ± 1.147
4.088GluVal: 4.088 ± 0.092
0.372GluTrp: 0.372 ± 0.191
2.601GluTyr: 2.601 ± 1.342
0.0GluXaa: 0.0 ± 0.0
Phe
1.858PheAla: 1.858 ± 1.055
0.743PheCys: 0.743 ± 0.382
4.088PheAsp: 4.088 ± 0.762
2.601PheGlu: 2.601 ± 0.668
3.344PhePhe: 3.344 ± 1.72
1.858PheGly: 1.858 ± 0.385
0.743PheHis: 0.743 ± 0.382
1.858PheIle: 1.858 ± 0.385
0.743PheLys: 0.743 ± 0.382
4.459PheLeu: 4.459 ± 0.953
0.372PheMet: 0.372 ± 0.191
1.486PheAsn: 1.486 ± 0.764
1.115PhePro: 1.115 ± 0.767
2.23PheGln: 2.23 ± 0.477
2.601PheArg: 2.601 ± 0.668
2.973PheSer: 2.973 ± 1.821
3.344PheThr: 3.344 ± 0.96
2.973PheVal: 2.973 ± 1.151
1.115PheTrp: 1.115 ± 0.573
2.601PheTyr: 2.601 ± 0.002
0.0PheXaa: 0.0 ± 0.0
Gly
6.317GlyAla: 6.317 ± 0.569
1.115GlyCys: 1.115 ± 0.573
3.716GlyAsp: 3.716 ± 1.241
2.973GlyGlu: 2.973 ± 1.151
2.973GlyPhe: 2.973 ± 1.151
3.716GlyGly: 3.716 ± 1.439
0.743GlyHis: 0.743 ± 0.288
1.486GlyIle: 1.486 ± 0.764
2.973GlyLys: 2.973 ± 0.859
4.831GlyLeu: 4.831 ± 0.196
2.23GlyMet: 2.23 ± 1.147
3.344GlyAsn: 3.344 ± 0.96
3.344GlyPro: 3.344 ± 0.38
0.743GlyGln: 0.743 ± 0.288
4.459GlyArg: 4.459 ± 1.057
6.317GlySer: 6.317 ± 2.781
3.344GlyThr: 3.344 ± 1.63
8.175GlyVal: 8.175 ± 0.184
0.372GlyTrp: 0.372 ± 0.479
2.23GlyTyr: 2.23 ± 0.193
0.0GlyXaa: 0.0 ± 0.0
His
2.23HisAla: 2.23 ± 0.193
0.0HisCys: 0.0 ± 0.0
1.115HisAsp: 1.115 ± 0.097
1.115HisGlu: 1.115 ± 0.097
1.858HisPhe: 1.858 ± 0.385
1.115HisGly: 1.115 ± 0.573
0.372HisHis: 0.372 ± 0.479
1.858HisIle: 1.858 ± 0.955
1.115HisLys: 1.115 ± 0.573
2.23HisLeu: 2.23 ± 0.477
0.743HisMet: 0.743 ± 0.382
0.372HisAsn: 0.372 ± 0.191
2.973HisPro: 2.973 ± 0.859
0.372HisGln: 0.372 ± 0.191
0.372HisArg: 0.372 ± 0.479
0.743HisSer: 0.743 ± 0.288
0.743HisThr: 0.743 ± 0.382
1.115HisVal: 1.115 ± 0.767
0.372HisTrp: 0.372 ± 0.191
2.23HisTyr: 2.23 ± 0.477
0.0HisXaa: 0.0 ± 0.0
Ile
4.831IleAla: 4.831 ± 0.196
0.372IleCys: 0.372 ± 0.479
3.716IleAsp: 3.716 ± 1.911
3.344IleGlu: 3.344 ± 0.96
0.743IlePhe: 0.743 ± 0.382
4.459IleGly: 4.459 ± 0.283
0.372IleHis: 0.372 ± 0.479
2.973IleIle: 2.973 ± 1.151
2.23IleLys: 2.23 ± 1.147
5.203IleLeu: 5.203 ± 1.335
1.115IleMet: 1.115 ± 0.573
1.486IleAsn: 1.486 ± 0.576
1.858IlePro: 1.858 ± 0.285
1.115IleGln: 1.115 ± 0.573
2.973IleArg: 2.973 ± 1.529
4.831IleSer: 4.831 ± 1.536
3.344IleThr: 3.344 ± 0.38
2.973IleVal: 2.973 ± 0.859
0.0IleTrp: 0.0 ± 0.0
1.115IleTyr: 1.115 ± 0.573
0.0IleXaa: 0.0 ± 0.0
Lys
2.973LysAla: 2.973 ± 0.189
0.743LysCys: 0.743 ± 0.382
1.486LysAsp: 1.486 ± 0.764
2.23LysGlu: 2.23 ± 1.147
2.973LysPhe: 2.973 ± 0.859
4.459LysGly: 4.459 ± 0.387
1.858LysHis: 1.858 ± 0.955
2.23LysIle: 2.23 ± 1.147
5.574LysLys: 5.574 ± 2.866
4.088LysLeu: 4.088 ± 0.092
0.372LysMet: 0.372 ± 0.191
2.23LysAsn: 2.23 ± 0.477
4.459LysPro: 4.459 ± 1.623
1.115LysGln: 1.115 ± 0.573
3.716LysArg: 3.716 ± 1.241
2.23LysSer: 2.23 ± 0.863
1.858LysThr: 1.858 ± 0.285
5.203LysVal: 5.203 ± 1.335
1.115LysTrp: 1.115 ± 0.573
3.344LysTyr: 3.344 ± 0.38
0.0LysXaa: 0.0 ± 0.0
Leu
5.203LeuAla: 5.203 ± 1.345
1.858LeuCys: 1.858 ± 0.285
3.716LeuAsp: 3.716 ± 0.099
3.716LeuGlu: 3.716 ± 1.241
3.344LeuPhe: 3.344 ± 1.05
5.574LeuGly: 5.574 ± 0.186
2.23LeuHis: 2.23 ± 0.193
2.973LeuIle: 2.973 ± 0.859
4.088LeuLys: 4.088 ± 0.578
3.716LeuLeu: 3.716 ± 1.439
1.486LeuMet: 1.486 ± 0.764
3.716LeuAsn: 3.716 ± 0.769
5.203LeuPro: 5.203 ± 0.675
1.858LeuGln: 1.858 ± 0.285
3.344LeuArg: 3.344 ± 0.38
5.203LeuSer: 5.203 ± 0.665
5.574LeuThr: 5.574 ± 1.824
6.689LeuVal: 6.689 ± 0.09
0.743LeuTrp: 0.743 ± 0.382
2.973LeuTyr: 2.973 ± 0.189
0.0LeuXaa: 0.0 ± 0.0
Met
3.716MetAla: 3.716 ± 0.571
0.372MetCys: 0.372 ± 0.191
0.372MetAsp: 0.372 ± 0.191
1.115MetGlu: 1.115 ± 0.573
0.743MetPhe: 0.743 ± 0.288
2.973MetGly: 2.973 ± 0.859
0.0MetHis: 0.0 ± 0.0
1.486MetIle: 1.486 ± 0.764
1.858MetLys: 1.858 ± 0.385
0.372MetLeu: 0.372 ± 0.191
0.743MetMet: 0.743 ± 0.288
2.23MetAsn: 2.23 ± 0.477
1.115MetPro: 1.115 ± 0.767
1.115MetGln: 1.115 ± 0.097
1.115MetArg: 1.115 ± 0.767
0.743MetSer: 0.743 ± 0.382
2.973MetThr: 2.973 ± 0.189
1.486MetVal: 1.486 ± 0.094
0.743MetTrp: 0.743 ± 0.288
1.115MetTyr: 1.115 ± 0.573
0.0MetXaa: 0.0 ± 0.0
Asn
3.716AsnAla: 3.716 ± 0.571
0.372AsnCys: 0.372 ± 0.479
1.486AsnAsp: 1.486 ± 0.094
1.486AsnGlu: 1.486 ± 0.576
1.115AsnPhe: 1.115 ± 0.767
2.973AsnGly: 2.973 ± 2.491
1.115AsnHis: 1.115 ± 0.573
2.23AsnIle: 2.23 ± 0.193
2.973AsnLys: 2.973 ± 1.529
3.716AsnLeu: 3.716 ± 0.571
1.858AsnMet: 1.858 ± 0.285
2.23AsnAsn: 2.23 ± 0.477
3.716AsnPro: 3.716 ± 0.769
1.115AsnGln: 1.115 ± 0.097
2.973AsnArg: 2.973 ± 0.189
5.574AsnSer: 5.574 ± 0.186
2.973AsnThr: 2.973 ± 2.491
2.23AsnVal: 2.23 ± 0.477
0.743AsnTrp: 0.743 ± 0.288
1.486AsnTyr: 1.486 ± 1.246
0.0AsnXaa: 0.0 ± 0.0
Pro
3.716ProAla: 3.716 ± 0.099
1.486ProCys: 1.486 ± 0.576
2.601ProAsp: 2.601 ± 1.338
2.973ProGlu: 2.973 ± 0.481
2.973ProPhe: 2.973 ± 1.151
2.601ProGly: 2.601 ± 0.672
1.486ProHis: 1.486 ± 0.094
2.601ProIle: 2.601 ± 0.668
2.973ProLys: 2.973 ± 0.859
4.088ProLeu: 4.088 ± 0.762
1.486ProMet: 1.486 ± 1.246
2.601ProAsn: 2.601 ± 1.342
3.344ProPro: 3.344 ± 0.29
1.486ProGln: 1.486 ± 0.764
2.23ProArg: 2.23 ± 0.477
2.973ProSer: 2.973 ± 0.189
2.973ProThr: 2.973 ± 1.821
4.088ProVal: 4.088 ± 1.918
1.115ProTrp: 1.115 ± 0.097
1.858ProTyr: 1.858 ± 0.385
0.0ProXaa: 0.0 ± 0.0
Gln
0.743GlnAla: 0.743 ± 0.958
0.0GlnCys: 0.0 ± 0.0
2.23GlnAsp: 2.23 ± 0.477
1.115GlnGlu: 1.115 ± 0.573
2.23GlnPhe: 2.23 ± 0.477
2.23GlnGly: 2.23 ± 1.147
1.115GlnHis: 1.115 ± 0.573
1.486GlnIle: 1.486 ± 0.094
1.858GlnLys: 1.858 ± 0.285
2.601GlnLeu: 2.601 ± 1.338
1.858GlnMet: 1.858 ± 0.285
0.743GlnAsn: 0.743 ± 0.382
1.115GlnPro: 1.115 ± 0.097
2.23GlnGln: 2.23 ± 0.193
2.601GlnArg: 2.601 ± 0.668
1.115GlnSer: 1.115 ± 0.097
1.858GlnThr: 1.858 ± 0.285
3.344GlnVal: 3.344 ± 0.96
1.115GlnTrp: 1.115 ± 0.097
0.743GlnTyr: 0.743 ± 0.382
0.0GlnXaa: 0.0 ± 0.0
Arg
3.344ArgAla: 3.344 ± 0.29
1.115ArgCys: 1.115 ± 0.573
2.601ArgAsp: 2.601 ± 0.668
4.831ArgGlu: 4.831 ± 1.144
1.115ArgPhe: 1.115 ± 0.767
3.716ArgGly: 3.716 ± 0.099
1.858ArgHis: 1.858 ± 0.955
3.344ArgIle: 3.344 ± 0.38
3.344ArgLys: 3.344 ± 1.72
3.716ArgLeu: 3.716 ± 0.769
1.115ArgMet: 1.115 ± 0.573
2.973ArgAsn: 2.973 ± 1.151
2.601ArgPro: 2.601 ± 0.002
1.115ArgGln: 1.115 ± 0.573
6.317ArgArg: 6.317 ± 1.909
2.601ArgSer: 2.601 ± 1.338
4.088ArgThr: 4.088 ± 0.092
4.459ArgVal: 4.459 ± 0.387
0.743ArgTrp: 0.743 ± 0.288
1.858ArgTyr: 1.858 ± 1.055
0.0ArgXaa: 0.0 ± 0.0
Ser
4.459SerAla: 4.459 ± 1.057
1.115SerCys: 1.115 ± 0.573
4.088SerAsp: 4.088 ± 0.762
4.831SerGlu: 4.831 ± 2.206
2.973SerPhe: 2.973 ± 0.189
5.203SerGly: 5.203 ± 0.675
0.372SerHis: 0.372 ± 0.191
4.088SerIle: 4.088 ± 0.578
3.344SerLys: 3.344 ± 0.96
4.459SerLeu: 4.459 ± 0.387
1.115SerMet: 1.115 ± 0.097
4.831SerAsn: 4.831 ± 0.474
4.459SerPro: 4.459 ± 1.057
2.601SerGln: 2.601 ± 0.672
1.858SerArg: 1.858 ± 0.385
6.317SerSer: 6.317 ± 0.771
5.574SerThr: 5.574 ± 1.824
5.574SerVal: 5.574 ± 0.186
1.486SerTrp: 1.486 ± 0.094
2.23SerTyr: 2.23 ± 1.533
0.0SerXaa: 0.0 ± 0.0
Thr
4.459ThrAla: 4.459 ± 1.057
0.743ThrCys: 0.743 ± 0.382
4.831ThrAsp: 4.831 ± 2.876
2.973ThrGlu: 2.973 ± 0.189
1.858ThrPhe: 1.858 ± 0.955
4.088ThrGly: 4.088 ± 1.248
0.743ThrHis: 0.743 ± 0.382
2.601ThrIle: 2.601 ± 0.672
2.973ThrLys: 2.973 ± 1.529
5.203ThrLeu: 5.203 ± 2.015
2.601ThrMet: 2.601 ± 0.672
1.486ThrAsn: 1.486 ± 0.764
3.716ThrPro: 3.716 ± 1.439
2.973ThrGln: 2.973 ± 0.859
4.459ThrArg: 4.459 ± 0.283
4.088ThrSer: 4.088 ± 0.092
5.574ThrThr: 5.574 ± 1.824
4.459ThrVal: 4.459 ± 3.067
1.115ThrTrp: 1.115 ± 0.097
1.858ThrTyr: 1.858 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
8.175ValAla: 8.175 ± 0.184
1.858ValCys: 1.858 ± 0.385
5.574ValAsp: 5.574 ± 2.494
3.716ValGlu: 3.716 ± 1.241
3.344ValPhe: 3.344 ± 1.05
5.574ValGly: 5.574 ± 0.484
1.858ValHis: 1.858 ± 0.285
2.973ValIle: 2.973 ± 0.859
2.973ValLys: 2.973 ± 1.529
3.344ValLeu: 3.344 ± 0.96
2.23ValMet: 2.23 ± 0.193
3.344ValAsn: 3.344 ± 0.38
5.203ValPro: 5.203 ± 0.675
1.858ValGln: 1.858 ± 0.955
5.574ValArg: 5.574 ± 0.484
5.203ValSer: 5.203 ± 3.355
4.459ValThr: 4.459 ± 0.283
6.689ValVal: 6.689 ± 1.25
1.486ValTrp: 1.486 ± 0.764
2.23ValTyr: 2.23 ± 0.863
0.0ValXaa: 0.0 ± 0.0
Trp
0.743TrpAla: 0.743 ± 0.382
0.372TrpCys: 0.372 ± 0.191
1.486TrpAsp: 1.486 ± 0.764
0.743TrpGlu: 0.743 ± 0.382
0.372TrpPhe: 0.372 ± 0.479
1.486TrpGly: 1.486 ± 0.576
0.372TrpHis: 0.372 ± 0.191
0.743TrpIle: 0.743 ± 0.382
0.743TrpLys: 0.743 ± 0.382
1.115TrpLeu: 1.115 ± 0.097
1.115TrpMet: 1.115 ± 0.097
1.486TrpAsn: 1.486 ± 0.094
0.0TrpPro: 0.0 ± 0.0
0.743TrpGln: 0.743 ± 0.288
1.486TrpArg: 1.486 ± 1.246
1.486TrpSer: 1.486 ± 0.764
0.743TrpThr: 0.743 ± 0.288
0.743TrpVal: 0.743 ± 0.288
0.372TrpTrp: 0.372 ± 0.191
0.743TrpTyr: 0.743 ± 0.288
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.23TyrAla: 2.23 ± 1.533
0.743TyrCys: 0.743 ± 0.382
3.716TyrAsp: 3.716 ± 0.099
2.23TyrGlu: 2.23 ± 1.147
1.115TyrPhe: 1.115 ± 0.573
1.858TyrGly: 1.858 ± 0.385
2.23TyrHis: 2.23 ± 2.203
1.486TyrIle: 1.486 ± 0.094
1.858TyrLys: 1.858 ± 0.285
3.344TyrLeu: 3.344 ± 0.96
0.743TyrMet: 0.743 ± 0.288
1.486TyrAsn: 1.486 ± 1.246
0.743TyrPro: 0.743 ± 0.288
0.0TyrGln: 0.0 ± 0.0
3.716TyrArg: 3.716 ± 0.099
2.601TyrSer: 2.601 ± 1.342
2.23TyrThr: 2.23 ± 0.477
2.23TyrVal: 2.23 ± 1.147
1.115TyrTrp: 1.115 ± 0.767
1.115TyrTyr: 1.115 ± 0.097
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2692 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski