Amino acid dipepetide frequency for Wenzhou Shrimp Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.995AlaAla: 4.995 ± 2.764
2.185AlaCys: 2.185 ± 0.754
3.122AlaAsp: 3.122 ± 1.758
4.995AlaGlu: 4.995 ± 1.919
0.937AlaPhe: 0.937 ± 0.596
2.81AlaGly: 2.81 ± 0.401
1.249AlaHis: 1.249 ± 0.276
3.434AlaIle: 3.434 ± 1.124
1.873AlaLys: 1.873 ± 2.07
7.493AlaLeu: 7.493 ± 0.844
3.746AlaMet: 3.746 ± 1.505
4.371AlaAsn: 4.371 ± 2.169
1.249AlaPro: 1.249 ± 0.422
0.624AlaGln: 0.624 ± 0.211
2.498AlaArg: 2.498 ± 1.063
3.746AlaSer: 3.746 ± 0.344
4.059AlaThr: 4.059 ± 0.737
4.683AlaVal: 4.683 ± 1.414
0.937AlaTrp: 0.937 ± 0.541
1.561AlaTyr: 1.561 ± 0.436
0.0AlaXaa: 0.0 ± 0.0
Cys
1.249CysAla: 1.249 ± 0.909
0.312CysCys: 0.312 ± 0.355
0.624CysAsp: 0.624 ± 0.211
1.561CysGlu: 1.561 ± 1.112
1.873CysPhe: 1.873 ± 1.618
0.0CysGly: 0.0 ± 0.0
1.561CysHis: 1.561 ± 1.771
0.624CysIle: 0.624 ± 0.361
0.624CysLys: 0.624 ± 0.211
3.434CysLeu: 3.434 ± 0.524
0.937CysMet: 0.937 ± 0.556
1.249CysAsn: 1.249 ± 0.909
1.249CysPro: 1.249 ± 1.421
1.249CysGln: 1.249 ± 0.422
0.624CysArg: 0.624 ± 0.211
4.995CysSer: 4.995 ± 2.243
1.561CysThr: 1.561 ± 1.776
1.561CysVal: 1.561 ± 0.762
0.0CysTrp: 0.0 ± 0.0
1.561CysTyr: 1.561 ± 1.776
0.0CysXaa: 0.0 ± 0.0
Asp
2.185AspAla: 2.185 ± 0.369
1.873AspCys: 1.873 ± 1.112
3.434AspAsp: 3.434 ± 1.062
2.81AspGlu: 2.81 ± 0.707
3.122AspPhe: 3.122 ± 1.417
3.434AspGly: 3.434 ± 0.869
2.81AspHis: 2.81 ± 0.295
1.873AspIle: 1.873 ± 0.607
2.498AspLys: 2.498 ± 1.1
4.683AspLeu: 4.683 ± 0.965
1.561AspMet: 1.561 ± 0.762
1.249AspAsn: 1.249 ± 0.276
3.434AspPro: 3.434 ± 0.524
1.249AspGln: 1.249 ± 0.276
2.185AspArg: 2.185 ± 1.212
6.556AspSer: 6.556 ± 1.283
2.185AspThr: 2.185 ± 0.5
2.81AspVal: 2.81 ± 0.883
0.624AspTrp: 0.624 ± 0.361
0.312AspTyr: 0.312 ± 0.18
0.0AspXaa: 0.0 ± 0.0
Glu
5.308GluAla: 5.308 ± 0.972
0.937GluCys: 0.937 ± 0.596
2.81GluAsp: 2.81 ± 1.256
4.995GluGlu: 4.995 ± 1.919
3.122GluPhe: 3.122 ± 1.316
3.434GluGly: 3.434 ± 0.645
1.249GluHis: 1.249 ± 0.276
3.122GluIle: 3.122 ± 0.4
4.059GluLys: 4.059 ± 0.877
4.995GluLeu: 4.995 ± 0.884
2.81GluMet: 2.81 ± 0.707
2.185GluAsn: 2.185 ± 1.262
2.185GluPro: 2.185 ± 0.42
2.498GluGln: 2.498 ± 0.61
4.059GluArg: 4.059 ± 0.354
2.81GluSer: 2.81 ± 0.401
3.434GluThr: 3.434 ± 0.691
5.308GluVal: 5.308 ± 0.976
0.624GluTrp: 0.624 ± 0.71
1.873GluTyr: 1.873 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
2.185PheAla: 2.185 ± 1.158
2.185PheCys: 2.185 ± 0.971
1.873PheAsp: 1.873 ± 0.82
2.185PheGlu: 2.185 ± 1.212
0.312PhePhe: 0.312 ± 0.18
2.498PheGly: 2.498 ± 1.1
1.873PheHis: 1.873 ± 0.335
2.498PheIle: 2.498 ± 1.033
3.122PheLys: 3.122 ± 0.871
3.434PheLeu: 3.434 ± 1.041
0.937PheMet: 0.937 ± 0.167
1.873PheAsn: 1.873 ± 0.508
1.561PhePro: 1.561 ± 0.436
1.249PheGln: 1.249 ± 0.629
1.873PheArg: 1.873 ± 1.082
1.873PheSer: 1.873 ± 0.335
2.498PheThr: 2.498 ± 0.486
1.873PheVal: 1.873 ± 0.633
0.624PheTrp: 0.624 ± 0.361
0.937PheTyr: 0.937 ± 0.556
0.0PheXaa: 0.0 ± 0.0
Gly
3.122GlyAla: 3.122 ± 1.217
2.185GlyCys: 2.185 ± 1.972
3.746GlyAsp: 3.746 ± 0.829
2.81GlyGlu: 2.81 ± 0.502
2.81GlyPhe: 2.81 ± 1.68
4.371GlyGly: 4.371 ± 1.564
1.249GlyHis: 1.249 ± 0.422
3.434GlyIle: 3.434 ± 0.949
3.434GlyLys: 3.434 ± 1.173
2.498GlyLeu: 2.498 ± 0.553
1.249GlyMet: 1.249 ± 0.422
3.746GlyAsn: 3.746 ± 1.865
0.937GlyPro: 0.937 ± 0.556
2.185GlyGln: 2.185 ± 0.535
1.873GlyArg: 1.873 ± 0.508
6.244GlySer: 6.244 ± 1.126
4.371GlyThr: 4.371 ± 1.43
5.62GlyVal: 5.62 ± 1.821
0.312GlyTrp: 0.312 ± 0.355
1.873GlyTyr: 1.873 ± 1.082
0.0GlyXaa: 0.0 ± 0.0
His
1.873HisAla: 1.873 ± 0.508
1.561HisCys: 1.561 ± 1.263
0.312HisAsp: 0.312 ± 0.18
1.561HisGlu: 1.561 ± 0.902
2.185HisPhe: 2.185 ± 0.369
1.873HisGly: 1.873 ± 0.335
0.937HisHis: 0.937 ± 0.541
2.81HisIle: 2.81 ± 0.295
1.873HisLys: 1.873 ± 0.439
3.434HisLeu: 3.434 ± 1.679
0.312HisMet: 0.312 ± 0.18
1.249HisAsn: 1.249 ± 0.276
1.873HisPro: 1.873 ± 1.082
1.249HisGln: 1.249 ± 0.422
1.249HisArg: 1.249 ± 0.721
2.498HisSer: 2.498 ± 0.959
0.312HisThr: 0.312 ± 0.18
3.122HisVal: 3.122 ± 0.871
0.312HisTrp: 0.312 ± 0.355
1.249HisTyr: 1.249 ± 0.276
0.0HisXaa: 0.0 ± 0.0
Ile
1.249IleAla: 1.249 ± 1.231
0.937IleCys: 0.937 ± 1.429
2.185IleAsp: 2.185 ± 0.5
1.561IleGlu: 1.561 ± 0.902
2.81IlePhe: 2.81 ± 1.623
2.498IleGly: 2.498 ± 0.281
0.937IleHis: 0.937 ± 0.167
2.81IleIle: 2.81 ± 0.707
3.746IleLys: 3.746 ± 0.829
7.805IleLeu: 7.805 ± 1.553
0.624IleMet: 0.624 ± 0.361
2.81IleAsn: 2.81 ± 1.73
2.185IlePro: 2.185 ± 1.243
4.059IleGln: 4.059 ± 0.737
2.185IleArg: 2.185 ± 1.752
6.244IleSer: 6.244 ± 2.628
4.371IleThr: 4.371 ± 1.066
4.683IleVal: 4.683 ± 2.287
0.312IleTrp: 0.312 ± 0.18
1.873IleTyr: 1.873 ± 1.082
0.0IleXaa: 0.0 ± 0.0
Lys
4.371LysAla: 4.371 ± 1.066
1.873LysCys: 1.873 ± 1.618
4.371LysAsp: 4.371 ± 1.43
3.122LysGlu: 3.122 ± 1.641
2.185LysPhe: 2.185 ± 0.5
1.561LysGly: 1.561 ± 0.762
1.249LysHis: 1.249 ± 0.721
2.498LysIle: 2.498 ± 1.033
4.995LysLys: 4.995 ± 2.067
4.683LysLeu: 4.683 ± 0.706
2.498LysMet: 2.498 ± 0.281
1.873LysAsn: 1.873 ± 1.192
2.185LysPro: 2.185 ± 0.782
0.937LysGln: 0.937 ± 0.642
4.059LysArg: 4.059 ± 1.854
2.81LysSer: 2.81 ± 0.502
4.059LysThr: 4.059 ± 0.168
4.059LysVal: 4.059 ± 0.168
1.249LysTrp: 1.249 ± 0.276
2.185LysTyr: 2.185 ± 1.212
0.0LysXaa: 0.0 ± 0.0
Leu
6.556LeuAla: 6.556 ± 0.939
2.498LeuCys: 2.498 ± 0.281
3.746LeuAsp: 3.746 ± 1.017
7.181LeuGlu: 7.181 ± 1.516
1.873LeuPhe: 1.873 ± 0.335
7.805LeuGly: 7.805 ± 0.062
4.371LeuHis: 4.371 ± 0.054
6.244LeuIle: 6.244 ± 1.568
4.371LeuLys: 4.371 ± 1.693
8.742LeuLeu: 8.742 ± 1.348
3.746LeuMet: 3.746 ± 0.192
2.185LeuAsn: 2.185 ± 1.243
4.371LeuPro: 4.371 ± 1.043
2.498LeuGln: 2.498 ± 0.553
4.371LeuArg: 4.371 ± 1.564
8.117LeuSer: 8.117 ± 1.637
5.62LeuThr: 5.62 ± 0.323
5.932LeuVal: 5.932 ± 1.422
1.873LeuTrp: 1.873 ± 0.633
2.185LeuTyr: 2.185 ± 0.754
0.0LeuXaa: 0.0 ± 0.0
Met
1.873MetAla: 1.873 ± 1.192
0.0MetCys: 0.0 ± 0.0
2.185MetAsp: 2.185 ± 0.42
1.249MetGlu: 1.249 ± 0.721
1.249MetPhe: 1.249 ± 0.629
1.249MetGly: 1.249 ± 0.721
2.185MetHis: 2.185 ± 0.782
2.81MetIle: 2.81 ± 0.741
1.561MetLys: 1.561 ± 0.448
2.185MetLeu: 2.185 ± 0.5
1.249MetMet: 1.249 ± 0.524
0.937MetAsn: 0.937 ± 0.167
0.937MetPro: 0.937 ± 0.541
1.561MetGln: 1.561 ± 0.448
1.561MetArg: 1.561 ± 0.336
4.059MetSer: 4.059 ± 1.388
3.122MetThr: 3.122 ± 0.4
1.561MetVal: 1.561 ± 1.112
0.0MetTrp: 0.0 ± 0.0
1.561MetTyr: 1.561 ± 0.436
0.0MetXaa: 0.0 ± 0.0
Asn
1.873AsnAla: 1.873 ± 0.82
1.249AsnCys: 1.249 ± 0.836
1.561AsnAsp: 1.561 ± 0.436
3.122AsnGlu: 3.122 ± 1.011
0.624AsnPhe: 0.624 ± 0.211
2.81AsnGly: 2.81 ± 1.668
1.873AsnHis: 1.873 ± 1.284
3.434AsnIle: 3.434 ± 0.645
2.498AsnLys: 2.498 ± 1.293
4.995AsnLeu: 4.995 ± 1.502
0.937AsnMet: 0.937 ± 0.541
0.937AsnAsn: 0.937 ± 0.556
2.498AsnPro: 2.498 ± 1.293
1.561AsnGln: 1.561 ± 0.336
1.561AsnArg: 1.561 ± 0.668
3.122AsnSer: 3.122 ± 1.065
1.249AsnThr: 1.249 ± 0.721
1.873AsnVal: 1.873 ± 0.633
0.937AsnTrp: 0.937 ± 0.541
1.561AsnTyr: 1.561 ± 0.336
0.0AsnXaa: 0.0 ± 0.0
Pro
3.746ProAla: 3.746 ± 1.629
0.624ProCys: 0.624 ± 0.71
2.81ProAsp: 2.81 ± 0.502
3.434ProGlu: 3.434 ± 0.645
1.561ProPhe: 1.561 ± 0.762
3.434ProGly: 3.434 ± 0.949
0.624ProHis: 0.624 ± 0.211
1.873ProIle: 1.873 ± 0.335
1.873ProLys: 1.873 ± 1.284
4.059ProLeu: 4.059 ± 0.737
0.624ProMet: 0.624 ± 0.361
0.937ProAsn: 0.937 ± 0.541
0.624ProPro: 0.624 ± 0.71
1.873ProGln: 1.873 ± 0.439
2.185ProArg: 2.185 ± 0.535
4.371ProSer: 4.371 ± 0.528
1.873ProThr: 1.873 ± 0.933
1.561ProVal: 1.561 ± 0.336
0.937ProTrp: 0.937 ± 0.596
1.249ProTyr: 1.249 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
1.873GlnAla: 1.873 ± 0.508
0.624GlnCys: 0.624 ± 0.71
2.185GlnAsp: 2.185 ± 0.369
1.561GlnGlu: 1.561 ± 0.336
0.624GlnPhe: 0.624 ± 0.211
2.81GlnGly: 2.81 ± 0.295
0.937GlnHis: 0.937 ± 0.541
1.873GlnIle: 1.873 ± 0.82
0.937GlnLys: 0.937 ± 0.541
2.81GlnLeu: 2.81 ± 0.741
0.937GlnMet: 0.937 ± 0.541
1.249GlnAsn: 1.249 ± 0.909
1.561GlnPro: 1.561 ± 0.902
0.624GlnGln: 0.624 ± 0.361
1.561GlnArg: 1.561 ± 0.902
4.059GlnSer: 4.059 ± 0.819
1.561GlnThr: 1.561 ± 0.902
3.746GlnVal: 3.746 ± 0.669
0.624GlnTrp: 0.624 ± 0.211
0.624GlnTyr: 0.624 ± 0.616
0.0GlnXaa: 0.0 ± 0.0
Arg
4.371ArgAla: 4.371 ± 0.054
0.624ArgCys: 0.624 ± 0.71
2.81ArgAsp: 2.81 ± 1.138
3.746ArgGlu: 3.746 ± 0.829
1.249ArgPhe: 1.249 ± 0.836
2.498ArgGly: 2.498 ± 0.281
0.937ArgHis: 0.937 ± 0.541
1.873ArgIle: 1.873 ± 0.508
4.059ArgLys: 4.059 ± 0.981
3.746ArgLeu: 3.746 ± 0.192
1.561ArgMet: 1.561 ± 0.902
2.498ArgAsn: 2.498 ± 1.442
0.937ArgPro: 0.937 ± 0.167
1.561ArgGln: 1.561 ± 0.436
3.434ArgArg: 3.434 ± 0.691
5.62ArgSer: 5.62 ± 0.323
3.122ArgThr: 3.122 ± 0.671
3.746ArgVal: 3.746 ± 0.192
0.624ArgTrp: 0.624 ± 0.361
1.561ArgTyr: 1.561 ± 0.436
0.0ArgXaa: 0.0 ± 0.0
Ser
3.746SerAla: 3.746 ± 1.017
3.122SerCys: 3.122 ± 2.526
3.122SerAsp: 3.122 ± 1.316
4.995SerGlu: 4.995 ± 1.106
3.434SerPhe: 3.434 ± 1.495
5.308SerGly: 5.308 ± 1.946
2.185SerHis: 2.185 ± 0.782
5.932SerIle: 5.932 ± 0.38
5.308SerLys: 5.308 ± 1.063
9.366SerLeu: 9.366 ± 0.954
2.498SerMet: 2.498 ± 1.12
4.995SerAsn: 4.995 ± 1.22
4.995SerPro: 4.995 ± 1.232
2.498SerGln: 2.498 ± 0.486
3.122SerArg: 3.122 ± 0.871
8.43SerSer: 8.43 ± 2.364
6.869SerThr: 6.869 ± 1.333
5.62SerVal: 5.62 ± 1.153
1.561SerTrp: 1.561 ± 0.436
2.81SerTyr: 2.81 ± 1.034
0.0SerXaa: 0.0 ± 0.0
Thr
3.434ThrAla: 3.434 ± 0.645
1.561ThrCys: 1.561 ± 1.604
2.185ThrAsp: 2.185 ± 0.42
4.059ThrGlu: 4.059 ± 0.354
2.81ThrPhe: 2.81 ± 0.748
2.498ThrGly: 2.498 ± 0.486
1.873ThrHis: 1.873 ± 0.439
3.746ThrIle: 3.746 ± 1.385
2.81ThrLys: 2.81 ± 0.295
7.493ThrLeu: 7.493 ± 0.195
2.185ThrMet: 2.185 ± 0.369
0.937ThrAsn: 0.937 ± 0.556
3.122ThrPro: 3.122 ± 0.9
2.498ThrGln: 2.498 ± 0.959
4.371ThrArg: 4.371 ± 0.84
6.869ThrSer: 6.869 ± 1.828
2.81ThrThr: 2.81 ± 1.668
3.434ThrVal: 3.434 ± 0.645
1.561ThrTrp: 1.561 ± 0.436
1.561ThrTyr: 1.561 ± 1.207
0.0ThrXaa: 0.0 ± 0.0
Val
4.371ValAla: 4.371 ± 0.738
1.249ValCys: 1.249 ± 0.276
5.308ValAsp: 5.308 ± 0.976
3.434ValGlu: 3.434 ± 0.114
3.434ValPhe: 3.434 ± 0.691
5.308ValGly: 5.308 ± 1.587
1.249ValHis: 1.249 ± 0.276
2.81ValIle: 2.81 ± 0.401
4.059ValLys: 4.059 ± 0.737
5.62ValLeu: 5.62 ± 1.427
2.185ValMet: 2.185 ± 0.43
3.434ValAsn: 3.434 ± 0.524
2.81ValPro: 2.81 ± 0.295
1.873ValGln: 1.873 ± 0.607
4.371ValArg: 4.371 ± 0.84
4.995ValSer: 4.995 ± 1.275
4.995ValThr: 4.995 ± 1.689
5.62ValVal: 5.62 ± 0.92
0.937ValTrp: 0.937 ± 0.167
2.498ValTyr: 2.498 ± 0.553
0.0ValXaa: 0.0 ± 0.0
Trp
1.561TrpAla: 1.561 ± 0.448
0.0TrpCys: 0.0 ± 0.0
1.249TrpAsp: 1.249 ± 0.721
1.249TrpGlu: 1.249 ± 0.721
0.624TrpPhe: 0.624 ± 0.211
0.312TrpGly: 0.312 ± 0.18
0.312TrpHis: 0.312 ± 0.18
0.624TrpIle: 0.624 ± 0.361
0.937TrpLys: 0.937 ± 0.167
1.249TrpLeu: 1.249 ± 0.422
0.312TrpMet: 0.312 ± 0.18
0.312TrpAsn: 0.312 ± 0.18
0.312TrpPro: 0.312 ± 0.18
0.0TrpGln: 0.0 ± 0.0
0.937TrpArg: 0.937 ± 0.167
0.937TrpSer: 0.937 ± 0.167
0.937TrpThr: 0.937 ± 0.642
2.185TrpVal: 2.185 ± 0.971
0.624TrpTrp: 0.624 ± 0.71
0.624TrpTyr: 0.624 ± 0.361
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.937TyrAla: 0.937 ± 0.167
1.249TyrCys: 1.249 ± 0.422
1.249TyrAsp: 1.249 ± 0.422
2.185TyrGlu: 2.185 ± 0.42
0.624TyrPhe: 0.624 ± 0.71
1.561TyrGly: 1.561 ± 0.336
2.185TyrHis: 2.185 ± 0.782
1.561TyrIle: 1.561 ± 0.436
2.185TyrLys: 2.185 ± 0.754
1.561TyrLeu: 1.561 ± 0.436
1.873TyrMet: 1.873 ± 0.82
1.249TyrAsn: 1.249 ± 0.629
1.249TyrPro: 1.249 ± 0.629
0.937TyrGln: 0.937 ± 0.167
2.498TyrArg: 2.498 ± 0.959
1.561TyrSer: 1.561 ± 0.436
2.81TyrThr: 2.81 ± 1.034
1.561TyrVal: 1.561 ± 0.448
0.624TyrTrp: 0.624 ± 0.616
1.249TyrTyr: 1.249 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3204 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski