Amino acid dipepetide frequency for Yichang Insect virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.282AlaAla: 3.282 ± 0.413
2.088AlaCys: 2.088 ± 1.615
3.58AlaAsp: 3.58 ± 2.113
4.177AlaGlu: 4.177 ± 1.967
1.79AlaPhe: 1.79 ± 0.604
3.282AlaGly: 3.282 ± 1.076
0.895AlaHis: 0.895 ± 0.235
4.773AlaIle: 4.773 ± 2.888
4.475AlaLys: 4.475 ± 1.339
5.668AlaLeu: 5.668 ± 1.543
1.79AlaMet: 1.79 ± 1.638
1.193AlaAsn: 1.193 ± 0.7
1.492AlaPro: 1.492 ± 0.422
2.685AlaGln: 2.685 ± 0.781
4.475AlaArg: 4.475 ± 1.866
5.967AlaSer: 5.967 ± 1.571
1.79AlaThr: 1.79 ± 0.936
2.983AlaVal: 2.983 ± 2.248
0.597AlaTrp: 0.597 ± 0.368
1.79AlaTyr: 1.79 ± 0.469
0.0AlaXaa: 0.0 ± 0.0
Cys
1.193CysAla: 1.193 ± 0.451
0.597CysCys: 0.597 ± 0.859
0.895CysAsp: 0.895 ± 1.015
0.298CysGlu: 0.298 ± 0.338
0.298CysPhe: 0.298 ± 0.338
1.79CysGly: 1.79 ± 0.677
2.088CysHis: 2.088 ± 0.983
0.597CysIle: 0.597 ± 0.368
0.597CysLys: 0.597 ± 0.677
1.492CysLeu: 1.492 ± 0.733
0.298CysMet: 0.298 ± 0.184
0.597CysAsn: 0.597 ± 0.226
1.193CysPro: 1.193 ± 1.353
1.193CysGln: 1.193 ± 0.878
0.895CysArg: 0.895 ± 1.063
4.773CysSer: 4.773 ± 4.021
3.282CysThr: 3.282 ± 2.373
1.79CysVal: 1.79 ± 0.677
0.597CysTrp: 0.597 ± 0.677
2.088CysTyr: 2.088 ± 1.889
0.0CysXaa: 0.0 ± 0.0
Asp
2.088AspAla: 2.088 ± 1.04
2.088AspCys: 2.088 ± 0.983
3.58AspAsp: 3.58 ± 0.939
3.878AspGlu: 3.878 ± 1.226
2.685AspPhe: 2.685 ± 1.226
2.685AspGly: 2.685 ± 0.41
1.492AspHis: 1.492 ± 0.762
2.387AspIle: 2.387 ± 0.645
2.983AspLys: 2.983 ± 0.234
5.967AspLeu: 5.967 ± 0.596
2.387AspMet: 2.387 ± 0.713
2.387AspAsn: 2.387 ± 0.713
3.878AspPro: 3.878 ± 1.956
0.895AspGln: 0.895 ± 0.235
1.193AspArg: 1.193 ± 0.451
5.37AspSer: 5.37 ± 0.82
3.282AspThr: 3.282 ± 1.385
4.177AspVal: 4.177 ± 1.11
1.79AspTrp: 1.79 ± 0.677
1.193AspTyr: 1.193 ± 0.736
0.0AspXaa: 0.0 ± 0.0
Glu
4.475GluAla: 4.475 ± 1.866
1.193GluCys: 1.193 ± 0.451
4.475GluAsp: 4.475 ± 1.911
4.475GluGlu: 4.475 ± 2.322
2.387GluPhe: 2.387 ± 1.473
3.58GluGly: 3.58 ± 0.939
0.597GluHis: 0.597 ± 0.368
2.685GluIle: 2.685 ± 1.226
2.387GluLys: 2.387 ± 0.902
6.862GluLeu: 6.862 ± 1.397
2.685GluMet: 2.685 ± 0.41
1.193GluAsn: 1.193 ± 0.451
2.088GluPro: 2.088 ± 0.575
1.193GluGln: 1.193 ± 0.357
2.685GluArg: 2.685 ± 0.868
5.967GluSer: 5.967 ± 1.245
3.282GluThr: 3.282 ± 2.173
4.773GluVal: 4.773 ± 0.975
1.492GluTrp: 1.492 ± 0.517
1.79GluTyr: 1.79 ± 0.604
0.0GluXaa: 0.0 ± 0.0
Phe
1.79PheAla: 1.79 ± 0.469
1.193PheCys: 1.193 ± 1.353
3.282PheAsp: 3.282 ± 1.605
1.492PheGlu: 1.492 ± 0.517
2.088PhePhe: 2.088 ± 0.866
1.79PheGly: 1.79 ± 1.551
1.193PheHis: 1.193 ± 0.451
2.387PheIle: 2.387 ± 0.902
3.282PheLys: 3.282 ± 0.625
2.088PheLeu: 2.088 ± 0.575
0.895PheMet: 0.895 ± 0.738
1.193PheAsn: 1.193 ± 0.357
1.79PhePro: 1.79 ± 0.936
0.597PheGln: 0.597 ± 0.226
3.282PheArg: 3.282 ± 0.625
3.58PheSer: 3.58 ± 0.776
0.895PheThr: 0.895 ± 0.235
1.492PheVal: 1.492 ± 0.921
0.895PheTrp: 0.895 ± 0.545
0.895PheTyr: 0.895 ± 0.552
0.0PheXaa: 0.0 ± 0.0
Gly
2.983GlyAla: 2.983 ± 1.072
2.088GlyCys: 2.088 ± 1.889
2.983GlyAsp: 2.983 ± 0.844
4.177GlyGlu: 4.177 ± 1.15
3.282GlyPhe: 3.282 ± 1.217
3.282GlyGly: 3.282 ± 0.413
0.895GlyHis: 0.895 ± 0.545
2.685GlyIle: 2.685 ± 0.828
2.685GlyLys: 2.685 ± 0.41
5.072GlyLeu: 5.072 ± 1.115
2.983GlyMet: 2.983 ± 1.271
2.088GlyAsn: 2.088 ± 0.635
1.193GlyPro: 1.193 ± 0.451
2.387GlyGln: 2.387 ± 1.757
2.088GlyArg: 2.088 ± 0.983
5.37GlySer: 5.37 ± 2.632
2.685GlyThr: 2.685 ± 0.854
5.668GlyVal: 5.668 ± 0.891
0.597GlyTrp: 0.597 ± 0.226
1.193GlyTyr: 1.193 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
0.298HisAla: 0.298 ± 0.184
0.895HisCys: 0.895 ± 0.809
0.298HisAsp: 0.298 ± 0.184
1.193HisGlu: 1.193 ± 0.451
0.895HisPhe: 0.895 ± 0.545
0.895HisGly: 0.895 ± 0.235
0.895HisHis: 0.895 ± 0.545
2.685HisIle: 2.685 ± 2.092
0.895HisLys: 0.895 ± 0.235
2.088HisLeu: 2.088 ± 0.635
0.597HisMet: 0.597 ± 0.368
1.193HisAsn: 1.193 ± 0.357
1.193HisPro: 1.193 ± 0.736
0.895HisGln: 0.895 ± 0.809
0.597HisArg: 0.597 ± 0.368
2.088HisSer: 2.088 ± 0.983
0.597HisThr: 0.597 ± 0.677
1.193HisVal: 1.193 ± 0.878
0.0HisTrp: 0.0 ± 0.0
1.193HisTyr: 1.193 ± 0.736
0.0HisXaa: 0.0 ± 0.0
Ile
1.79IleAla: 1.79 ± 0.902
0.895IleCys: 0.895 ± 0.545
2.387IleAsp: 2.387 ± 0.713
3.878IleGlu: 3.878 ± 1.033
1.492IlePhe: 1.492 ± 0.762
2.983IleGly: 2.983 ± 0.844
2.387IleHis: 2.387 ± 0.902
3.282IleIle: 3.282 ± 1.59
5.37IleLys: 5.37 ± 0.614
4.177IleLeu: 4.177 ± 1.967
1.492IleMet: 1.492 ± 0.517
2.983IleAsn: 2.983 ± 1.271
2.088IlePro: 2.088 ± 1.671
2.983IleGln: 2.983 ± 0.234
3.58IleArg: 3.58 ± 1.082
7.16IleSer: 7.16 ± 0.891
2.685IleThr: 2.685 ± 0.704
3.282IleVal: 3.282 ± 0.927
0.0IleTrp: 0.0 ± 0.0
1.193IleTyr: 1.193 ± 0.451
0.0IleXaa: 0.0 ± 0.0
Lys
5.072LysAla: 5.072 ± 0.7
0.597LysCys: 0.597 ± 0.226
3.58LysAsp: 3.58 ± 1.082
1.492LysGlu: 1.492 ± 0.517
2.387LysPhe: 2.387 ± 1.163
1.79LysGly: 1.79 ± 1.551
1.193LysHis: 1.193 ± 0.451
3.878LysIle: 3.878 ± 1.223
3.58LysLys: 3.58 ± 1.873
8.353LysLeu: 8.353 ± 1.181
2.983LysMet: 2.983 ± 0.803
1.79LysAsn: 1.79 ± 1.105
2.088LysPro: 2.088 ± 1.04
1.79LysGln: 1.79 ± 0.586
3.282LysArg: 3.282 ± 2.025
3.58LysSer: 3.58 ± 1.805
3.878LysThr: 3.878 ± 1.064
2.387LysVal: 2.387 ± 0.714
1.193LysTrp: 1.193 ± 0.451
0.895LysTyr: 0.895 ± 0.545
0.0LysXaa: 0.0 ± 0.0
Leu
7.16LeuAla: 7.16 ± 4.183
0.895LeuCys: 0.895 ± 0.552
5.072LeuAsp: 5.072 ± 0.494
7.458LeuGlu: 7.458 ± 3.088
1.492LeuPhe: 1.492 ± 0.627
4.475LeuGly: 4.475 ± 1.174
0.895LeuHis: 0.895 ± 0.552
5.967LeuIle: 5.967 ± 1.662
5.37LeuLys: 5.37 ± 0.877
5.967LeuLeu: 5.967 ± 0.568
2.685LeuMet: 2.685 ± 0.41
2.387LeuAsn: 2.387 ± 0.713
4.177LeuPro: 4.177 ± 1.579
3.878LeuGln: 3.878 ± 1.223
6.862LeuArg: 6.862 ± 2.253
8.652LeuSer: 8.652 ± 1.86
7.16LeuThr: 7.16 ± 0.573
8.95LeuVal: 8.95 ± 0.98
2.088LeuTrp: 2.088 ± 0.575
3.58LeuTyr: 3.58 ± 0.23
0.0LeuXaa: 0.0 ± 0.0
Met
3.878MetAla: 3.878 ± 1.024
0.298MetCys: 0.298 ± 0.184
1.193MetAsp: 1.193 ± 0.736
2.387MetGlu: 2.387 ± 1.639
0.298MetPhe: 0.298 ± 0.338
1.492MetGly: 1.492 ± 0.627
0.597MetHis: 0.597 ± 0.368
1.492MetIle: 1.492 ± 0.422
2.088MetLys: 2.088 ± 0.866
3.58MetLeu: 3.58 ± 0.939
2.088MetMet: 2.088 ± 0.866
1.79MetAsn: 1.79 ± 0.604
2.088MetPro: 2.088 ± 1.289
0.298MetGln: 0.298 ± 0.184
2.088MetArg: 2.088 ± 1.486
3.58MetSer: 3.58 ± 0.286
2.387MetThr: 2.387 ± 0.547
2.685MetVal: 2.685 ± 0.704
0.895MetTrp: 0.895 ± 0.819
1.492MetTyr: 1.492 ± 1.634
0.0MetXaa: 0.0 ± 0.0
Asn
1.79AsnAla: 1.79 ± 0.936
0.895AsnCys: 0.895 ± 0.545
1.492AsnAsp: 1.492 ± 0.921
1.193AsnGlu: 1.193 ± 0.451
1.492AsnPhe: 1.492 ± 0.517
2.387AsnGly: 2.387 ± 0.399
0.895AsnHis: 0.895 ± 0.809
3.282AsnIle: 3.282 ± 2.199
2.088AsnLys: 2.088 ± 0.575
4.475AsnLeu: 4.475 ± 0.958
0.895AsnMet: 0.895 ± 0.235
0.895AsnAsn: 0.895 ± 0.552
1.79AsnPro: 1.79 ± 0.469
2.088AsnGln: 2.088 ± 1.289
2.387AsnArg: 2.387 ± 0.714
3.282AsnSer: 3.282 ± 0.893
2.088AsnThr: 2.088 ± 1.149
1.492AsnVal: 1.492 ± 0.517
0.597AsnTrp: 0.597 ± 0.368
1.492AsnTyr: 1.492 ± 0.921
0.0AsnXaa: 0.0 ± 0.0
Pro
2.088ProAla: 2.088 ± 1.615
0.597ProCys: 0.597 ± 0.677
2.983ProAsp: 2.983 ± 1.271
2.685ProGlu: 2.685 ± 1.226
0.597ProPhe: 0.597 ± 0.368
3.58ProGly: 3.58 ± 1.172
0.895ProHis: 0.895 ± 0.235
1.79ProIle: 1.79 ± 0.677
2.685ProLys: 2.685 ± 0.781
4.177ProLeu: 4.177 ± 1.269
0.895ProMet: 0.895 ± 0.819
2.088ProAsn: 2.088 ± 0.635
0.597ProPro: 0.597 ± 0.368
2.387ProGln: 2.387 ± 1.046
2.088ProArg: 2.088 ± 0.635
4.773ProSer: 4.773 ± 0.975
4.177ProThr: 4.177 ± 0.617
2.983ProVal: 2.983 ± 1.524
0.298ProTrp: 0.298 ± 0.184
1.492ProTyr: 1.492 ± 0.762
0.0ProXaa: 0.0 ± 0.0
Gln
1.79GlnAla: 1.79 ± 0.677
0.895GlnCys: 0.895 ± 1.015
1.79GlnAsp: 1.79 ± 0.469
1.492GlnGlu: 1.492 ± 0.921
1.492GlnPhe: 1.492 ± 0.422
2.685GlnGly: 2.685 ± 0.828
0.298GlnHis: 0.298 ± 0.184
1.492GlnIle: 1.492 ± 0.517
0.597GlnLys: 0.597 ± 0.368
3.58GlnLeu: 3.58 ± 0.776
0.895GlnMet: 0.895 ± 0.809
1.193GlnAsn: 1.193 ± 0.451
1.492GlnPro: 1.492 ± 0.762
0.0GlnGln: 0.0 ± 0.0
0.895GlnArg: 0.895 ± 0.235
2.685GlnSer: 2.685 ± 1.206
1.79GlnThr: 1.79 ± 0.469
2.983GlnVal: 2.983 ± 0.803
0.895GlnTrp: 0.895 ± 0.235
2.685GlnTyr: 2.685 ± 1.42
0.0GlnXaa: 0.0 ± 0.0
Arg
3.878ArgAla: 3.878 ± 3.116
0.298ArgCys: 0.298 ± 0.338
1.79ArgAsp: 1.79 ± 0.469
4.475ArgGlu: 4.475 ± 0.326
2.387ArgPhe: 2.387 ± 1.046
3.58ArgGly: 3.58 ± 0.776
0.0ArgHis: 0.0 ± 0.0
2.983ArgIle: 2.983 ± 0.803
1.79ArgLys: 1.79 ± 1.105
6.563ArgLeu: 6.563 ± 2.777
2.387ArgMet: 2.387 ± 0.713
1.79ArgAsn: 1.79 ± 1.105
2.685ArgPro: 2.685 ± 0.781
2.387ArgGln: 2.387 ± 0.713
3.58ArgArg: 3.58 ± 1.378
6.563ArgSer: 6.563 ± 1.613
3.58ArgThr: 3.58 ± 1.207
4.475ArgVal: 4.475 ± 0.958
1.193ArgTrp: 1.193 ± 0.357
1.492ArgTyr: 1.492 ± 0.517
0.0ArgXaa: 0.0 ± 0.0
Ser
5.072SerAla: 5.072 ± 1.156
4.773SerCys: 4.773 ± 3.032
5.668SerAsp: 5.668 ± 1.499
2.983SerGlu: 2.983 ± 0.803
3.878SerPhe: 3.878 ± 1.625
7.757SerGly: 7.757 ± 3.078
1.492SerHis: 1.492 ± 0.422
5.072SerIle: 5.072 ± 0.929
5.37SerLys: 5.37 ± 1.485
9.845SerLeu: 9.845 ± 2.96
2.983SerMet: 2.983 ± 0.135
4.475SerAsn: 4.475 ± 2.922
5.37SerPro: 5.37 ± 0.756
2.387SerGln: 2.387 ± 1.046
7.458SerArg: 7.458 ± 2.931
8.055SerSer: 8.055 ± 1.427
5.967SerThr: 5.967 ± 3.5
3.878SerVal: 3.878 ± 1.064
0.895SerTrp: 0.895 ± 0.819
1.79SerTyr: 1.79 ± 1.552
0.0SerXaa: 0.0 ± 0.0
Thr
2.983ThrAla: 2.983 ± 2.308
2.387ThrCys: 2.387 ± 2.706
5.37ThrAsp: 5.37 ± 1.501
5.072ThrGlu: 5.072 ± 0.494
2.983ThrPhe: 2.983 ± 1.033
3.58ThrGly: 3.58 ± 0.23
0.597ThrHis: 0.597 ± 0.677
3.58ThrIle: 3.58 ± 1.082
2.983ThrLys: 2.983 ± 0.234
6.862ThrLeu: 6.862 ± 0.642
1.492ThrMet: 1.492 ± 1.634
2.685ThrAsn: 2.685 ± 0.378
3.282ThrPro: 3.282 ± 1.076
0.895ThrGln: 0.895 ± 0.235
2.983ThrArg: 2.983 ± 1.033
4.475ThrSer: 4.475 ± 2.022
5.072ThrThr: 5.072 ± 2.504
2.088ThrVal: 2.088 ± 1.671
0.597ThrTrp: 0.597 ± 0.368
2.088ThrTyr: 2.088 ± 1.486
0.0ThrXaa: 0.0 ± 0.0
Val
5.37ValAla: 5.37 ± 1.208
2.387ValCys: 2.387 ± 0.965
2.685ValAsp: 2.685 ± 0.41
4.177ValGlu: 4.177 ± 1.732
2.983ValPhe: 2.983 ± 0.495
2.088ValGly: 2.088 ± 0.575
1.492ValHis: 1.492 ± 0.517
2.685ValIle: 2.685 ± 0.378
2.387ValLys: 2.387 ± 0.714
5.072ValLeu: 5.072 ± 1.475
3.58ValMet: 3.58 ± 0.776
3.282ValAsn: 3.282 ± 0.413
4.475ValPro: 4.475 ± 2.113
1.79ValGln: 1.79 ± 0.469
3.58ValArg: 3.58 ± 1.212
5.072ValSer: 5.072 ± 0.494
3.282ValThr: 3.282 ± 0.413
5.072ValVal: 5.072 ± 1.891
1.79ValTrp: 1.79 ± 1.105
2.088ValTyr: 2.088 ± 0.575
0.0ValXaa: 0.0 ± 0.0
Trp
0.298TrpAla: 0.298 ± 0.184
0.597TrpCys: 0.597 ± 0.677
1.193TrpAsp: 1.193 ± 0.357
1.193TrpGlu: 1.193 ± 1.733
0.895TrpPhe: 0.895 ± 0.235
0.597TrpGly: 0.597 ± 0.677
0.597TrpHis: 0.597 ± 0.368
0.895TrpIle: 0.895 ± 0.235
1.193TrpLys: 1.193 ± 0.451
1.193TrpLeu: 1.193 ± 0.451
0.298TrpMet: 0.298 ± 0.184
0.597TrpAsn: 0.597 ± 0.226
0.0TrpPro: 0.0 ± 0.0
0.298TrpGln: 0.298 ± 0.184
1.492TrpArg: 1.492 ± 0.921
2.387TrpSer: 2.387 ± 1.046
0.895TrpThr: 0.895 ± 0.552
1.79TrpVal: 1.79 ± 0.689
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.492TyrAla: 1.492 ± 0.422
0.895TyrCys: 0.895 ± 0.545
2.088TyrAsp: 2.088 ± 0.723
2.088TyrGlu: 2.088 ± 0.866
0.597TyrPhe: 0.597 ± 0.226
2.088TyrGly: 2.088 ± 0.723
1.193TyrHis: 1.193 ± 1.265
1.492TyrIle: 1.492 ± 0.517
2.983TyrLys: 2.983 ± 1.422
2.088TyrLeu: 2.088 ± 0.635
2.088TyrMet: 2.088 ± 1.289
1.193TyrAsn: 1.193 ± 0.82
0.895TyrPro: 0.895 ± 0.545
0.597TyrGln: 0.597 ± 0.942
2.387TyrArg: 2.387 ± 1.046
1.79TyrSer: 1.79 ± 1.089
3.282TyrThr: 3.282 ± 0.625
1.193TyrVal: 1.193 ± 0.451
0.0TyrTrp: 0.0 ± 0.0
1.492TyrTyr: 1.492 ± 0.422
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3353 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski