Amino acid dipeptide frequency for Wuhan house centipede virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
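The calculation described above can be sketched in Python as follows. This is a minimal illustration and not the Proteome-pI implementation: the function names, the use of one-letter codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"    # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"            # X (Xaa) stands for non-standard or unknown residues
UNIFORM_EXPECTATION = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet is mapped to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"    # would be marked in red
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"   # would be marked in blue
    return "as expected"
```

For example, `classify(9.892)` (the AlaLeu value in the table) returns `"over"`, while `classify(0.638)` (AlaCys) returns `"under"`.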

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
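As a small worked example of this unit conversion, the following Python sketch turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


# AlaLeu is listed as 9.892 per mille in the table.
ala_leu_fraction = per_mille_to_fraction(9.892)   # about 0.009892
ala_leu_percent = per_mille_to_percent(9.892)     # about 0.9892 %
```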

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.234 ± 0.571
AlaCys: 0.638 ± 0.335
AlaAsp: 1.595 ± 0.837
AlaGlu: 1.276 ± 0.579
AlaPhe: 1.595 ± 0.895
AlaGly: 2.872 ± 0.95
AlaHis: 1.276 ± 0.884
AlaIle: 2.234 ± 1.171
AlaLys: 1.276 ± 0.669
AlaLeu: 9.892 ± 1.855
AlaMet: 1.914 ± 1.004
AlaAsn: 1.276 ± 0.669
AlaPro: 1.276 ± 1.453
AlaGln: 1.276 ± 0.412
AlaArg: 1.276 ± 0.669
AlaSer: 2.234 ± 1.005
AlaThr: 1.914 ± 1.004
AlaVal: 5.743 ± 1.991
AlaTrp: 0.638 ± 0.727
AlaTyr: 1.595 ± 0.803
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.957 ± 0.502
CysCys: 0.319 ± 0.167
CysAsp: 1.595 ± 0.837
CysGlu: 0.957 ± 0.433
CysPhe: 1.595 ± 0.457
CysGly: 1.276 ± 2.008
CysHis: 0.957 ± 0.502
CysIle: 1.914 ± 0.551
CysLys: 0.638 ± 0.954
CysLeu: 3.829 ± 0.601
CysMet: 0.0 ± 0.0
CysAsn: 0.638 ± 0.954
CysPro: 0.957 ± 0.502
CysGln: 0.319 ± 0.625
CysArg: 0.957 ± 0.502
CysSer: 3.51 ± 0.813
CysThr: 0.638 ± 0.727
CysVal: 3.51 ± 0.613
CysTrp: 0.319 ± 1.029
CysTyr: 2.234 ± 0.829
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.51 ± 0.656
AspCys: 3.51 ± 1.51
AspAsp: 1.914 ± 0.551
AspGlu: 2.553 ± 1.339
AspPhe: 3.191 ± 1.096
AspGly: 2.872 ± 0.657
AspHis: 1.276 ± 0.412
AspIle: 2.553 ± 0.788
AspLys: 0.957 ± 0.502
AspLeu: 2.872 ± 1.87
AspMet: 0.638 ± 0.469
AspAsn: 2.553 ± 0.952
AspPro: 2.553 ± 0.825
AspGln: 2.553 ± 0.813
AspArg: 3.191 ± 1.115
AspSer: 3.191 ± 1.674
AspThr: 2.553 ± 0.788
AspVal: 3.829 ± 2.008
AspTrp: 0.319 ± 0.625
AspTyr: 3.51 ± 0.998
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.595 ± 0.837
GluCys: 0.957 ± 0.502
GluAsp: 2.234 ± 0.674
GluGlu: 1.914 ± 1.004
GluPhe: 2.872 ± 0.961
GluGly: 0.638 ± 0.335
GluHis: 0.638 ± 0.335
GluIle: 4.148 ± 1.592
GluLys: 7.339 ± 1.571
GluLeu: 5.105 ± 1.626
GluMet: 0.957 ± 0.502
GluAsn: 3.829 ± 1.102
GluPro: 1.595 ± 2.222
GluGln: 1.595 ± 0.457
GluArg: 2.872 ± 0.912
GluSer: 5.743 ± 0.488
GluThr: 2.872 ± 0.854
GluVal: 4.467 ± 1.348
GluTrp: 0.957 ± 0.433
GluTyr: 4.786 ± 1.398
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.553 ± 1.593
PheCys: 3.51 ± 1.791
PheAsp: 1.595 ± 1.149
PheGlu: 4.467 ± 0.785
PhePhe: 0.957 ± 0.433
PheGly: 1.914 ± 1.136
PheHis: 1.595 ± 0.803
PheIle: 2.553 ± 0.825
PheLys: 2.553 ± 0.518
PheLeu: 6.701 ± 2.517
PheMet: 1.276 ± 0.819
PheAsn: 2.234 ± 1.781
PhePro: 0.638 ± 0.335
PheGln: 1.914 ± 0.937
PheArg: 2.234 ± 1.0
PheSer: 6.382 ± 1.163
PheThr: 2.872 ± 2.122
PheVal: 6.063 ± 1.768
PheTrp: 0.319 ± 0.167
PheTyr: 1.914 ± 0.551
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.595 ± 0.457
GlyCys: 0.957 ± 0.502
GlyAsp: 2.872 ± 1.201
GlyGlu: 0.957 ± 0.502
GlyPhe: 1.914 ± 1.064
GlyGly: 1.914 ± 0.713
GlyHis: 0.319 ± 0.167
GlyIle: 2.234 ± 0.674
GlyLys: 2.234 ± 1.445
GlyLeu: 2.553 ± 0.518
GlyMet: 1.595 ± 0.837
GlyAsn: 1.914 ± 0.866
GlyPro: 0.957 ± 1.043
GlyGln: 0.957 ± 1.129
GlyArg: 3.191 ± 1.32
GlySer: 2.553 ± 1.768
GlyThr: 2.234 ± 0.674
GlyVal: 3.191 ± 0.703
GlyTrp: 0.638 ± 0.511
GlyTyr: 2.234 ± 1.094
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.319 ± 0.167
HisCys: 0.638 ± 0.511
HisAsp: 1.276 ± 0.579
HisGlu: 1.914 ± 1.004
HisPhe: 0.638 ± 0.335
HisGly: 1.914 ± 0.713
HisHis: 0.957 ± 0.433
HisIle: 1.595 ± 0.457
HisLys: 1.276 ± 0.884
HisLeu: 2.234 ± 1.005
HisMet: 0.638 ± 0.511
HisAsn: 1.276 ± 0.669
HisPro: 1.914 ± 0.551
HisGln: 0.957 ± 1.129
HisArg: 1.595 ± 0.803
HisSer: 2.234 ± 1.171
HisThr: 2.234 ± 0.682
HisVal: 1.595 ± 1.281
HisTrp: 0.0 ± 0.0
HisTyr: 1.595 ± 0.457
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.234 ± 1.203
IleCys: 1.595 ± 0.457
IleAsp: 5.105 ± 1.451
IleGlu: 1.914 ± 1.004
IlePhe: 4.467 ± 1.134
IleGly: 0.638 ± 0.335
IleHis: 0.957 ± 0.502
IleIle: 4.148 ± 1.258
IleLys: 3.829 ± 0.601
IleLeu: 4.786 ± 1.093
IleMet: 0.957 ± 0.502
IleAsn: 3.51 ± 1.51
IlePro: 4.786 ± 1.038
IleGln: 0.638 ± 0.511
IleArg: 3.51 ± 1.272
IleSer: 7.339 ± 1.468
IleThr: 4.786 ± 0.815
IleVal: 5.424 ± 1.199
IleTrp: 0.0 ± 0.0
IleTyr: 1.914 ± 0.605
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.595 ± 0.781
LysCys: 1.276 ± 0.669
LysAsp: 2.872 ± 0.961
LysGlu: 5.105 ± 1.525
LysPhe: 3.51 ± 1.915
LysGly: 0.957 ± 0.502
LysHis: 0.957 ± 0.502
LysIle: 3.829 ± 1.431
LysLys: 4.786 ± 1.31
LysLeu: 7.339 ± 0.729
LysMet: 1.595 ± 0.457
LysAsn: 5.105 ± 0.938
LysPro: 2.234 ± 1.171
LysGln: 0.638 ± 0.335
LysArg: 1.914 ± 1.004
LysSer: 3.191 ± 1.115
LysThr: 2.872 ± 0.923
LysVal: 2.553 ± 1.033
LysTrp: 0.0 ± 0.0
LysTyr: 1.595 ± 0.457
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.148 ± 1.496
LeuCys: 2.872 ± 1.262
LeuAsp: 3.829 ± 0.927
LeuGlu: 6.701 ± 2.082
LeuPhe: 4.786 ± 0.858
LeuGly: 3.829 ± 1.282
LeuHis: 2.234 ± 1.872
LeuIle: 4.786 ± 2.316
LeuLys: 7.02 ± 2.068
LeuLeu: 9.892 ± 3.499
LeuMet: 1.914 ± 1.064
LeuAsn: 6.701 ± 1.035
LeuPro: 5.424 ± 1.28
LeuGln: 2.872 ± 0.912
LeuArg: 5.424 ± 2.131
LeuSer: 6.701 ± 2.027
LeuThr: 7.02 ± 0.586
LeuVal: 5.743 ± 0.784
LeuTrp: 0.638 ± 0.727
LeuTyr: 3.51 ± 1.791
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.319 ± 0.167
MetCys: 0.319 ± 0.167
MetAsp: 0.638 ± 0.511
MetGlu: 0.957 ± 0.433
MetPhe: 2.234 ± 1.005
MetGly: 0.638 ± 0.335
MetHis: 0.638 ± 0.335
MetIle: 2.872 ± 0.912
MetLys: 0.957 ± 0.502
MetLeu: 0.957 ± 0.502
MetMet: 0.0 ± 0.0
MetAsn: 0.638 ± 0.335
MetPro: 0.957 ± 0.433
MetGln: 0.0 ± 0.0
MetArg: 0.957 ± 0.904
MetSer: 1.914 ± 0.605
MetThr: 0.638 ± 0.335
MetVal: 1.914 ± 1.27
MetTrp: 0.0 ± 0.0
MetTyr: 2.234 ± 0.829
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.872 ± 1.201
AsnCys: 0.638 ± 0.954
AsnAsp: 2.872 ± 1.262
AsnGlu: 3.191 ± 1.674
AsnPhe: 1.914 ± 0.937
AsnGly: 2.553 ± 1.829
AsnHis: 1.914 ± 0.713
AsnIle: 1.914 ± 1.242
AsnLys: 3.191 ± 0.566
AsnLeu: 7.339 ± 1.719
AsnMet: 1.595 ± 0.837
AsnAsn: 2.872 ± 1.905
AsnPro: 3.829 ± 0.766
AsnGln: 2.872 ± 0.757
AsnArg: 2.553 ± 0.788
AsnSer: 5.105 ± 1.448
AsnThr: 3.829 ± 1.594
AsnVal: 2.872 ± 0.961
AsnTrp: 0.0 ± 0.0
AsnTyr: 4.467 ± 2.632
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.276 ± 1.251
ProCys: 0.319 ± 0.167
ProAsp: 2.553 ± 1.339
ProGlu: 3.829 ± 1.281
ProPhe: 1.914 ± 0.551
ProGly: 1.914 ± 0.937
ProHis: 1.595 ± 0.837
ProIle: 3.829 ± 0.601
ProLys: 0.957 ± 0.502
ProLeu: 2.553 ± 2.474
ProMet: 0.638 ± 0.738
ProAsn: 3.191 ± 0.914
ProPro: 1.595 ± 0.837
ProGln: 2.234 ± 1.439
ProArg: 2.872 ± 4.285
ProSer: 5.105 ± 0.928
ProThr: 1.595 ± 0.837
ProVal: 4.786 ± 0.858
ProTrp: 0.319 ± 1.029
ProTyr: 0.957 ± 0.433
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.638 ± 0.335
GlnCys: 0.319 ± 0.167
GlnAsp: 1.276 ± 0.412
GlnGlu: 1.914 ± 0.866
GlnPhe: 1.595 ± 0.457
GlnGly: 1.276 ± 0.412
GlnHis: 0.638 ± 0.511
GlnIle: 1.595 ± 0.837
GlnLys: 1.914 ± 0.605
GlnLeu: 3.191 ± 0.914
GlnMet: 0.638 ± 0.335
GlnAsn: 2.553 ± 1.094
GlnPro: 0.638 ± 0.511
GlnGln: 0.638 ± 0.335
GlnArg: 1.595 ± 1.354
GlnSer: 4.786 ± 1.679
GlnThr: 1.914 ± 0.713
GlnVal: 1.914 ± 0.551
GlnTrp: 0.319 ± 0.842
GlnTyr: 1.914 ± 0.866
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.234 ± 1.203
ArgCys: 2.234 ± 1.005
ArgAsp: 2.872 ± 1.772
ArgGlu: 3.829 ± 2.008
ArgPhe: 1.914 ± 0.937
ArgGly: 2.553 ± 0.813
ArgHis: 0.957 ± 0.502
ArgIle: 2.872 ± 0.82
ArgLys: 2.553 ± 0.813
ArgLeu: 2.872 ± 0.912
ArgMet: 0.638 ± 0.335
ArgAsn: 4.786 ± 2.355
ArgPro: 1.276 ± 0.579
ArgGln: 1.276 ± 0.579
ArgArg: 2.553 ± 1.094
ArgSer: 2.872 ± 1.225
ArgThr: 3.51 ± 1.161
ArgVal: 4.467 ± 1.364
ArgTrp: 0.957 ± 1.043
ArgTyr: 4.148 ± 1.357
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.382 ± 2.502
SerCys: 2.553 ± 0.633
SerAsp: 3.51 ± 1.841
SerGlu: 4.148 ± 1.935
SerPhe: 6.382 ± 1.95
SerGly: 3.191 ± 1.674
SerHis: 4.467 ± 0.706
SerIle: 5.424 ± 1.071
SerLys: 4.467 ± 1.364
SerLeu: 7.658 ± 1.799
SerMet: 2.553 ± 0.508
SerAsn: 4.467 ± 2.798
SerPro: 3.829 ± 2.403
SerGln: 4.148 ± 1.17
SerArg: 4.148 ± 0.603
SerSer: 7.339 ± 2.627
SerThr: 5.743 ± 1.388
SerVal: 5.105 ± 2.118
SerTrp: 0.319 ± 0.167
SerTyr: 4.148 ± 1.592
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.872 ± 0.961
ThrCys: 0.638 ± 0.954
ThrAsp: 2.553 ± 0.518
ThrGlu: 2.872 ± 0.854
ThrPhe: 3.829 ± 1.282
ThrGly: 2.553 ± 1.339
ThrHis: 1.595 ± 0.781
ThrIle: 1.914 ± 0.605
ThrLys: 2.234 ± 1.171
ThrLeu: 3.191 ± 1.257
ThrMet: 0.319 ± 0.625
ThrAsn: 3.51 ± 0.613
ThrPro: 3.191 ± 0.703
ThrGln: 2.234 ± 0.674
ThrArg: 3.51 ± 2.621
ThrSer: 6.063 ± 3.088
ThrThr: 3.51 ± 1.272
ThrVal: 5.424 ± 1.176
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.786 ± 1.399
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.51 ± 0.656
ValCys: 1.914 ± 0.713
ValAsp: 5.743 ± 0.488
ValGlu: 4.148 ± 1.592
ValPhe: 3.829 ± 0.737
ValGly: 2.553 ± 1.033
ValHis: 1.914 ± 0.551
ValIle: 5.105 ± 2.756
ValLys: 4.148 ± 1.592
ValLeu: 7.339 ± 2.302
ValMet: 0.957 ± 0.635
ValAsn: 3.51 ± 0.768
ValPro: 4.467 ± 1.652
ValGln: 2.553 ± 0.518
ValArg: 3.51 ± 1.193
ValSer: 7.02 ± 1.536
ValThr: 3.51 ± 0.61
ValVal: 6.701 ± 4.617
ValTrp: 0.0 ± 0.0
ValTyr: 4.786 ± 1.371
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.319 ± 0.167
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.957 ± 1.058
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.638 ± 0.727
TrpLys: 0.0 ± 0.0
TrpLeu: 0.638 ± 0.511
TrpMet: 0.319 ± 0.167
TrpAsn: 0.957 ± 1.977
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.319 ± 0.167
TrpSer: 0.0 ± 0.0
TrpThr: 0.319 ± 0.625
TrpVal: 0.319 ± 0.842
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.957 ± 1.043
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.191 ± 1.674
TyrCys: 1.914 ± 1.004
TyrAsp: 3.191 ± 0.566
TyrGlu: 4.467 ± 1.45
TyrPhe: 4.148 ± 3.634
TyrGly: 0.957 ± 0.433
TyrHis: 1.914 ± 0.937
TyrIle: 6.382 ± 1.068
TyrLys: 1.595 ± 0.803
TyrLeu: 5.105 ± 1.269
TyrMet: 0.319 ± 0.167
TyrAsn: 2.872 ± 1.201
TyrPro: 1.914 ± 0.866
TyrGln: 1.595 ± 0.837
TyrArg: 3.191 ± 0.72
TyrSer: 6.701 ± 2.082
TyrThr: 2.234 ± 0.998
TyrVal: 1.595 ± 0.932
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.191 ± 0.72
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3135 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
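Protein-level bootstrapping of this kind can be sketched in Python as below. The source states only that 100 bootstrap replicates were drawn at the protein level. The function names, the fixed seed, the within-protein counting of overlapping pairs, and the use of the standard deviation across replicates as the reported error are all assumptions made for this illustration.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (in per mille) over all overlapping pairs."""
    total = sum(max(len(p) - 1, 0) for p in proteins)
    hits = sum(
        1
        for p in proteins
        for i in range(len(p) - 1)
        if p[i:i + 2] == dipeptide
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Resample whole proteins with replacement and report the spread."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_replicates):
        # One replicate: draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(replicates)
```

With only 4 proteins in this proteome, each replicate is a draw of 4 proteins with replacement, so a single protein that is rich in a given dipeptide can dominate the spread. This may explain why some errors in the table exceed the frequency itself.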

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski