Amino acid dipeptide frequency for Wuhan heteroptera virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
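The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the one-letter input format, the mapping of every unrecognised letter to Xaa, and the choice to count overlapping pairs within each protein separately are all assumptions, so the numbers it produces may differ slightly from the table.

```python
from collections import Counter

# One-letter to three-letter codes; anything else is treated as Xaa (assumption).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(sequences):
    """Return {(first, second): frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        residues = [ONE_TO_THREE.get(ch, "Xaa") for ch in seq.upper()]
        # Pairs are formed inside one protein only (hypothetical choice).
        counts.update(zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def uniform_expectation_permille(include_xaa=False):
    """Expected per mille frequency if all dipeptides were equally likely."""
    n = 21 if include_xaa else 20
    return 1000.0 / (n * n)


def representation(freq_permille, include_xaa=False):
    """Classify a frequency against the uniform expectation."""
    expected = uniform_expectation_permille(include_xaa)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With 20 amino acids the uniform expectation is 2.5‰, so a value such as 6.035‰ would be classed as overrepresented and 0.604‰ as underrepresented under this simple rule.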

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
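As a small illustration of the unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are hypothetical and chosen only for this sketch.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0
```

For example, the uniform expectation of 2.5‰ corresponds to 0.25%.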

For more information see sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry gives the dipeptide (first residue, then second residue), followed by its frequency ± error in ‰. Entries are grouped by first residue.
Ala
AlaAla: 6.035 ± 1.675
AlaCys: 0.604 ± 0.42
AlaAsp: 1.811 ± 0.902
AlaGlu: 4.828 ± 0.84
AlaPhe: 4.225 ± 1.948
AlaGly: 4.225 ± 0.778
AlaHis: 1.207 ± 0.536
AlaIle: 4.828 ± 0.516
AlaLys: 7.846 ± 4.034
AlaLeu: 6.035 ± 0.5
AlaMet: 3.621 ± 0.872
AlaAsn: 3.621 ± 0.991
AlaPro: 6.035 ± 1.675
AlaGln: 1.811 ± 0.672
AlaArg: 2.414 ± 1.843
AlaSer: 6.035 ± 1.161
AlaThr: 8.449 ± 1.401
AlaVal: 4.225 ± 1.48
AlaTrp: 1.207 ± 0.702
AlaTyr: 3.621 ± 2.185
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.811 ± 0.713
CysCys: 0.604 ± 0.42
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.604 ± 0.545
CysGly: 1.207 ± 0.536
CysHis: 0.0 ± 0.0
CysIle: 0.604 ± 0.42
CysLys: 0.604 ± 0.545
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.604 ± 0.42
CysGln: 1.811 ± 1.078
CysArg: 1.207 ± 0.445
CysSer: 0.0 ± 0.0
CysThr: 1.207 ± 0.445
CysVal: 0.604 ± 0.647
CysTrp: 0.604 ± 0.42
CysTyr: 0.604 ± 0.42
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.225 ± 1.171
AspCys: 0.0 ± 0.0
AspAsp: 1.811 ± 1.942
AspGlu: 3.018 ± 1.421
AspPhe: 1.811 ± 1.078
AspGly: 3.018 ± 0.989
AspHis: 1.207 ± 0.841
AspIle: 1.207 ± 0.445
AspLys: 0.0 ± 0.0
AspLeu: 0.604 ± 0.545
AspMet: 2.414 ± 0.691
AspAsn: 2.414 ± 0.691
AspPro: 1.811 ± 1.078
AspGln: 6.035 ± 0.5
AspArg: 1.811 ± 1.235
AspSer: 7.846 ± 1.764
AspThr: 3.621 ± 0.926
AspVal: 3.018 ± 0.669
AspTrp: 1.811 ± 1.078
AspTyr: 1.207 ± 0.536
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.225 ± 0.884
GluCys: 2.414 ± 1.738
GluAsp: 1.811 ± 0.713
GluGlu: 2.414 ± 1.041
GluPhe: 3.018 ± 0.574
GluGly: 1.811 ± 1.261
GluHis: 1.811 ± 0.713
GluIle: 1.811 ± 0.672
GluLys: 1.207 ± 0.536
GluLeu: 4.225 ± 0.778
GluMet: 1.811 ± 1.078
GluAsn: 0.0 ± 0.0
GluPro: 1.811 ± 0.292
GluGln: 2.414 ± 1.041
GluArg: 1.811 ± 1.112
GluSer: 4.828 ± 0.84
GluThr: 3.018 ± 0.574
GluVal: 2.414 ± 1.557
GluTrp: 0.0 ± 0.0
GluTyr: 1.207 ± 0.841
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.207 ± 0.536
PheCys: 0.0 ± 0.0
PheAsp: 5.432 ± 1.894
PheGlu: 3.018 ± 1.54
PhePhe: 0.604 ± 0.42
PheGly: 2.414 ± 1.738
PheHis: 0.0 ± 0.0
PheIle: 2.414 ± 1.072
PheLys: 0.604 ± 0.545
PheLeu: 1.811 ± 0.713
PheMet: 0.0 ± 0.0
PheAsn: 4.828 ± 0.882
PhePro: 1.207 ± 0.445
PheGln: 0.604 ± 0.545
PheArg: 2.414 ± 2.589
PheSer: 1.811 ± 0.902
PheThr: 1.811 ± 0.292
PheVal: 1.811 ± 0.902
PheTrp: 0.0 ± 0.0
PheTyr: 1.207 ± 0.702
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.621 ± 1.335
GlyCys: 0.0 ± 0.0
GlyAsp: 3.018 ± 1.54
GlyGlu: 0.0 ± 0.0
GlyPhe: 4.828 ± 0.351
GlyGly: 4.828 ± 0.516
GlyHis: 4.225 ± 2.143
GlyIle: 3.018 ± 1.421
GlyLys: 3.018 ± 1.421
GlyLeu: 3.621 ± 2.223
GlyMet: 0.604 ± 0.647
GlyAsn: 3.018 ± 0.669
GlyPro: 1.811 ± 1.942
GlyGln: 3.018 ± 1.19
GlyArg: 3.621 ± 0.583
GlySer: 5.432 ± 1.536
GlyThr: 4.225 ± 2.319
GlyVal: 3.018 ± 1.21
GlyTrp: 1.207 ± 1.294
GlyTyr: 3.018 ± 0.989
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.207 ± 0.536
HisCys: 0.0 ± 0.0
HisAsp: 0.604 ± 0.545
HisGlu: 1.811 ± 1.112
HisPhe: 2.414 ± 0.691
HisGly: 1.207 ± 0.536
HisHis: 0.604 ± 0.42
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.811 ± 1.261
HisMet: 0.604 ± 0.647
HisAsn: 1.811 ± 0.292
HisPro: 0.0 ± 0.0
HisGln: 0.604 ± 0.42
HisArg: 1.207 ± 0.445
HisSer: 1.811 ± 0.672
HisThr: 1.207 ± 0.445
HisVal: 1.811 ± 0.292
HisTrp: 0.604 ± 0.647
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.639 ± 0.294
IleCys: 1.811 ± 0.902
IleAsp: 0.604 ± 0.647
IleGlu: 1.207 ± 0.841
IlePhe: 0.604 ± 0.42
IleGly: 4.225 ± 1.471
IleHis: 0.604 ± 0.42
IleIle: 0.604 ± 0.42
IleLys: 0.604 ± 0.647
IleLeu: 3.018 ± 1.419
IleMet: 3.018 ± 1.419
IleAsn: 0.604 ± 0.545
IlePro: 1.811 ± 0.672
IleGln: 3.018 ± 1.19
IleArg: 2.414 ± 0.175
IleSer: 3.018 ± 1.421
IleThr: 3.018 ± 2.726
IleVal: 3.018 ± 0.669
IleTrp: 1.811 ± 0.292
IleTyr: 1.207 ± 1.294
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.414 ± 2.181
LysCys: 0.604 ± 0.42
LysAsp: 1.811 ± 0.672
LysGlu: 2.414 ± 0.175
LysPhe: 1.207 ± 1.294
LysGly: 1.207 ± 0.841
LysHis: 2.414 ± 0.691
LysIle: 3.018 ± 1.21
LysLys: 1.811 ± 0.292
LysLeu: 5.432 ± 2.017
LysMet: 0.0 ± 0.0
LysAsn: 1.811 ± 0.902
LysPro: 3.018 ± 1.314
LysGln: 2.414 ± 0.9
LysArg: 4.225 ± 0.739
LysSer: 3.621 ± 0.991
LysThr: 2.414 ± 0.691
LysVal: 2.414 ± 1.738
LysTrp: 2.414 ± 0.175
LysTyr: 1.207 ± 0.841
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.639 ± 1.589
LeuCys: 1.811 ± 0.713
LeuAsp: 4.828 ± 1.216
LeuGlu: 4.225 ± 0.739
LeuPhe: 1.207 ± 0.536
LeuGly: 3.621 ± 0.662
LeuHis: 0.0 ± 0.0
LeuIle: 7.242 ± 0.825
LeuLys: 4.828 ± 1.382
LeuLeu: 7.846 ± 2.401
LeuMet: 3.018 ± 0.669
LeuAsn: 1.811 ± 0.713
LeuPro: 9.053 ± 0.61
LeuGln: 4.225 ± 2.236
LeuArg: 3.621 ± 1.826
LeuSer: 6.639 ± 1.131
LeuThr: 4.225 ± 0.778
LeuVal: 4.225 ± 0.739
LeuTrp: 0.604 ± 0.545
LeuTyr: 3.621 ± 0.583
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.621 ± 1.304
MetCys: 0.0 ± 0.0
MetAsp: 2.414 ± 1.404
MetGlu: 2.414 ± 1.681
MetPhe: 0.0 ± 0.0
MetGly: 3.018 ± 1.419
MetHis: 0.0 ± 0.0
MetIle: 0.604 ± 0.647
MetLys: 1.811 ± 1.942
MetLeu: 3.018 ± 0.989
MetMet: 0.0 ± 0.0
MetAsn: 0.604 ± 0.42
MetPro: 1.207 ± 0.536
MetGln: 1.207 ± 0.702
MetArg: 0.604 ± 0.42
MetSer: 1.811 ± 0.672
MetThr: 1.811 ± 1.078
MetVal: 2.414 ± 0.175
MetTrp: 0.0 ± 0.0
MetTyr: 0.604 ± 0.545
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.225 ± 0.235
AsnCys: 0.0 ± 0.0
AsnAsp: 1.811 ± 1.078
AsnGlu: 0.604 ± 0.42
AsnPhe: 1.207 ± 0.841
AsnGly: 1.811 ± 1.235
AsnHis: 0.604 ± 0.647
AsnIle: 1.811 ± 1.112
AsnLys: 1.811 ± 0.672
AsnLeu: 1.207 ± 0.445
AsnMet: 0.604 ± 0.938
AsnAsn: 2.414 ± 1.681
AsnPro: 2.414 ± 1.681
AsnGln: 1.811 ± 0.672
AsnArg: 2.414 ± 1.072
AsnSer: 4.225 ± 2.22
AsnThr: 3.621 ± 0.662
AsnVal: 2.414 ± 1.557
AsnTrp: 0.604 ± 0.647
AsnTyr: 1.207 ± 0.536
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.414 ± 0.691
ProCys: 0.604 ± 0.545
ProAsp: 5.432 ± 1.702
ProGlu: 2.414 ± 1.072
ProPhe: 0.604 ± 0.545
ProGly: 3.621 ± 0.583
ProHis: 0.604 ± 0.545
ProIle: 1.811 ± 0.292
ProLys: 2.414 ± 0.89
ProLeu: 6.639 ± 0.294
ProMet: 0.604 ± 0.42
ProAsn: 1.811 ± 0.713
ProPro: 2.414 ± 1.029
ProGln: 4.828 ± 1.829
ProArg: 3.621 ± 1.32
ProSer: 9.656 ± 2.532
ProThr: 6.035 ± 1.573
ProVal: 2.414 ± 1.423
ProTrp: 0.0 ± 0.0
ProTyr: 3.018 ± 1.19
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.621 ± 1.335
GlnCys: 1.207 ± 0.841
GlnAsp: 1.811 ± 0.713
GlnGlu: 3.018 ± 0.384
GlnPhe: 1.207 ± 0.841
GlnGly: 1.811 ± 0.902
GlnHis: 1.207 ± 0.445
GlnIle: 2.414 ± 1.041
GlnLys: 1.811 ± 0.672
GlnLeu: 6.035 ± 2.34
GlnMet: 1.207 ± 1.096
GlnAsn: 1.811 ± 0.713
GlnPro: 6.639 ± 1.847
GlnGln: 3.018 ± 0.989
GlnArg: 2.414 ± 0.9
GlnSer: 7.242 ± 2.154
GlnThr: 4.828 ± 1.13
GlnVal: 1.207 ± 0.536
GlnTrp: 1.207 ± 0.445
GlnTyr: 2.414 ± 1.072
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.035 ± 0.807
ArgCys: 0.0 ± 0.0
ArgAsp: 1.207 ± 1.091
ArgGlu: 3.621 ± 2.223
ArgPhe: 1.207 ± 1.294
ArgGly: 3.621 ± 0.583
ArgHis: 1.207 ± 0.841
ArgIle: 1.811 ± 1.235
ArgLys: 2.414 ± 1.423
ArgLeu: 6.639 ± 1.569
ArgMet: 1.811 ± 1.112
ArgAsn: 1.811 ± 0.713
ArgPro: 3.018 ± 1.314
ArgGln: 4.828 ± 0.84
ArgArg: 6.639 ± 3.873
ArgSer: 5.432 ± 1.911
ArgThr: 3.621 ± 1.608
ArgVal: 6.035 ± 0.5
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.207 ± 0.702
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.449 ± 2.078
SerCys: 0.604 ± 0.545
SerAsp: 4.828 ± 1.279
SerGlu: 3.621 ± 1.335
SerPhe: 1.207 ± 1.091
SerGly: 9.053 ± 3.401
SerHis: 1.811 ± 1.078
SerIle: 2.414 ± 1.029
SerLys: 7.846 ± 1.12
SerLeu: 6.035 ± 1.984
SerMet: 1.811 ± 0.902
SerAsn: 2.414 ± 0.89
SerPro: 4.828 ± 1.78
SerGln: 5.432 ± 2.446
SerArg: 4.225 ± 1.742
SerSer: 11.467 ± 2.634
SerThr: 7.846 ± 3.197
SerVal: 6.639 ± 0.677
SerTrp: 0.604 ± 0.647
SerTyr: 4.225 ± 1.177
SerXaa: 0.604 ± 0.545
Thr
ThrAla: 7.242 ± 1.747
ThrCys: 1.207 ± 0.445
ThrAsp: 4.828 ± 0.84
ThrGlu: 2.414 ± 1.072
ThrPhe: 1.811 ± 1.078
ThrGly: 3.018 ± 1.902
ThrHis: 0.604 ± 0.545
ThrIle: 3.621 ± 0.413
ThrLys: 3.621 ± 1.745
ThrLeu: 8.449 ± 1.311
ThrMet: 1.811 ± 0.713
ThrAsn: 1.207 ± 1.294
ThrPro: 9.053 ± 1.151
ThrGln: 2.414 ± 0.691
ThrArg: 4.828 ± 1.279
ThrSer: 6.035 ± 1.279
ThrThr: 5.432 ± 1.208
ThrVal: 6.035 ± 1.573
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.414 ± 0.89
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.432 ± 1.077
ValCys: 0.0 ± 0.0
ValAsp: 1.811 ± 0.292
ValGlu: 1.811 ± 1.078
ValPhe: 3.018 ± 1.735
ValGly: 1.811 ± 0.292
ValHis: 1.207 ± 0.445
ValIle: 2.414 ± 0.175
ValLys: 1.811 ± 0.292
ValLeu: 6.639 ± 0.838
ValMet: 3.018 ± 0.669
ValAsn: 1.811 ± 1.112
ValPro: 3.018 ± 1.314
ValGln: 3.621 ± 1.32
ValArg: 6.639 ± 2.068
ValSer: 3.621 ± 1.335
ValThr: 3.621 ± 0.662
ValVal: 2.414 ± 0.175
ValTrp: 3.018 ± 1.314
ValTyr: 3.018 ± 1.957
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.207 ± 1.294
TrpCys: 0.604 ± 0.545
TrpAsp: 1.207 ± 0.702
TrpGlu: 0.604 ± 0.545
TrpPhe: 0.0 ± 0.0
TrpGly: 0.604 ± 0.647
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.811 ± 0.292
TrpMet: 0.604 ± 0.647
TrpAsn: 0.604 ± 0.545
TrpPro: 0.0 ± 0.0
TrpGln: 0.604 ± 0.42
TrpArg: 1.207 ± 0.841
TrpSer: 2.414 ± 0.9
TrpThr: 4.225 ± 1.471
TrpVal: 0.604 ± 0.42
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.604 ± 0.545
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.414 ± 0.9
TyrCys: 0.0 ± 0.0
TyrAsp: 1.811 ± 0.713
TyrGlu: 0.604 ± 0.42
TyrPhe: 2.414 ± 2.589
TyrGly: 3.018 ± 1.902
TyrHis: 0.0 ± 0.0
TyrIle: 1.207 ± 0.445
TyrLys: 1.811 ± 1.078
TyrLeu: 2.414 ± 0.89
TyrMet: 0.0 ± 0.0
TyrAsn: 3.018 ± 1.421
TyrPro: 1.207 ± 0.536
TyrGln: 2.414 ± 0.175
TyrArg: 4.225 ± 0.739
TyrSer: 3.018 ± 1.54
TyrThr: 1.811 ± 1.942
TyrVal: 3.018 ± 0.384
TyrTrp: 1.207 ± 0.445
TyrTyr: 0.604 ± 0.42
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.604 ± 0.545
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1658 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
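One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption-laden sketch, not the database's procedure. The function names, the fixed seed, the use of one-letter sequences, and the choice of the population standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Per mille frequency of one dipeptide, e.g. ("S", "S"), in the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(zip(seq, seq[1:]))  # overlapping pairs within a protein
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

With only a few proteins in the set, as here, each resample can differ a lot from the original, which would be consistent with errors that are large relative to the frequencies themselves.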

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski