Amino acid dipeptide frequency for Aleutian mink disease parvovirus (strain G) (ADV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
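The expected value and the over/under comparison described above can be sketched in a few lines of Python. This is only an illustration: the function names `expected_permille` and `classify` are hypothetical, and the sketch assumes that a dipeptide is compared directly against the uniform expectation, with no tolerance for the estimated error.

```python
def expected_permille(alphabet_size: int = 20) -> float:
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_permille: float, alphabet_size: int = 20) -> str:
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if freq_permille > expected:
        return "over"       # would be shown in red
    if freq_permille < expected:
        return "under"      # would be shown in blue
    return "expected"


print(expected_permille())   # 20 amino acids -> 400 dipeptides
print(classify(19.349))      # GlyGly value from the table
print(classify(0.472))       # AlaCys value from the table
```

With 21 symbols (including Xaa) the expectation becomes 1000/441 per mille instead of 1000/400.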

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
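As a rough illustration of how such per mille values can be obtained, the sketch below counts overlapping residue pairs in a set of protein sequences. It is not the database's own code: the function name `dipeptide_permille` and the toy sequences are made up, one-letter codes are used instead of the three-letter codes in the table, and it assumes that pairs are counted within each protein only.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins
        # (an assumption of this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real ADV sequences.
freqs = dipeptide_permille(["MAAG", "GGA"])
for pair, value in sorted(freqs.items()):
    # Multiplying by 1e-3 converts per mille to a plain fraction.
    print(pair, value, value * 1e-3)
```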

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.775 ± 1.03
AlaCys: 0.472 ± 1.141
AlaAsp: 1.888 ± 0.853
AlaGlu: 3.303 ± 0.72
AlaPhe: 0.944 ± 0.913
AlaGly: 5.191 ± 1.799
AlaHis: 0.472 ± 0.456
AlaIle: 0.472 ± 0.456
AlaLys: 1.888 ± 0.496
AlaLeu: 1.416 ± 0.294
AlaMet: 0.472 ± 1.03
AlaAsn: 0.944 ± 0.913
AlaPro: 2.832 ± 0.804
AlaGln: 5.663 ± 3.948
AlaArg: 0.0 ± 0.0
AlaSer: 3.775 ± 0.804
AlaThr: 7.551 ± 1.582
AlaVal: 1.888 ± 0.496
AlaTrp: 2.36 ± 1.49
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.944 ± 1.076
CysCys: 0.472 ± 0.456
CysAsp: 0.944 ± 0.966
CysGlu: 0.944 ± 2.283
CysPhe: 0.472 ± 0.456
CysGly: 1.416 ± 1.231
CysHis: 1.888 ± 2.201
CysIle: 0.944 ± 0.913
CysLys: 4.719 ± 1.673
CysLeu: 1.416 ± 0.294
CysMet: 0.472 ± 0.456
CysAsn: 0.472 ± 0.877
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.944 ± 1.076
CysSer: 0.944 ± 0.427
CysThr: 0.472 ± 0.456
CysVal: 0.944 ± 0.913
CysTrp: 1.416 ± 0.778
CysTyr: 0.944 ± 0.427
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.303 ± 0.882
AspCys: 1.416 ± 1.231
AspAsp: 3.303 ± 0.966
AspGlu: 5.663 ± 1.089
AspPhe: 2.36 ± 0.573
AspGly: 3.775 ± 1.8
AspHis: 0.944 ± 0.427
AspIle: 1.416 ± 1.369
AspLys: 5.663 ± 2.026
AspLeu: 6.135 ± 2.698
AspMet: 0.944 ± 0.913
AspAsn: 7.551 ± 1.539
AspPro: 0.944 ± 0.913
AspGln: 0.472 ± 0.456
AspArg: 2.832 ± 0.587
AspSer: 4.247 ± 1.229
AspThr: 7.551 ± 2.848
AspVal: 0.944 ± 0.913
AspTrp: 0.472 ± 0.456
AspTyr: 3.303 ± 0.627
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.416 ± 0.698
GluCys: 1.888 ± 3.297
GluAsp: 4.719 ± 2.746
GluGlu: 5.663 ± 0.544
GluPhe: 2.832 ± 1.28
GluGly: 2.36 ± 1.49
GluHis: 0.944 ± 1.076
GluIle: 5.663 ± 1.089
GluLys: 0.472 ± 0.456
GluLeu: 2.832 ± 0.587
GluMet: 1.888 ± 0.496
GluAsn: 4.247 ± 1.38
GluPro: 1.888 ± 0.638
GluGln: 8.023 ± 1.826
GluArg: 2.36 ± 0.573
GluSer: 0.0 ± 0.0
GluThr: 3.775 ± 1.03
GluVal: 4.247 ± 1.38
GluTrp: 0.944 ± 0.427
GluTyr: 1.416 ± 1.1
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.416 ± 0.294
PheCys: 0.472 ± 0.456
PheAsp: 1.416 ± 1.369
PheGlu: 1.888 ± 0.853
PhePhe: 0.944 ± 0.427
PheGly: 0.472 ± 0.456
PheHis: 2.36 ± 0.573
PheIle: 1.888 ± 0.496
PheLys: 1.416 ± 0.294
PheLeu: 0.0 ± 0.0
PheMet: 0.472 ± 0.456
PheAsn: 8.495 ± 1.933
PhePro: 1.416 ± 0.294
PheGln: 2.36 ± 1.49
PheArg: 0.0 ± 0.0
PheSer: 1.888 ± 0.853
PheThr: 2.36 ± 0.573
PheVal: 3.775 ± 1.706
PheTrp: 0.944 ± 0.427
PheTyr: 0.472 ± 0.456
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.888 ± 0.853
GlyCys: 1.888 ± 1.825
GlyAsp: 2.36 ± 0.573
GlyGlu: 5.663 ± 1.581
GlyPhe: 1.888 ± 0.853
GlyGly: 19.349 ± 6.066
GlyHis: 0.944 ± 0.427
GlyIle: 1.416 ± 0.294
GlyLys: 6.135 ± 0.871
GlyLeu: 4.247 ± 1.38
GlyMet: 0.944 ± 0.913
GlyAsn: 5.191 ± 0.935
GlyPro: 3.303 ± 0.882
GlyGln: 3.303 ± 0.966
GlyArg: 1.888 ± 0.853
GlySer: 0.944 ± 0.427
GlyThr: 2.832 ± 0.587
GlyVal: 4.247 ± 1.351
GlyTrp: 1.416 ± 0.294
GlyTyr: 2.832 ± 0.587
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.416 ± 0.294
HisCys: 0.944 ± 0.913
HisAsp: 0.0 ± 0.0
HisGlu: 2.832 ± 0.854
HisPhe: 1.888 ± 0.853
HisGly: 1.888 ± 1.175
HisHis: 1.416 ± 0.294
HisIle: 1.416 ± 0.294
HisLys: 2.832 ± 1.28
HisLeu: 1.888 ± 0.638
HisMet: 1.416 ± 0.294
HisAsn: 0.944 ± 1.775
HisPro: 0.944 ± 0.427
HisGln: 2.36 ± 0.897
HisArg: 0.472 ± 0.456
HisSer: 1.888 ± 0.853
HisThr: 2.36 ± 1.217
HisVal: 0.944 ± 0.457
HisTrp: 0.0 ± 0.0
HisTyr: 0.472 ± 0.382
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.0 ± 0.0
IleCys: 0.472 ± 0.456
IleAsp: 4.247 ± 1.224
IleGlu: 3.775 ± 0.898
IlePhe: 0.944 ± 0.427
IleGly: 2.832 ± 1.28
IleHis: 2.36 ± 0.573
IleIle: 2.36 ± 1.069
IleLys: 5.663 ± 1.913
IleLeu: 2.36 ± 0.573
IleMet: 0.472 ± 0.456
IleAsn: 6.135 ± 1.26
IlePro: 4.247 ± 1.922
IleGln: 3.303 ± 0.72
IleArg: 1.416 ± 0.294
IleSer: 0.944 ± 0.427
IleThr: 3.775 ± 1.276
IleVal: 1.888 ± 0.638
IleTrp: 2.832 ± 1.516
IleTyr: 5.663 ± 1.53
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.832 ± 2.786
LysCys: 0.944 ± 1.076
LysAsp: 5.663 ± 2.026
LysGlu: 4.247 ± 3.521
LysPhe: 0.472 ± 0.456
LysGly: 3.775 ± 0.804
LysHis: 0.472 ± 0.456
LysIle: 3.303 ± 1.421
LysLys: 8.023 ± 1.976
LysLeu: 8.495 ± 2.335
LysMet: 0.472 ± 0.456
LysAsn: 3.303 ± 0.882
LysPro: 7.551 ± 0.51
LysGln: 6.607 ± 1.643
LysArg: 3.303 ± 1.061
LysSer: 1.416 ± 1.369
LysThr: 9.91 ± 2.052
LysVal: 6.135 ± 2.336
LysTrp: 0.0 ± 0.0
LysTyr: 2.832 ± 0.587
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 1.888 ± 1.825
LeuCys: 2.832 ± 0.6
LeuAsp: 4.719 ± 1.145
LeuGlu: 3.303 ± 0.627
LeuPhe: 3.303 ± 1.576
LeuGly: 5.663 ± 1.174
LeuHis: 0.472 ± 0.456
LeuIle: 4.719 ± 1.149
LeuLys: 4.247 ± 2.101
LeuLeu: 4.247 ± 1.701
LeuMet: 0.0 ± 0.0
LeuAsn: 4.247 ± 0.881
LeuPro: 1.888 ± 0.853
LeuGln: 5.191 ± 1.475
LeuArg: 6.135 ± 3.91
LeuSer: 3.775 ± 0.804
LeuThr: 7.551 ± 0.567
LeuVal: 1.888 ± 0.853
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.303 ± 1.576
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.416 ± 1.721
MetCys: 0.472 ± 0.877
MetAsp: 3.303 ± 0.72
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.472 ± 0.456
MetHis: 0.0 ± 0.0
MetIle: 1.416 ± 1.369
MetLys: 2.832 ± 0.587
MetLeu: 0.472 ± 0.456
MetMet: 0.0 ± 0.0
MetAsn: 2.36 ± 0.573
MetPro: 0.472 ± 0.456
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.416 ± 0.698
MetThr: 0.944 ± 0.427
MetVal: 1.416 ± 0.294
MetTrp: 0.0 ± 0.0
MetTyr: 0.472 ± 0.456
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.191 ± 1.799
AsnCys: 0.944 ± 1.775
AsnAsp: 2.36 ± 1.49
AsnGlu: 5.191 ± 1.511
AsnPhe: 7.079 ± 1.718
AsnGly: 1.888 ± 0.638
AsnHis: 1.416 ± 0.294
AsnIle: 6.135 ± 1.365
AsnLys: 6.607 ± 4.165
AsnLeu: 3.775 ± 1.276
AsnMet: 0.0 ± 0.0
AsnAsn: 8.966 ± 3.494
AsnPro: 4.719 ± 0.981
AsnGln: 3.775 ± 0.787
AsnArg: 2.36 ± 0.573
AsnSer: 2.832 ± 1.28
AsnThr: 3.775 ± 1.276
AsnVal: 2.36 ± 0.573
AsnTrp: 0.472 ± 0.456
AsnTyr: 2.36 ± 0.573
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.303 ± 1.504
ProCys: 1.888 ± 0.853
ProAsp: 0.944 ± 0.427
ProGlu: 2.36 ± 1.069
ProPhe: 0.944 ± 0.427
ProGly: 3.303 ± 0.72
ProHis: 1.888 ± 0.853
ProIle: 3.303 ± 0.966
ProLys: 4.247 ± 0.881
ProLeu: 4.247 ± 1.224
ProMet: 0.0 ± 0.0
ProAsn: 2.832 ± 0.587
ProPro: 4.247 ± 1.351
ProGln: 1.416 ± 0.829
ProArg: 1.416 ± 0.698
ProSer: 1.416 ± 0.294
ProThr: 3.775 ± 1.209
ProVal: 2.832 ± 0.587
ProTrp: 3.303 ± 0.966
ProTyr: 2.36 ± 0.573
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.775 ± 1.359
GlnCys: 0.472 ± 1.141
GlnAsp: 5.663 ± 1.011
GlnGlu: 2.36 ± 1.217
GlnPhe: 1.416 ± 0.294
GlnGly: 3.303 ± 0.966
GlnHis: 3.303 ± 1.59
GlnIle: 3.775 ± 1.209
GlnLys: 4.247 ± 1.224
GlnLeu: 3.303 ± 2.48
GlnMet: 2.36 ± 1.092
GlnAsn: 0.944 ± 0.427
GlnPro: 3.303 ± 0.882
GlnGln: 6.607 ± 0.805
GlnArg: 1.888 ± 1.786
GlnSer: 3.303 ± 1.595
GlnThr: 2.36 ± 0.649
GlnVal: 3.303 ± 0.966
GlnTrp: 0.944 ± 0.427
GlnTyr: 4.247 ± 1.38
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.888 ± 0.682
ArgCys: 1.416 ± 0.294
ArgAsp: 0.944 ± 1.076
ArgGlu: 0.944 ± 0.427
ArgPhe: 0.0 ± 0.0
ArgGly: 2.36 ± 0.573
ArgHis: 0.944 ± 1.146
ArgIle: 1.888 ± 0.853
ArgLys: 1.416 ± 1.196
ArgLeu: 3.775 ± 2.322
ArgMet: 1.416 ± 0.294
ArgAsn: 1.416 ± 0.294
ArgPro: 0.944 ± 0.427
ArgGln: 1.416 ± 0.294
ArgArg: 1.888 ± 2.511
ArgSer: 2.832 ± 1.044
ArgThr: 4.247 ± 1.351
ArgVal: 0.0 ± 0.0
ArgTrp: 1.416 ± 1.369
ArgTyr: 2.832 ± 1.28
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.888 ± 1.072
SerCys: 0.472 ± 0.456
SerAsp: 4.247 ± 1.224
SerGlu: 0.944 ± 0.427
SerPhe: 0.0 ± 0.0
SerGly: 1.888 ± 0.853
SerHis: 0.944 ± 0.913
SerIle: 0.944 ± 0.913
SerLys: 4.247 ± 0.99
SerLeu: 3.775 ± 1.276
SerMet: 1.416 ± 0.294
SerAsn: 4.719 ± 1.658
SerPro: 0.944 ± 0.427
SerGln: 1.888 ± 0.853
SerArg: 0.472 ± 0.877
SerSer: 1.888 ± 1.786
SerThr: 6.135 ± 1.73
SerVal: 0.944 ± 0.427
SerTrp: 0.944 ± 0.427
SerTyr: 2.832 ± 1.28
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.775 ± 0.804
ThrCys: 1.888 ± 0.853
ThrAsp: 7.079 ± 0.849
ThrGlu: 5.191 ± 2.343
ThrPhe: 1.416 ± 0.294
ThrGly: 4.247 ± 0.881
ThrHis: 3.775 ± 1.706
ThrIle: 4.719 ± 0.981
ThrLys: 10.854 ± 2.774
ThrLeu: 7.551 ± 1.71
ThrMet: 0.944 ± 0.856
ThrAsn: 5.191 ± 1.167
ThrPro: 5.191 ± 0.935
ThrGln: 1.416 ± 0.294
ThrArg: 3.775 ± 1.706
ThrSer: 5.191 ± 2.932
ThrThr: 6.607 ± 1.857
ThrVal: 2.832 ± 0.587
ThrTrp: 2.36 ± 0.573
ThrTyr: 2.832 ± 2.565
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.719 ± 1.069
ValCys: 0.944 ± 0.427
ValAsp: 3.775 ± 0.804
ValGlu: 0.944 ± 0.913
ValPhe: 1.416 ± 0.698
ValGly: 2.832 ± 0.587
ValHis: 1.416 ± 0.294
ValIle: 2.832 ± 0.587
ValLys: 3.303 ± 0.882
ValLeu: 2.36 ± 0.573
ValMet: 0.944 ± 0.427
ValAsn: 0.944 ± 0.913
ValPro: 0.944 ± 0.427
ValGln: 3.303 ± 1.576
ValArg: 0.0 ± 0.0
ValSer: 0.0 ± 0.0
ValThr: 7.079 ± 1.468
ValVal: 0.0 ± 0.0
ValTrp: 1.416 ± 0.294
ValTyr: 3.303 ± 1.504
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.472 ± 0.456
TrpCys: 0.0 ± 0.0
TrpAsp: 2.832 ± 0.587
TrpGlu: 1.416 ± 0.294
TrpPhe: 0.944 ± 0.913
TrpGly: 2.832 ± 1.28
TrpHis: 0.0 ± 0.0
TrpIle: 1.416 ± 0.294
TrpLys: 0.472 ± 0.456
TrpLeu: 1.888 ± 1.786
TrpMet: 1.888 ± 0.635
TrpAsn: 0.944 ± 0.427
TrpPro: 0.944 ± 0.427
TrpGln: 0.944 ± 1.076
TrpArg: 1.888 ± 0.853
TrpSer: 0.944 ± 0.427
TrpThr: 0.944 ± 1.076
TrpVal: 0.472 ± 0.456
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.944 ± 0.427
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.472 ± 1.141
TyrAsp: 2.832 ± 1.28
TyrGlu: 0.944 ± 0.427
TyrPhe: 5.191 ± 1.066
TyrGly: 2.832 ± 0.587
TyrHis: 2.36 ± 1.092
TyrIle: 5.663 ± 2.559
TyrLys: 0.472 ± 0.456
TyrLeu: 5.191 ± 1.511
TyrMet: 0.472 ± 0.384
TyrAsn: 2.36 ± 0.573
TyrPro: 3.303 ± 1.504
TyrGln: 2.832 ± 1.28
TyrArg: 0.944 ± 0.427
TyrSer: 1.888 ± 0.638
TyrThr: 2.832 ± 1.464
TyrVal: 1.416 ± 1.721
TyrTrp: 0.944 ± 2.283
TyrTyr: 2.36 ± 1.217
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (2120 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
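One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of the results. The sketch below follows that reading. It is an assumption-laden illustration, not the database's implementation: the function names are hypothetical, the error is taken to be the standard deviation over the replicates, and the sequences are toy data in one-letter code.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread (per mille) of one dipeptide frequency over protein resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.pstdev(estimates)


toy = ["MAAG", "GGA", "AAAA"]  # toy data, not real ADV proteins
print(bootstrap_error(toy, "AA"))
print(bootstrap_error(toy, "WW"))  # absent pair, so every replicate is 0
```

A dipeptide that never occurs gives zero in every replicate, which matches the `0.0 ± 0.0` entries in the table.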

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski