Amino acid dipeptide frequency for Angelonia flower break virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
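The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the mapping of every non-standard letter to `X` (for Xaa), the choice not to count pairs across protein boundaries, and the use of the uniform 1/400 expectation as the over/under threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption)

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2  # 2.5

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(ALPHABET, repeat=2)}

def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

With this sketch, `classify(12.641)` (the AlaAla value in the table) gives `"over"`, and `classify(0.665)` gives `"under"`.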

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 12.641 ± 3.511 | 1.996 ± 0.889 | 1.331 ± 1.56 | 4.657 ± 1.353 | 6.653 ± 2.658 | 0.665 ± 0.407 | 0.0 ± 0.0 | 5.323 ± 2.357 | 6.653 ± 2.348 | 9.98 ± 2.011 | 3.327 ± 1.169 | 5.323 ± 1.113 | 3.327 ± 1.685 | 2.661 ± 0.505 | 6.653 ± 2.124 | 3.992 ± 1.478 | 2.661 ± 2.829 | 2.661 ± 1.471 | 3.992 ± 1.146 | 3.327 ± 1.581 | 0.0 ± 0.0 |
| Cys | 1.996 ± 0.889 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.996 ± 0.889 | 2.661 ± 1.471 | 0.0 ± 0.0 | 1.331 ± 0.813 | 0.0 ± 0.0 | 2.661 ± 0.505 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.665 ± 0.407 | 1.331 ± 0.813 | 1.331 ± 0.813 | 1.331 ± 1.452 | 2.661 ± 1.115 | 1.331 ± 0.813 | 1.331 ± 0.735 | 1.996 ± 1.22 | 0.0 ± 0.0 |
| Asp | 3.327 ± 1.174 | 3.327 ± 1.511 | 1.331 ± 0.528 | 0.665 ± 0.707 | 1.331 ± 0.528 | 2.661 ± 0.505 | 1.331 ± 0.735 | 2.661 ± 1.373 | 1.996 ± 1.18 | 5.323 ± 1.841 | 1.996 ± 0.623 | 3.992 ± 1.652 | 3.327 ± 1.081 | 2.661 ± 1.131 | 0.665 ± 0.707 | 1.996 ± 1.991 | 3.327 ± 1.174 | 3.992 ± 1.478 | 0.0 ± 0.0 | 2.661 ± 1.171 | 0.0 ± 0.0 |
| Glu | 6.653 ± 3.161 | 0.665 ± 0.707 | 1.331 ± 0.528 | 4.657 ± 2.04 | 5.988 ± 1.455 | 1.996 ± 1.22 | 3.327 ± 1.511 | 3.992 ± 2.563 | 1.996 ± 1.22 | 1.996 ± 0.623 | 0.0 ± 0.0 | 0.665 ± 0.407 | 2.661 ± 1.627 | 0.665 ± 0.407 | 0.665 ± 0.407 | 3.327 ± 1.581 | 1.996 ± 0.889 | 3.992 ± 1.478 | 1.331 ± 0.528 | 1.996 ± 1.18 | 0.0 ± 0.0 |
| Phe | 2.661 ± 1.644 | 0.665 ± 0.407 | 3.327 ± 1.081 | 4.657 ± 1.142 | 1.996 ± 0.626 | 7.319 ± 1.637 | 0.0 ± 0.0 | 1.996 ± 0.626 | 1.996 ± 0.889 | 1.331 ± 1.452 | 1.331 ± 0.735 | 1.996 ± 0.623 | 0.665 ± 0.707 | 1.331 ± 0.528 | 1.996 ± 0.889 | 3.327 ± 1.258 | 5.988 ± 2.231 | 2.661 ± 0.91 | 0.0 ± 0.0 | 1.996 ± 1.22 | 0.0 ± 0.0 |
| Gly | 5.323 ± 1.859 | 1.996 ± 0.889 | 4.657 ± 1.267 | 5.323 ± 1.01 | 5.323 ± 1.164 | 5.988 ± 1.321 | 0.0 ± 0.0 | 5.323 ± 1.065 | 5.323 ± 2.557 | 5.988 ± 2.008 | 1.331 ± 0.813 | 3.327 ± 1.511 | 0.665 ± 0.407 | 0.665 ± 0.407 | 3.327 ± 1.081 | 5.323 ± 1.919 | 2.661 ± 1.115 | 4.657 ± 0.94 | 1.331 ± 0.528 | 0.665 ± 0.407 | 0.0 ± 0.0 |
| His | 3.327 ± 1.581 | 0.665 ± 0.407 | 0.665 ± 0.707 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.665 ± 0.407 | 1.996 ± 0.889 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.661 ± 1.471 | 0.0 ± 0.0 | 0.665 ± 0.407 | 2.661 ± 1.171 | 0.0 ± 0.0 | 0.665 ± 1.245 | 1.996 ± 0.623 | 2.661 ± 0.505 | 0.665 ± 0.407 | 0.665 ± 0.407 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 3.992 ± 1.733 | 0.0 ± 0.0 | 3.327 ± 1.174 | 1.331 ± 1.293 | 1.996 ± 0.623 | 6.653 ± 1.339 | 0.0 ± 0.0 | 3.327 ± 0.94 | 2.661 ± 1.373 | 1.996 ± 0.623 | 0.665 ± 0.707 | 3.327 ± 1.174 | 2.661 ± 0.505 | 1.331 ± 0.528 | 5.988 ± 1.455 | 5.323 ± 2.603 | 3.992 ± 0.726 | 4.657 ± 2.202 | 0.665 ± 0.407 | 1.996 ± 1.222 | 0.0 ± 0.0 |
| Lys | 5.323 ± 1.012 | 3.992 ± 1.779 | 3.327 ± 1.721 | 1.331 ± 1.293 | 1.996 ± 0.889 | 3.327 ± 0.848 | 0.0 ± 0.0 | 3.327 ± 1.081 | 3.327 ± 1.721 | 3.327 ± 1.437 | 0.665 ± 0.704 | 2.661 ± 1.171 | 2.661 ± 1.484 | 0.0 ± 0.0 | 4.657 ± 0.815 | 1.331 ± 1.293 | 5.323 ± 2.926 | 1.996 ± 0.889 | 1.331 ± 0.813 | 0.665 ± 0.707 | 0.665 ± 0.407 |
| Leu | 5.988 ± 1.773 | 0.665 ± 0.707 | 3.992 ± 1.461 | 5.323 ± 2.343 | 1.331 ± 0.528 | 5.323 ± 1.411 | 4.657 ± 1.262 | 3.992 ± 0.987 | 3.327 ± 1.511 | 9.315 ± 6.764 | 3.992 ± 1.217 | 5.988 ± 4.628 | 1.996 ± 2.611 | 3.327 ± 1.511 | 6.653 ± 1.697 | 7.319 ± 1.969 | 5.988 ± 1.924 | 5.323 ± 2.603 | 0.0 ± 0.0 | 1.331 ± 0.528 | 0.0 ± 0.0 |
| Met | 1.996 ± 0.889 | 0.0 ± 0.0 | 0.665 ± 1.38 | 0.665 ± 0.407 | 1.331 ± 0.735 | 1.331 ± 0.735 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.996 ± 0.623 | 1.331 ± 1.452 | 0.0 ± 0.0 | 1.331 ± 0.528 | 0.665 ± 0.407 | 0.665 ± 0.707 | 1.331 ± 0.735 | 4.657 ± 1.262 | 0.665 ± 0.707 | 4.657 ± 2.04 | 0.665 ± 0.407 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 5.323 ± 1.859 | 1.996 ± 1.222 | 2.661 ± 1.131 | 0.665 ± 0.407 | 2.661 ± 1.236 | 4.657 ± 2.202 | 0.665 ± 0.407 | 3.327 ± 1.685 | 0.665 ± 0.407 | 8.649 ± 4.656 | 1.331 ± 0.735 | 2.661 ± 0.91 | 1.996 ± 2.121 | 3.327 ± 1.759 | 2.661 ± 2.707 | 2.661 ± 1.166 | 3.327 ± 1.581 | 1.996 ± 0.623 | 0.665 ± 1.38 | 1.331 ± 0.735 | 0.0 ± 0.0 |
| Pro | 3.327 ± 0.67 | 0.665 ± 0.407 | 3.327 ± 1.511 | 3.992 ± 1.878 | 0.665 ± 1.245 | 1.331 ± 0.528 | 0.0 ± 0.0 | 1.331 ± 0.813 | 1.331 ± 0.528 | 3.327 ± 0.67 | 0.0 ± 0.0 | 1.331 ± 1.414 | 2.661 ± 0.91 | 1.996 ± 1.425 | 5.323 ± 1.672 | 3.992 ± 2.537 | 3.992 ± 0.726 | 6.653 ± 2.395 | 0.0 ± 0.0 | 0.665 ± 0.407 | 0.0 ± 0.0 |
| Gln | 0.665 ± 0.707 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.331 ± 0.813 | 1.996 ± 0.623 | 1.996 ± 1.18 | 1.331 ± 0.813 | 1.331 ± 1.165 | 1.996 ± 0.623 | 1.331 ± 0.813 | 2.661 ± 1.171 | 1.996 ± 0.626 | 2.661 ± 0.505 | 0.665 ± 0.407 | 3.327 ± 0.848 | 1.996 ± 1.991 | 1.996 ± 1.915 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.996 ± 1.212 | 0.0 ± 0.0 |
| Arg | 5.323 ± 1.859 | 0.0 ± 0.0 | 1.996 ± 0.889 | 3.327 ± 0.67 | 6.653 ± 1.75 | 5.323 ± 2.012 | 1.331 ± 0.735 | 2.661 ± 1.056 | 2.661 ± 1.236 | 5.988 ± 1.176 | 1.331 ± 1.019 | 2.661 ± 1.056 | 3.327 ± 0.67 | 0.665 ± 0.707 | 5.323 ± 1.737 | 1.996 ± 2.183 | 3.992 ± 1.146 | 6.653 ± 1.697 | 0.665 ± 0.707 | 3.327 ± 1.167 | 0.0 ± 0.0 |
| Ser | 3.992 ± 1.186 | 1.996 ± 0.623 | 4.657 ± 1.645 | 4.657 ± 2.3 | 1.996 ± 0.623 | 2.661 ± 1.947 | 2.661 ± 1.166 | 5.323 ± 2.994 | 4.657 ± 0.94 | 6.653 ± 1.77 | 0.0 ± 0.0 | 3.992 ± 1.186 | 1.996 ± 1.18 | 1.996 ± 1.237 | 2.661 ± 1.131 | 3.992 ± 2.871 | 2.661 ± 2.425 | 5.323 ± 1.395 | 1.996 ± 1.18 | 1.331 ± 0.735 | 0.0 ± 0.0 |
| Thr | 5.323 ± 1.164 | 0.665 ± 0.407 | 5.323 ± 2.23 | 1.331 ± 1.414 | 1.331 ± 0.813 | 5.323 ± 1.164 | 0.665 ± 0.707 | 4.657 ± 1.812 | 6.653 ± 1.196 | 3.992 ± 1.186 | 0.0 ± 0.357 | 3.992 ± 1.146 | 5.323 ± 1.092 | 1.331 ± 1.452 | 3.992 ± 2.049 | 3.327 ± 1.169 | 2.661 ± 1.056 | 3.992 ± 3.957 | 0.665 ± 0.407 | 1.996 ± 1.18 | 0.0 ± 0.0 |
| Val | 7.984 ± 2.396 | 1.331 ± 0.735 | 4.657 ± 1.262 | 4.657 ± 1.505 | 1.331 ± 0.528 | 5.988 ± 1.115 | 1.331 ± 0.528 | 2.661 ± 2.425 | 1.996 ± 1.222 | 5.323 ± 1.092 | 2.661 ± 0.647 | 3.992 ± 3.241 | 4.657 ± 0.94 | 1.331 ± 0.528 | 3.992 ± 1.146 | 4.657 ± 0.815 | 4.657 ± 4.462 | 5.988 ± 3.658 | 0.0 ± 0.0 | 0.665 ± 0.407 | 0.0 ± 0.0 |
| Trp | 0.665 ± 0.707 | 0.0 ± 0.0 | 0.665 ± 0.407 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.331 ± 0.528 | 0.0 ± 0.0 | 1.996 ± 2.121 | 0.0 ± 0.0 | 0.665 ± 0.407 | 1.996 ± 0.889 | 0.665 ± 0.407 | 0.0 ± 0.0 | 1.996 ± 0.889 | 0.665 ± 0.407 | 0.665 ± 0.407 | 0.665 ± 0.407 | 1.996 ± 1.347 | 1.331 ± 0.735 | 1.331 ± 0.735 | 0.0 ± 0.0 |
| Tyr | 1.996 ± 0.889 | 1.996 ± 0.626 | 1.331 ± 1.452 | 1.331 ± 0.735 | 0.665 ± 0.407 | 1.331 ± 0.528 | 0.665 ± 0.407 | 1.331 ± 0.735 | 1.996 ± 1.22 | 3.992 ± 1.878 | 0.0 ± 0.0 | 2.661 ± 1.171 | 1.331 ± 0.735 | 1.331 ± 0.528 | 3.327 ± 1.265 | 1.996 ± 1.222 | 1.331 ± 1.414 | 0.665 ± 1.245 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.665 ± 0.407 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5 proteins (1504 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
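A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the page does not say which spread statistic is reported, so the sample standard deviation of the resampled estimates is used here, and the function name, the `seed` parameter and the per-resample pooling of counts are hypothetical choices rather than the database's actual procedure.

```python
import random
import statistics

def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency under resampling.

    Whole proteins are drawn with replacement, so the resampling unit is
    the protein, not the individual residue or dipeptide.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        hits = 0
        total = 0
        for seq in sample:
            # Overlapping pairs within one protein only.
            pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
            hits += pairs.count(dipeptide)
            total += len(pairs)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: report the sample standard deviation of the estimates.
    return statistics.stdev(estimates)
```

With only 5 proteins in this proteome, each resample contains few distinct proteins, which is consistent with errors in the table that are sometimes as large as the estimates themselves.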

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski