Amino acid dipeptide frequency for Penicillium roqueforti ssRNA mycovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
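The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs only within a protein (never across two proteins) are assumptions made for the example. It uses one-letter residue codes, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: pairs are counted within each protein only,
        # so no dipeptide spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Expectation if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy_proteins = ["MKLLA", "ALLK"]  # made-up sequences, not the viral proteome
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PER_MILLE else "under"
    print(pair, round(value, 1), label)
```

On such short toy input every observed pair is far above 2.5 ‰; the comparison only becomes meaningful for proteomes with many more residues than there are possible dipeptides.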

All values are presented in per mille (‰), and therefore need to be multiplied by 10⁻³ to obtain fractions.
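As a worked example of this conversion, take the AlaAla entry from the table (5.12 ‰) and, for comparison, the uniform expectation of one in 400:

```latex
\[
5.12\,\text{‰} = 5.12 \times 10^{-3} = 0.00512 = 0.512\,\%
\qquad\text{and}\qquad
\frac{1}{400} = 0.0025 = 0.25\,\% = 2.5\,\text{‰}
\]
```

So a table value above 2.5 counts as more frequent than the uniform expectation, and a value below 2.5 as less frequent.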

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.12 ± 0.233
AlaCys: 0.512 ± 0.251
AlaAsp: 4.096 ± 4.825
AlaGlu: 4.096 ± 2.547
AlaPhe: 2.048 ± 0.135
AlaGly: 2.56 ± 0.116
AlaHis: 2.048 ± 0.135
AlaIle: 3.072 ± 1.911
AlaLys: 4.096 ± 0.269
AlaLeu: 11.777 ± 2.198
AlaMet: 1.024 ± 0.637
AlaAsn: 3.072 ± 0.772
AlaPro: 2.048 ± 0.135
AlaGln: 4.096 ± 1.408
AlaArg: 4.096 ± 0.269
AlaSer: 7.168 ± 4.458
AlaThr: 3.072 ± 0.772
AlaVal: 4.608 ± 0.018
AlaTrp: 1.536 ± 0.753
AlaTyr: 1.536 ± 1.525
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.512 ± 0.251
CysAsp: 0.512 ± 0.251
CysGlu: 0.0 ± 0.0
CysPhe: 0.512 ± 0.251
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.512 ± 0.251
CysLys: 0.512 ± 0.251
CysLeu: 0.0 ± 0.0
CysMet: 0.512 ± 0.251
CysAsn: 0.0 ± 0.0
CysPro: 0.512 ± 0.251
CysGln: 0.512 ± 0.251
CysArg: 0.512 ± 0.251
CysSer: 0.512 ± 0.251
CysThr: 0.512 ± 0.251
CysVal: 1.536 ± 0.753
CysTrp: 0.512 ± 0.251
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.584 ± 2.798
AspCys: 0.0 ± 0.0
AspAsp: 1.024 ± 0.637
AspGlu: 2.56 ± 0.116
AspPhe: 2.048 ± 0.135
AspGly: 2.56 ± 1.255
AspHis: 1.024 ± 0.502
AspIle: 1.536 ± 0.386
AspLys: 1.024 ± 0.637
AspLeu: 10.241 ± 0.465
AspMet: 1.536 ± 0.194
AspAsn: 1.024 ± 0.502
AspPro: 4.608 ± 0.018
AspGln: 1.536 ± 0.753
AspArg: 2.56 ± 1.255
AspSer: 5.12 ± 2.511
AspThr: 1.024 ± 0.637
AspVal: 5.12 ± 1.372
AspTrp: 1.024 ± 0.502
AspTyr: 3.072 ± 1.506
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.656 ± 2.431
GluCys: 0.0 ± 0.0
GluAsp: 2.56 ± 1.255
GluGlu: 5.632 ± 2.933
GluPhe: 1.536 ± 0.753
GluGly: 2.048 ± 1.004
GluHis: 0.0 ± 0.0
GluIle: 1.536 ± 0.386
GluLys: 7.168 ± 3.319
GluLeu: 9.217 ± 3.454
GluMet: 0.0 ± 0.0
GluAsn: 2.56 ± 1.023
GluPro: 4.608 ± 0.018
GluGln: 2.56 ± 3.301
GluArg: 2.048 ± 2.413
GluSer: 4.608 ± 4.574
GluThr: 2.048 ± 1.274
GluVal: 4.096 ± 0.87
GluTrp: 1.536 ± 0.753
GluTyr: 1.536 ± 0.386
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.56 ± 1.255
PheCys: 0.512 ± 0.251
PheAsp: 2.048 ± 0.135
PheGlu: 3.072 ± 0.367
PhePhe: 1.024 ± 0.502
PheGly: 2.048 ± 0.135
PheHis: 0.512 ± 0.251
PheIle: 3.584 ± 1.757
PheLys: 2.56 ± 0.116
PheLeu: 5.12 ± 2.511
PheMet: 0.512 ± 0.251
PheAsn: 3.072 ± 1.911
PhePro: 0.512 ± 0.251
PheGln: 0.0 ± 0.0
PheArg: 3.584 ± 1.757
PheSer: 3.584 ± 0.618
PheThr: 1.024 ± 0.637
PheVal: 3.584 ± 0.618
PheTrp: 2.048 ± 2.413
PheTyr: 3.072 ± 1.506
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.096 ± 0.269
GlyCys: 0.512 ± 0.251
GlyAsp: 2.56 ± 0.116
GlyGlu: 1.536 ± 1.525
GlyPhe: 3.072 ± 0.772
GlyGly: 1.536 ± 0.753
GlyHis: 1.536 ± 0.753
GlyIle: 1.536 ± 0.753
GlyLys: 4.608 ± 0.018
GlyLeu: 9.217 ± 3.38
GlyMet: 2.56 ± 1.023
GlyAsn: 0.512 ± 0.251
GlyPro: 1.024 ± 0.502
GlyGln: 3.584 ± 0.521
GlyArg: 5.12 ± 0.233
GlySer: 3.584 ± 0.521
GlyThr: 2.56 ± 1.255
GlyVal: 4.096 ± 2.009
GlyTrp: 1.536 ± 0.753
GlyTyr: 2.048 ± 0.135
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.536 ± 0.753
HisCys: 0.0 ± 0.0
HisAsp: 2.048 ± 1.004
HisGlu: 0.512 ± 0.251
HisPhe: 0.512 ± 0.251
HisGly: 0.0 ± 0.0
HisHis: 1.024 ± 0.502
HisIle: 1.536 ± 0.753
HisLys: 1.536 ± 0.386
HisLeu: 1.536 ± 0.753
HisMet: 0.512 ± 0.251
HisAsn: 0.512 ± 0.251
HisPro: 0.512 ± 0.251
HisGln: 0.512 ± 0.251
HisArg: 1.536 ± 0.753
HisSer: 1.536 ± 0.753
HisThr: 1.536 ± 0.753
HisVal: 1.024 ± 0.502
HisTrp: 1.536 ± 0.386
HisTyr: 1.536 ± 0.753
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.584 ± 0.618
IleCys: 0.512 ± 0.251
IleAsp: 3.072 ± 0.367
IleGlu: 2.56 ± 0.116
IlePhe: 2.56 ± 1.255
IleGly: 2.56 ± 0.116
IleHis: 1.536 ± 0.753
IleIle: 3.584 ± 1.757
IleLys: 2.56 ± 1.023
IleLeu: 5.12 ± 1.372
IleMet: 1.024 ± 0.502
IleAsn: 1.536 ± 0.386
IlePro: 2.048 ± 0.135
IleGln: 1.536 ± 0.753
IleArg: 4.096 ± 0.269
IleSer: 5.632 ± 1.623
IleThr: 0.512 ± 0.251
IleVal: 1.536 ± 0.386
IleTrp: 1.024 ± 0.502
IleTyr: 0.512 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.68 ± 8.763
LysCys: 0.512 ± 0.251
LysAsp: 3.072 ± 0.367
LysGlu: 6.144 ± 3.821
LysPhe: 2.56 ± 1.023
LysGly: 5.12 ± 0.906
LysHis: 1.024 ± 0.502
LysIle: 2.56 ± 0.116
LysLys: 3.072 ± 0.772
LysLeu: 5.632 ± 5.211
LysMet: 2.048 ± 1.004
LysAsn: 2.56 ± 0.116
LysPro: 2.048 ± 1.274
LysGln: 1.536 ± 0.386
LysArg: 4.608 ± 1.157
LysSer: 4.608 ± 1.157
LysThr: 3.584 ± 2.798
LysVal: 4.608 ± 2.26
LysTrp: 1.536 ± 0.753
LysTyr: 1.024 ± 0.502
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.656 ± 0.986
LeuCys: 1.024 ± 0.502
LeuAsp: 7.68 ± 1.488
LeuGlu: 6.144 ± 2.682
LeuPhe: 6.656 ± 0.986
LeuGly: 8.193 ± 1.739
LeuHis: 2.56 ± 1.255
LeuIle: 4.096 ± 0.87
LeuLys: 9.217 ± 8.01
LeuLeu: 11.265 ± 3.246
LeuMet: 2.048 ± 0.135
LeuAsn: 5.632 ± 0.655
LeuPro: 2.56 ± 1.255
LeuGln: 5.12 ± 0.906
LeuArg: 7.68 ± 0.79
LeuSer: 11.777 ± 2.358
LeuThr: 5.12 ± 2.511
LeuVal: 7.68 ± 2.627
LeuTrp: 2.048 ± 0.135
LeuTyr: 3.072 ± 0.367
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.536 ± 0.386
MetCys: 0.0 ± 0.0
MetAsp: 1.536 ± 0.753
MetGlu: 1.536 ± 1.525
MetPhe: 1.024 ± 0.502
MetGly: 1.536 ± 0.753
MetHis: 0.512 ± 0.251
MetIle: 1.536 ± 0.753
MetLys: 1.024 ± 0.637
MetLeu: 0.512 ± 0.251
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.536 ± 0.753
MetGln: 1.536 ± 0.753
MetArg: 1.024 ± 0.637
MetSer: 3.072 ± 1.911
MetThr: 0.512 ± 0.888
MetVal: 2.048 ± 1.004
MetTrp: 1.024 ± 0.502
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.536 ± 1.525
AsnCys: 0.512 ± 0.251
AsnAsp: 0.512 ± 0.888
AsnGlu: 4.608 ± 2.296
AsnPhe: 1.536 ± 0.386
AsnGly: 2.56 ± 0.116
AsnHis: 1.024 ± 0.502
AsnIle: 2.048 ± 1.004
AsnLys: 2.048 ± 1.004
AsnLeu: 3.584 ± 0.618
AsnMet: 1.024 ± 0.502
AsnAsn: 2.048 ± 0.135
AsnPro: 1.536 ± 0.753
AsnGln: 0.512 ± 0.888
AsnArg: 1.024 ± 0.637
AsnSer: 3.584 ± 0.618
AsnThr: 2.56 ± 2.162
AsnVal: 2.56 ± 1.255
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.024 ± 0.502
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.56 ± 1.023
ProCys: 0.0 ± 0.0
ProAsp: 3.584 ± 1.757
ProGlu: 2.048 ± 2.413
ProPhe: 2.56 ± 1.255
ProGly: 0.512 ± 0.251
ProHis: 0.512 ± 0.251
ProIle: 2.048 ± 0.135
ProLys: 1.536 ± 0.386
ProLeu: 3.072 ± 0.772
ProMet: 0.0 ± 0.226
ProAsn: 3.072 ± 1.506
ProPro: 1.536 ± 0.753
ProGln: 0.512 ± 0.251
ProArg: 2.048 ± 1.004
ProSer: 3.072 ± 1.506
ProThr: 3.072 ± 1.506
ProVal: 4.096 ± 0.269
ProTrp: 1.536 ± 0.753
ProTyr: 1.536 ± 0.753
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.072 ± 3.05
GlnCys: 0.0 ± 0.0
GlnAsp: 1.024 ± 0.502
GlnGlu: 4.096 ± 0.269
GlnPhe: 2.048 ± 1.004
GlnGly: 2.048 ± 0.135
GlnHis: 0.512 ± 0.251
GlnIle: 1.536 ± 0.386
GlnLys: 3.072 ± 1.911
GlnLeu: 5.632 ± 2.762
GlnMet: 0.512 ± 0.888
GlnAsn: 0.512 ± 0.251
GlnPro: 0.512 ± 0.888
GlnGln: 1.024 ± 0.502
GlnArg: 0.512 ± 0.251
GlnSer: 0.512 ± 0.888
GlnThr: 1.024 ± 0.637
GlnVal: 3.584 ± 0.521
GlnTrp: 0.512 ± 0.251
GlnTyr: 0.512 ± 0.888
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.144 ± 1.543
ArgCys: 0.512 ± 0.251
ArgAsp: 1.024 ± 0.502
ArgGlu: 6.656 ± 2.431
ArgPhe: 1.536 ± 0.753
ArgGly: 2.048 ± 1.004
ArgHis: 1.024 ± 0.502
ArgIle: 3.072 ± 0.367
ArgLys: 4.096 ± 0.269
ArgLeu: 7.168 ± 1.041
ArgMet: 2.048 ± 1.274
ArgAsn: 0.512 ± 0.251
ArgPro: 0.512 ± 0.251
ArgGln: 1.536 ± 0.386
ArgArg: 4.608 ± 1.157
ArgSer: 6.656 ± 0.986
ArgThr: 2.56 ± 1.255
ArgVal: 3.584 ± 1.757
ArgTrp: 1.024 ± 0.502
ArgTyr: 2.56 ± 1.023
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.608 ± 2.296
SerCys: 1.536 ± 0.753
SerAsp: 5.12 ± 0.233
SerGlu: 3.584 ± 0.618
SerPhe: 4.096 ± 0.269
SerGly: 7.168 ± 1.041
SerHis: 1.536 ± 0.753
SerIle: 4.096 ± 2.009
SerLys: 4.608 ± 1.157
SerLeu: 9.217 ± 2.315
SerMet: 1.536 ± 0.386
SerAsn: 1.536 ± 0.753
SerPro: 3.072 ± 1.506
SerGln: 1.024 ± 0.637
SerArg: 7.168 ± 0.098
SerSer: 8.193 ± 2.817
SerThr: 6.656 ± 0.986
SerVal: 6.656 ± 1.292
SerTrp: 3.072 ± 0.772
SerTyr: 3.072 ± 0.367
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.072 ± 0.772
ThrCys: 0.512 ± 0.251
ThrAsp: 3.584 ± 0.521
ThrGlu: 0.512 ± 0.251
ThrPhe: 3.584 ± 0.618
ThrGly: 5.12 ± 1.372
ThrHis: 1.024 ± 0.502
ThrIle: 3.584 ± 0.521
ThrLys: 2.048 ± 1.274
ThrLeu: 4.096 ± 2.009
ThrMet: 1.536 ± 0.753
ThrAsn: 2.048 ± 1.004
ThrPro: 4.608 ± 0.018
ThrGln: 0.0 ± 0.0
ThrArg: 2.048 ± 0.135
ThrSer: 5.632 ± 1.794
ThrThr: 4.608 ± 1.121
ThrVal: 1.536 ± 0.753
ThrTrp: 0.512 ± 0.888
ThrTyr: 1.024 ± 0.502
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.096 ± 2.009
ValCys: 0.0 ± 0.0
ValAsp: 4.096 ± 2.009
ValGlu: 2.56 ± 1.023
ValPhe: 2.56 ± 1.255
ValGly: 5.12 ± 0.906
ValHis: 1.536 ± 0.753
ValIle: 3.584 ± 1.757
ValLys: 6.656 ± 2.431
ValLeu: 9.217 ± 2.241
ValMet: 2.048 ± 1.004
ValAsn: 2.048 ± 1.274
ValPro: 4.096 ± 2.009
ValGln: 2.56 ± 0.116
ValArg: 2.048 ± 1.004
ValSer: 4.608 ± 1.121
ValThr: 5.632 ± 1.623
ValVal: 6.144 ± 1.874
ValTrp: 2.048 ± 1.004
ValTyr: 1.536 ± 0.753
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.024 ± 0.502
TrpCys: 0.0 ± 0.0
TrpAsp: 0.512 ± 0.251
TrpGlu: 1.536 ± 0.386
TrpPhe: 2.048 ± 0.135
TrpGly: 1.024 ± 0.637
TrpHis: 1.536 ± 0.386
TrpIle: 0.0 ± 0.0
TrpLys: 0.512 ± 0.251
TrpLeu: 2.048 ± 1.004
TrpMet: 0.0 ± 0.0
TrpAsn: 1.024 ± 0.502
TrpPro: 1.024 ± 0.502
TrpGln: 2.048 ± 0.135
TrpArg: 1.024 ± 0.502
TrpSer: 2.048 ± 1.004
TrpThr: 2.048 ± 1.004
TrpVal: 2.56 ± 0.116
TrpTrp: 2.048 ± 1.274
TrpTyr: 2.56 ± 1.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.56 ± 2.162
TyrCys: 0.512 ± 0.251
TyrAsp: 2.56 ± 1.255
TyrGlu: 2.56 ± 1.255
TyrPhe: 0.512 ± 0.251
TyrGly: 3.072 ± 1.506
TyrHis: 0.512 ± 0.251
TyrIle: 2.048 ± 0.135
TyrLys: 3.584 ± 0.521
TyrLeu: 2.56 ± 1.255
TyrMet: 0.512 ± 0.251
TyrAsn: 2.56 ± 1.023
TyrPro: 0.512 ± 0.251
TyrGln: 0.512 ± 0.251
TyrArg: 1.536 ± 0.386
TyrSer: 2.048 ± 0.135
TyrThr: 1.024 ± 0.502
TyrVal: 1.536 ± 0.386
TyrTrp: 0.512 ± 0.251
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1954 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
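The page does not describe the bootstrap procedure in detail, so the following is only a sketch of one plausible reading: resample whole proteins with replacement 100 times, recompute the dipeptide frequencies for each resample, and report the spread. The function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all assumptions for illustration.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, rounds=100, seed=0):
    """Spread of each dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_frequencies(resampled))
    pairs = set().union(*samples)
    # A pair missing from a resample counts as frequency 0 in that round.
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }


toy_proteins = ["MKLLA", "ALLK"]  # made-up sequences
errors = bootstrap_error(toy_proteins)
for pair in sorted(errors):
    print(pair, round(errors[pair], 1))
```

With only two proteins, as in this proteome, a protein-level resample can take just three distinct compositions (first protein twice, second protein twice, or one of each), which limits how much such error estimates can express.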

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski