Amino acid dipeptide frequency for Penicillium aurantiogriseum fusarivirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
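As an illustration, a dipeptide frequency can be computed by counting adjacent residue pairs. The sketch below is a minimal example, not the database's actual pipeline: it assumes overlapping pairs, one-letter codes, and that pairs never span two proteins. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein separately,
        # so no pair is formed across a protein boundary (assumption).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the real proteome.
toy_sequences = ["MKLLA", "ALLK"]
frequencies = dipeptide_frequencies(toy_sequences)
for pair, value in sorted(frequencies.items()):
    print(f"{pair}: {value:.3f}")
```

The frequencies returned for one input always sum to 1000 per mille, which is a quick sanity check on any such table.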

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
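The uniform expectation and the over/under labelling can be sketched as follows. This is an assumption-laden example: the page does not state the exact rule used for colouring, so the simple comparison against the uniform expectation, as well as the function names, are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


print(expected_per_mille())    # 20 amino acids, 400 dipeptides
print(expected_per_mille(21))  # 21 symbols including Xaa, 441 dipeptides
print(classify(13.507))        # LeuLeu value from the table
print(classify(0.0))           # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so LeuLeu (13.507‰) is overrepresented and CysCys (0.0‰) is underrepresented under this rule.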

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
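For example, the unit conversion can be written as a small helper. This is only an illustrative sketch; the function names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 7.004  # AlaAla value from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")
print(f"{per_mille_to_percent(ala_ala):.4f} %")
```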

For more information, see the sequence space article on Wikipedia.

Each entry is given as "dipeptide: frequency ± error" in ‰, grouped by the first amino acid. Amino acid order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.004 ± 2.775
AlaCys: 1.001 ± 0.47
AlaAsp: 3.502 ± 1.388
AlaGlu: 6.503 ± 3.01
AlaPhe: 4.002 ± 0.143
AlaGly: 3.502 ± 1.644
AlaHis: 3.502 ± 0.633
AlaIle: 3.502 ± 3.409
AlaLys: 8.004 ± 2.306
AlaLeu: 6.003 ± 0.214
AlaMet: 1.501 ± 0.306
AlaAsn: 4.502 ± 2.939
AlaPro: 3.502 ± 1.644
AlaGln: 1.501 ± 0.704
AlaArg: 5.003 ± 4.725
AlaSer: 5.503 ± 1.459
AlaThr: 4.502 ± 0.092
AlaVal: 3.502 ± 0.633
AlaTrp: 2.001 ± 0.939
AlaTyr: 1.501 ± 2.327
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.5 ± 0.235
CysGlu: 0.0 ± 0.0
CysPhe: 1.501 ± 0.704
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.5 ± 0.235
CysMet: 0.0 ± 0.0
CysAsn: 0.5 ± 0.235
CysPro: 0.5 ± 0.235
CysGln: 0.0 ± 0.0
CysArg: 1.501 ± 0.704
CysSer: 1.001 ± 0.47
CysThr: 0.0 ± 0.0
CysVal: 0.5 ± 0.235
CysTrp: 0.0 ± 0.0
CysTyr: 0.5 ± 0.235
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.002 ± 1.153
AspCys: 0.5 ± 0.235
AspAsp: 3.002 ± 1.623
AspGlu: 4.002 ± 0.143
AspPhe: 4.002 ± 0.868
AspGly: 2.001 ± 0.939
AspHis: 1.001 ± 0.47
AspIle: 5.003 ± 2.348
AspLys: 1.501 ± 1.316
AspLeu: 4.502 ± 1.929
AspMet: 0.5 ± 0.235
AspAsn: 1.001 ± 0.47
AspPro: 3.002 ± 0.612
AspGln: 0.0 ± 0.0
AspArg: 2.001 ± 0.939
AspSer: 3.002 ± 0.612
AspThr: 3.002 ± 1.623
AspVal: 4.002 ± 0.143
AspTrp: 2.001 ± 0.071
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.002 ± 3.174
GluCys: 0.5 ± 0.235
GluAsp: 2.001 ± 1.082
GluGlu: 5.503 ± 0.562
GluPhe: 2.001 ± 0.939
GluGly: 3.002 ± 1.409
GluHis: 0.5 ± 0.776
GluIle: 2.501 ± 2.868
GluLys: 3.502 ± 0.633
GluLeu: 9.505 ± 0.419
GluMet: 1.001 ± 0.695
GluAsn: 2.001 ± 2.092
GluPro: 2.501 ± 0.164
GluGln: 3.002 ± 1.623
GluArg: 2.501 ± 1.857
GluSer: 5.003 ± 2.704
GluThr: 2.501 ± 0.164
GluVal: 5.503 ± 1.459
GluTrp: 0.0 ± 0.0
GluTyr: 1.001 ± 0.47
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.002 ± 0.868
PheCys: 0.5 ± 0.235
PheAsp: 2.001 ± 0.939
PheGlu: 2.001 ± 0.071
PhePhe: 1.501 ± 0.704
PheGly: 3.002 ± 1.409
PheHis: 1.001 ± 0.47
PheIle: 3.002 ± 1.409
PheLys: 3.002 ± 0.612
PheLeu: 5.003 ± 1.337
PheMet: 0.0 ± 0.0
PheAsn: 2.501 ± 0.164
PhePro: 2.001 ± 0.939
PheGln: 2.001 ± 0.071
PheArg: 2.001 ± 0.939
PheSer: 0.5 ± 0.235
PheThr: 1.001 ± 0.47
PheVal: 4.002 ± 1.878
PheTrp: 1.001 ± 0.47
PheTyr: 0.5 ± 0.235
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.501 ± 1.174
GlyCys: 0.5 ± 0.235
GlyAsp: 4.002 ± 0.868
GlyGlu: 3.002 ± 0.398
GlyPhe: 4.502 ± 2.113
GlyGly: 2.001 ± 0.939
GlyHis: 0.5 ± 0.235
GlyIle: 1.001 ± 0.47
GlyLys: 5.003 ± 0.683
GlyLeu: 6.003 ± 0.797
GlyMet: 0.5 ± 0.235
GlyAsn: 2.001 ± 0.939
GlyPro: 3.002 ± 1.409
GlyGln: 2.001 ± 1.082
GlyArg: 3.502 ± 0.633
GlySer: 4.002 ± 1.878
GlyThr: 5.003 ± 0.327
GlyVal: 6.503 ± 2.042
GlyTrp: 1.501 ± 0.704
GlyTyr: 2.501 ± 0.164
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.501 ± 0.306
HisCys: 0.0 ± 0.0
HisAsp: 1.001 ± 0.47
HisGlu: 0.5 ± 0.235
HisPhe: 0.0 ± 0.0
HisGly: 1.501 ± 0.306
HisHis: 0.5 ± 0.235
HisIle: 1.501 ± 0.704
HisLys: 2.501 ± 1.174
HisLeu: 2.001 ± 0.939
HisMet: 0.5 ± 0.235
HisAsn: 1.001 ± 0.541
HisPro: 0.0 ± 0.0
HisGln: 0.5 ± 0.235
HisArg: 1.001 ± 0.541
HisSer: 1.501 ± 0.704
HisThr: 1.501 ± 0.704
HisVal: 0.5 ± 0.235
HisTrp: 0.5 ± 0.235
HisTyr: 1.001 ± 0.541
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.502 ± 0.918
IleCys: 0.5 ± 0.235
IleAsp: 4.502 ± 0.918
IleGlu: 1.501 ± 1.316
IlePhe: 2.001 ± 0.939
IleGly: 3.002 ± 1.409
IleHis: 0.0 ± 0.0
IleIle: 4.502 ± 0.918
IleLys: 4.002 ± 1.153
IleLeu: 2.501 ± 1.174
IleMet: 1.501 ± 0.704
IleAsn: 3.002 ± 1.623
IlePro: 1.001 ± 0.47
IleGln: 2.501 ± 2.868
IleArg: 1.501 ± 0.704
IleSer: 4.502 ± 1.103
IleThr: 1.501 ± 0.306
IleVal: 6.503 ± 2.042
IleTrp: 1.001 ± 0.47
IleTyr: 2.001 ± 0.939
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.503 ± 6.042
LysCys: 0.0 ± 0.0
LysAsp: 3.502 ± 1.388
LysGlu: 4.002 ± 1.153
LysPhe: 2.501 ± 0.164
LysGly: 5.503 ± 3.48
LysHis: 1.001 ± 1.551
LysIle: 1.501 ± 0.704
LysLys: 3.502 ± 0.377
LysLeu: 6.003 ± 0.797
LysMet: 2.001 ± 1.082
LysAsn: 2.501 ± 0.164
LysPro: 2.501 ± 1.174
LysGln: 2.001 ± 1.082
LysArg: 3.502 ± 0.633
LysSer: 3.002 ± 0.398
LysThr: 3.002 ± 1.623
LysVal: 6.503 ± 0.989
LysTrp: 2.001 ± 0.071
LysTyr: 2.001 ± 0.071
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.503 ± 1.572
LeuCys: 0.0 ± 0.0
LeuAsp: 3.502 ± 0.633
LeuGlu: 5.503 ± 2.469
LeuPhe: 2.501 ± 0.164
LeuGly: 10.005 ± 3.685
LeuHis: 1.501 ± 0.704
LeuIle: 5.003 ± 2.348
LeuLys: 5.503 ± 2.469
LeuLeu: 13.507 ± 3.308
LeuMet: 3.502 ± 0.633
LeuAsn: 3.002 ± 1.623
LeuPro: 6.503 ± 2.042
LeuGln: 3.502 ± 2.398
LeuArg: 8.004 ± 0.285
LeuSer: 8.004 ± 2.746
LeuThr: 9.505 ± 0.591
LeuVal: 6.003 ± 1.807
LeuTrp: 7.004 ± 2.277
LeuTyr: 2.001 ± 0.939
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.502 ± 0.633
MetCys: 0.0 ± 0.0
MetAsp: 1.501 ± 0.704
MetGlu: 1.001 ± 0.541
MetPhe: 0.0 ± 0.0
MetGly: 1.501 ± 0.306
MetHis: 1.001 ± 0.47
MetIle: 1.501 ± 1.316
MetLys: 2.001 ± 0.939
MetLeu: 2.501 ± 1.174
MetMet: 0.5 ± 0.235
MetAsn: 0.5 ± 0.776
MetPro: 0.5 ± 0.235
MetGln: 0.5 ± 0.235
MetArg: 1.001 ± 0.47
MetSer: 1.501 ± 0.306
MetThr: 0.0 ± 0.0
MetVal: 0.5 ± 0.235
MetTrp: 0.0 ± 0.0
MetTyr: 0.5 ± 0.235
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.001 ± 2.092
AsnCys: 0.0 ± 0.0
AsnAsp: 1.001 ± 1.551
AsnGlu: 2.501 ± 0.847
AsnPhe: 1.001 ± 0.47
AsnGly: 1.501 ± 0.704
AsnHis: 1.001 ± 0.47
AsnIle: 2.501 ± 1.174
AsnLys: 4.002 ± 2.163
AsnLeu: 1.501 ± 0.704
AsnMet: 1.001 ± 0.202
AsnAsn: 2.001 ± 1.082
AsnPro: 3.002 ± 0.612
AsnGln: 3.002 ± 2.633
AsnArg: 2.001 ± 1.082
AsnSer: 2.001 ± 0.071
AsnThr: 1.501 ± 0.306
AsnVal: 2.501 ± 0.164
AsnTrp: 1.001 ± 0.541
AsnTyr: 1.001 ± 0.47
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.502 ± 0.377
ProCys: 0.0 ± 0.0
ProAsp: 3.002 ± 0.398
ProGlu: 4.002 ± 0.868
ProPhe: 2.001 ± 0.939
ProGly: 1.501 ± 0.306
ProHis: 0.0 ± 0.0
ProIle: 2.501 ± 0.164
ProLys: 2.501 ± 0.164
ProLeu: 5.003 ± 2.348
ProMet: 0.0 ± 0.0
ProAsn: 0.5 ± 0.235
ProPro: 2.001 ± 0.071
ProGln: 1.001 ± 0.47
ProArg: 1.001 ± 0.47
ProSer: 3.002 ± 0.398
ProThr: 6.503 ± 2.042
ProVal: 4.502 ± 2.113
ProTrp: 1.001 ± 1.551
ProTyr: 2.001 ± 0.939
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.003 ± 4.255
GlnCys: 0.0 ± 0.0
GlnAsp: 2.001 ± 0.071
GlnGlu: 2.001 ± 0.071
GlnPhe: 1.001 ± 0.47
GlnGly: 1.001 ± 0.47
GlnHis: 1.501 ± 0.704
GlnIle: 3.002 ± 2.633
GlnLys: 3.002 ± 1.623
GlnLeu: 2.501 ± 0.847
GlnMet: 0.5 ± 0.235
GlnAsn: 1.501 ± 0.306
GlnPro: 1.001 ± 0.541
GlnGln: 2.501 ± 2.868
GlnArg: 2.501 ± 1.857
GlnSer: 2.001 ± 0.071
GlnThr: 1.001 ± 0.541
GlnVal: 2.001 ± 1.082
GlnTrp: 1.001 ± 0.541
GlnTyr: 1.001 ± 0.47
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.002 ± 0.143
ArgCys: 0.0 ± 0.0
ArgAsp: 1.001 ± 0.541
ArgGlu: 3.002 ± 3.643
ArgPhe: 2.001 ± 0.071
ArgGly: 2.001 ± 0.939
ArgHis: 1.501 ± 0.704
ArgIle: 3.002 ± 0.398
ArgLys: 4.002 ± 0.143
ArgLeu: 11.006 ± 0.897
ArgMet: 0.5 ± 0.235
ArgAsn: 2.001 ± 0.939
ArgPro: 2.501 ± 0.847
ArgGln: 3.002 ± 0.612
ArgArg: 2.501 ± 0.164
ArgSer: 3.502 ± 0.633
ArgThr: 5.003 ± 0.683
ArgVal: 4.002 ± 1.153
ArgTrp: 1.001 ± 0.47
ArgTyr: 2.001 ± 0.071
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.002 ± 1.153
SerCys: 1.001 ± 0.47
SerAsp: 3.502 ± 0.633
SerGlu: 2.001 ± 1.082
SerPhe: 5.003 ± 1.337
SerGly: 2.501 ± 0.164
SerHis: 1.001 ± 0.47
SerIle: 4.002 ± 0.868
SerLys: 3.502 ± 2.398
SerLeu: 9.505 ± 1.43
SerMet: 1.501 ± 0.704
SerAsn: 1.501 ± 1.316
SerPro: 1.501 ± 0.306
SerGln: 2.001 ± 0.939
SerArg: 6.003 ± 0.214
SerSer: 6.503 ± 1.031
SerThr: 6.003 ± 1.807
SerVal: 5.003 ± 0.327
SerTrp: 2.501 ± 0.164
SerTyr: 2.001 ± 0.939
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.003 ± 1.694
ThrCys: 0.5 ± 0.235
ThrAsp: 3.002 ± 0.398
ThrGlu: 3.502 ± 0.377
ThrPhe: 2.001 ± 0.939
ThrGly: 7.504 ± 2.511
ThrHis: 0.0 ± 0.0
ThrIle: 2.501 ± 1.174
ThrLys: 3.002 ± 3.643
ThrLeu: 6.503 ± 0.021
ThrMet: 1.001 ± 0.47
ThrAsn: 1.001 ± 0.47
ThrPro: 6.503 ± 2.042
ThrGln: 1.001 ± 0.47
ThrArg: 3.502 ± 0.377
ThrSer: 5.503 ± 1.459
ThrThr: 4.502 ± 0.092
ThrVal: 2.501 ± 0.164
ThrTrp: 2.501 ± 1.857
ThrTyr: 2.001 ± 0.939
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.003 ± 0.683
ValCys: 1.001 ± 0.47
ValAsp: 3.502 ± 0.633
ValGlu: 4.502 ± 1.929
ValPhe: 1.501 ± 0.704
ValGly: 5.003 ± 2.348
ValHis: 1.001 ± 0.541
ValIle: 4.002 ± 0.868
ValLys: 2.001 ± 0.071
ValLeu: 8.004 ± 2.746
ValMet: 2.001 ± 0.071
ValAsn: 3.502 ± 0.377
ValPro: 2.501 ± 1.174
ValGln: 4.502 ± 2.939
ValArg: 3.002 ± 1.409
ValSer: 7.004 ± 0.256
ValThr: 6.503 ± 0.021
ValVal: 8.004 ± 1.736
ValTrp: 3.002 ± 1.409
ValTyr: 1.001 ± 0.541
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 3.502 ± 1.644
TrpCys: 0.5 ± 0.235
TrpAsp: 0.5 ± 0.235
TrpGlu: 1.001 ± 0.47
TrpPhe: 0.5 ± 0.235
TrpGly: 2.001 ± 1.082
TrpHis: 2.501 ± 1.174
TrpIle: 1.001 ± 1.551
TrpLys: 1.501 ± 0.704
TrpLeu: 5.503 ± 1.459
TrpMet: 1.001 ± 0.47
TrpAsn: 1.001 ± 0.541
TrpPro: 1.001 ± 0.47
TrpGln: 1.001 ± 0.541
TrpArg: 1.501 ± 0.704
TrpSer: 2.001 ± 0.939
TrpThr: 1.001 ± 0.47
TrpVal: 1.501 ± 0.306
TrpTrp: 0.5 ± 0.235
TrpTyr: 1.001 ± 0.47
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.502 ± 0.633
TyrCys: 0.5 ± 0.235
TyrAsp: 1.501 ± 0.306
TyrGlu: 2.001 ± 0.939
TyrPhe: 1.001 ± 0.47
TyrGly: 1.501 ± 0.704
TyrHis: 0.0 ± 0.0
TyrIle: 0.5 ± 0.235
TyrLys: 1.001 ± 0.47
TyrLeu: 2.001 ± 0.939
TyrMet: 0.5 ± 0.235
TyrAsn: 1.001 ± 0.47
TyrPro: 0.5 ± 0.235
TyrGln: 1.501 ± 0.704
TyrArg: 4.002 ± 1.153
TyrSer: 1.501 ± 0.704
TyrThr: 0.0 ± 0.0
TyrVal: 2.501 ± 1.857
TyrTrp: 0.5 ± 0.235
TyrTyr: 1.501 ± 0.704
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (2000 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
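A protein-level bootstrap of this kind can be sketched as below. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation across replicates is taken as the error. The function names, the choice of standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not the real proteome.
toy_sequences = ["MKLLA", "ALLK", "GGSLL"]
error = bootstrap_error(toy_sequences, "LL")
print(f"LL: {pair_frequency(toy_sequences, 'LL'):.3f} ± {error:.3f}")
```

With only two proteins in a proteome, as here, a protein-level bootstrap has very few distinct resamples, which is worth keeping in mind when reading the reported errors.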

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski