Amino acid dipeptide frequency for Sclerophthora macrospora virus A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
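As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping adjacent pairs within each protein and normalises to per mille. The function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the page does not state how the database computes its values.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input (one-letter codes), not the virus proteome.
freqs = dipeptide_per_mille(["MKVLAAGV", "AAGK"])
print(freqs["AA"])
```

With real data the input would be the full protein sequences of the proteome, and the three-letter names used in the table would be obtained by mapping each one-letter code.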

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
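The arithmetic behind the expected value can be written out as follows. This is only a sketch: the `classify` helper and its strict comparison against the uniform expectation are assumptions, since the page does not say which threshold the red and blue marking actually uses.

```python
STANDARD = 20  # standard amino acids

uniform_20 = 1.0 / STANDARD ** 2        # 1/400, i.e. 0.25 % or 2.5 per mille
uniform_21 = 1.0 / (STANDARD + 1) ** 2  # 1/441 when Xaa is counted as a 21st residue

def classify(value_per_mille, expected_per_mille=1000.0 * uniform_20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected_per_mille:
        return "over"    # would correspond to red
    if value_per_mille < expected_per_mille:
        return "under"   # would correspond to blue
    return "expected"

# Three values taken from the table.
labels = {name: classify(v) for name, v in
          {"AlaAla": 8.706, "AlaCys": 1.244, "LeuGly": 10.572}.items()}
print(labels)
```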

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
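The unit conversion can be illustrated with a short snippet; the helper names are hypothetical and the value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 8.706                             # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)   # roughly 0.0087
percent = per_mille_to_percent(ala_ala)     # roughly 0.87 %
print(fraction, percent)
```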

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.706 ± 4.505
AlaCys: 1.244 ± 0.437
AlaAsp: 4.975 ± 2.247
AlaGlu: 3.109 ± 1.181
AlaPhe: 6.219 ± 2.3
AlaGly: 4.975 ± 1.621
AlaHis: 0.622 ± 0.625
AlaIle: 5.597 ± 1.363
AlaLys: 4.975 ± 2.431
AlaLeu: 5.597 ± 0.4
AlaMet: 0.622 ± 0.675
AlaAsn: 6.219 ± 2.85
AlaPro: 2.488 ± 0.884
AlaGln: 1.244 ± 0.76
AlaArg: 4.353 ± 1.869
AlaSer: 7.463 ± 0.737
AlaThr: 7.463 ± 2.006
AlaVal: 9.328 ± 2.601
AlaTrp: 0.0 ± 0.0
AlaTyr: 5.597 ± 0.885
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.109 ± 0.259
CysCys: 0.622 ± 0.38
CysAsp: 1.244 ± 0.76
CysGlu: 1.244 ± 0.437
CysPhe: 2.488 ± 1.29
CysGly: 1.866 ± 1.139
CysHis: 0.622 ± 0.38
CysIle: 1.866 ± 1.139
CysLys: 0.622 ± 0.38
CysLeu: 1.244 ± 0.437
CysMet: 0.622 ± 0.38
CysAsn: 0.622 ± 0.625
CysPro: 1.244 ± 0.645
CysGln: 1.244 ± 0.437
CysArg: 0.0 ± 0.0
CysSer: 1.866 ± 1.421
CysThr: 0.622 ± 0.816
CysVal: 2.488 ± 0.884
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.109 ± 1.181
AspCys: 0.0 ± 0.0
AspAsp: 2.488 ± 1.519
AspGlu: 3.109 ± 1.899
AspPhe: 1.244 ± 0.76
AspGly: 4.975 ± 1.321
AspHis: 0.622 ± 0.38
AspIle: 0.622 ± 0.38
AspLys: 1.244 ± 0.76
AspLeu: 4.353 ± 0.993
AspMet: 1.866 ± 1.139
AspAsn: 1.244 ± 0.437
AspPro: 1.244 ± 0.76
AspGln: 2.488 ± 0.291
AspArg: 3.731 ± 1.348
AspSer: 4.975 ± 0.582
AspThr: 0.622 ± 0.625
AspVal: 3.731 ± 1.509
AspTrp: 0.0 ± 0.0
AspTyr: 3.109 ± 1.15
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.597 ± 2.051
GluCys: 2.488 ± 1.985
GluAsp: 1.244 ± 0.645
GluGlu: 2.488 ± 0.884
GluPhe: 2.488 ± 0.811
GluGly: 0.622 ± 0.38
GluHis: 1.244 ± 0.645
GluIle: 3.731 ± 1.348
GluLys: 1.866 ± 1.139
GluLeu: 3.731 ± 1.25
GluMet: 0.0 ± 0.0
GluAsn: 0.622 ± 0.625
GluPro: 3.109 ± 1.899
GluGln: 1.244 ± 0.76
GluArg: 1.866 ± 1.139
GluSer: 2.488 ± 1.071
GluThr: 1.244 ± 0.437
GluVal: 3.731 ± 0.368
GluTrp: 0.622 ± 0.38
GluTyr: 0.622 ± 0.625
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.219 ± 0.5
PheCys: 0.622 ± 0.816
PheAsp: 2.488 ± 1.519
PheGlu: 1.244 ± 0.76
PhePhe: 0.622 ± 0.625
PheGly: 1.866 ± 0.529
PheHis: 1.244 ± 0.437
PheIle: 0.0 ± 0.0
PheLys: 1.866 ± 0.674
PheLeu: 2.488 ± 0.884
PheMet: 0.622 ± 0.38
PheAsn: 3.109 ± 1.15
PhePro: 2.488 ± 0.874
PheGln: 2.488 ± 1.071
PheArg: 1.866 ± 1.139
PheSer: 2.488 ± 0.874
PheThr: 0.622 ± 0.38
PheVal: 2.488 ± 0.811
PheTrp: 0.0 ± 0.0
PheTyr: 0.622 ± 0.38
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.731 ± 2.02
GlyCys: 2.488 ± 0.811
GlyAsp: 2.488 ± 1.622
GlyGlu: 1.866 ± 0.674
GlyPhe: 1.244 ± 0.437
GlyGly: 6.219 ± 1.177
GlyHis: 1.244 ± 0.76
GlyIle: 0.622 ± 0.38
GlyLys: 4.353 ± 1.869
GlyLeu: 5.597 ± 0.664
GlyMet: 3.109 ± 0.705
GlyAsn: 3.731 ± 1.311
GlyPro: 1.244 ± 0.437
GlyGln: 1.244 ± 0.437
GlyArg: 3.109 ± 1.105
GlySer: 9.328 ± 1.966
GlyThr: 6.219 ± 2.237
GlyVal: 4.975 ± 0.754
GlyTrp: 1.866 ± 1.421
GlyTyr: 2.488 ± 0.811
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.244 ± 0.645
HisCys: 0.622 ± 0.38
HisAsp: 0.622 ± 0.816
HisGlu: 0.622 ± 0.816
HisPhe: 0.0 ± 0.0
HisGly: 1.866 ± 1.139
HisHis: 0.0 ± 0.0
HisIle: 0.622 ± 0.816
HisLys: 0.622 ± 0.625
HisLeu: 2.488 ± 1.29
HisMet: 0.622 ± 0.38
HisAsn: 0.0 ± 0.0
HisPro: 2.488 ± 1.387
HisGln: 1.866 ± 0.674
HisArg: 3.731 ± 0.582
HisSer: 0.622 ± 0.38
HisThr: 1.866 ± 1.139
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.244 ± 0.992
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.109 ± 1.15
IleCys: 1.244 ± 0.76
IleAsp: 1.244 ± 0.76
IleGlu: 1.866 ± 1.444
IlePhe: 0.0 ± 0.0
IleGly: 1.866 ± 0.674
IleHis: 1.244 ± 0.645
IleIle: 1.866 ± 0.625
IleLys: 1.244 ± 0.992
IleLeu: 3.109 ± 0.705
IleMet: 2.488 ± 1.992
IleAsn: 1.866 ± 1.01
IlePro: 2.488 ± 1.071
IleGln: 1.244 ± 1.632
IleArg: 2.488 ± 0.291
IleSer: 6.219 ± 2.211
IleThr: 1.244 ± 0.437
IleVal: 3.109 ± 0.259
IleTrp: 0.622 ± 0.38
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.975 ± 1.621
LysCys: 1.866 ± 1.139
LysAsp: 2.488 ± 1.519
LysGlu: 2.488 ± 0.291
LysPhe: 2.488 ± 1.519
LysGly: 1.866 ± 0.625
LysHis: 0.622 ± 0.38
LysIle: 1.244 ± 0.437
LysLys: 2.488 ± 0.291
LysLeu: 3.731 ± 0.582
LysMet: 0.0 ± 0.0
LysAsn: 0.0 ± 0.0
LysPro: 5.597 ± 0.885
LysGln: 3.109 ± 1.15
LysArg: 5.597 ± 1.395
LysSer: 1.866 ± 1.421
LysThr: 1.866 ± 1.01
LysVal: 1.866 ± 1.01
LysTrp: 1.244 ± 0.76
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.463 ± 1.337
LeuCys: 3.109 ± 1.899
LeuAsp: 3.731 ± 1.935
LeuGlu: 3.109 ± 1.181
LeuPhe: 2.488 ± 0.884
LeuGly: 10.572 ± 5.466
LeuHis: 1.244 ± 1.632
LeuIle: 2.488 ± 2.226
LeuLys: 1.244 ± 0.76
LeuLeu: 9.328 ± 2.393
LeuMet: 1.244 ± 0.76
LeuAsn: 4.353 ± 0.898
LeuPro: 3.109 ± 1.15
LeuGln: 2.488 ± 1.387
LeuArg: 8.085 ± 2.026
LeuSer: 6.841 ± 1.98
LeuThr: 6.841 ± 3.601
LeuVal: 5.597 ± 1.726
LeuTrp: 0.622 ± 0.625
LeuTyr: 1.244 ± 0.437
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.244 ± 0.437
MetCys: 0.622 ± 0.38
MetAsp: 1.866 ± 0.529
MetGlu: 1.244 ± 0.992
MetPhe: 0.622 ± 0.38
MetGly: 0.0 ± 0.0
MetHis: 0.622 ± 0.625
MetIle: 0.0 ± 0.0
MetLys: 1.866 ± 1.139
MetLeu: 3.731 ± 0.9
MetMet: 0.622 ± 0.625
MetAsn: 0.622 ± 0.816
MetPro: 1.244 ± 0.76
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 3.731 ± 0.582
MetThr: 0.0 ± 0.0
MetVal: 4.353 ± 0.993
MetTrp: 1.244 ± 1.632
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.731 ± 2.02
AsnCys: 0.622 ± 0.625
AsnAsp: 2.488 ± 0.811
AsnGlu: 1.866 ± 1.01
AsnPhe: 1.866 ± 0.529
AsnGly: 0.622 ± 0.625
AsnHis: 0.622 ± 0.38
AsnIle: 2.488 ± 0.874
AsnLys: 0.622 ± 0.816
AsnLeu: 1.866 ± 1.139
AsnMet: 0.0 ± 0.0
AsnAsn: 1.866 ± 0.529
AsnPro: 5.597 ± 2.697
AsnGln: 1.244 ± 0.76
AsnArg: 2.488 ± 0.884
AsnSer: 3.109 ± 0.705
AsnThr: 0.622 ± 0.38
AsnVal: 6.219 ± 1.786
AsnTrp: 0.622 ± 0.38
AsnTyr: 0.622 ± 0.816
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.597 ± 1.363
ProCys: 1.244 ± 1.632
ProAsp: 1.866 ± 1.139
ProGlu: 3.731 ± 1.516
ProPhe: 0.622 ± 0.625
ProGly: 3.731 ± 1.509
ProHis: 1.244 ± 0.76
ProIle: 3.731 ± 1.058
ProLys: 4.353 ± 0.251
ProLeu: 5.597 ± 1.875
ProMet: 1.866 ± 1.706
ProAsn: 0.622 ± 0.38
ProPro: 3.109 ± 1.15
ProGln: 0.622 ± 0.625
ProArg: 3.109 ± 1.899
ProSer: 3.731 ± 1.921
ProThr: 4.353 ± 0.898
ProVal: 5.597 ± 1.953
ProTrp: 1.866 ± 0.625
ProTyr: 0.622 ± 0.816
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.353 ± 0.948
GlnCys: 0.0 ± 0.0
GlnAsp: 0.622 ± 0.38
GlnGlu: 0.0 ± 0.0
GlnPhe: 3.731 ± 0.582
GlnGly: 2.488 ± 0.884
GlnHis: 1.244 ± 1.632
GlnIle: 0.622 ± 0.38
GlnLys: 2.488 ± 0.811
GlnLeu: 2.488 ± 1.071
GlnMet: 0.622 ± 0.38
GlnAsn: 2.488 ± 1.387
GlnPro: 1.244 ± 1.632
GlnGln: 1.866 ± 1.444
GlnArg: 1.866 ± 0.625
GlnSer: 1.866 ± 1.444
GlnThr: 2.488 ± 0.811
GlnVal: 2.488 ± 1.071
GlnTrp: 1.244 ± 0.76
GlnTyr: 3.109 ± 0.893
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.219 ± 1.255
ArgCys: 1.244 ± 0.76
ArgAsp: 3.731 ± 0.582
ArgGlu: 1.866 ± 0.625
ArgPhe: 2.488 ± 0.884
ArgGly: 3.109 ± 0.259
ArgHis: 1.866 ± 0.625
ArgIle: 1.244 ± 0.437
ArgLys: 3.109 ± 1.15
ArgLeu: 3.731 ± 2.843
ArgMet: 2.488 ± 1.053
ArgAsn: 3.731 ± 1.516
ArgPro: 5.597 ± 2.021
ArgGln: 4.353 ± 1.525
ArgArg: 4.975 ± 2.58
ArgSer: 4.353 ± 0.898
ArgThr: 4.975 ± 0.582
ArgVal: 3.109 ± 0.893
ArgTrp: 0.622 ± 0.625
ArgTyr: 1.866 ± 1.139
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.841 ± 1.921
SerCys: 0.622 ± 0.38
SerAsp: 3.109 ± 1.181
SerGlu: 1.866 ± 0.625
SerPhe: 2.488 ± 1.071
SerGly: 9.95 ± 4.477
SerHis: 3.109 ± 1.263
SerIle: 5.597 ± 3.454
SerLys: 3.731 ± 0.368
SerLeu: 7.463 ± 1.337
SerMet: 2.488 ± 0.47
SerAsn: 1.866 ± 0.529
SerPro: 4.975 ± 1.539
SerGln: 3.731 ± 1.25
SerArg: 6.219 ± 2.604
SerSer: 9.95 ± 4.014
SerThr: 6.219 ± 3.229
SerVal: 4.975 ± 2.238
SerTrp: 3.109 ± 2.188
SerTyr: 1.866 ± 0.625
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.353 ± 1.303
ThrCys: 1.244 ± 0.645
ThrAsp: 2.488 ± 0.291
ThrGlu: 1.866 ± 1.421
ThrPhe: 2.488 ± 0.874
ThrGly: 4.353 ± 1.915
ThrHis: 1.244 ± 0.645
ThrIle: 3.109 ± 2.188
ThrLys: 3.109 ± 0.259
ThrLeu: 5.597 ± 2.28
ThrMet: 1.244 ± 0.437
ThrAsn: 1.866 ± 1.01
ThrPro: 3.731 ± 1.509
ThrGln: 1.244 ± 0.437
ThrArg: 4.353 ± 0.829
ThrSer: 8.085 ± 4.899
ThrThr: 5.597 ± 1.142
ThrVal: 4.353 ± 0.948
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.731 ± 2.237
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.085 ± 2.357
ValCys: 2.488 ± 0.884
ValAsp: 4.353 ± 1.315
ValGlu: 6.841 ± 2.656
ValPhe: 0.622 ± 0.625
ValGly: 4.975 ± 1.749
ValHis: 0.622 ± 0.38
ValIle: 1.244 ± 0.645
ValLys: 1.866 ± 1.139
ValLeu: 6.841 ± 1.595
ValMet: 1.244 ± 0.437
ValAsn: 2.488 ± 1.519
ValPro: 4.353 ± 2.628
ValGln: 3.731 ± 1.301
ValArg: 4.975 ± 0.754
ValSer: 6.841 ± 2.934
ValThr: 8.085 ± 3.763
ValVal: 8.706 ± 2.63
ValTrp: 0.622 ± 0.38
ValTyr: 2.488 ± 0.811
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.244 ± 0.437
TrpCys: 0.622 ± 0.816
TrpAsp: 0.622 ± 0.38
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.622 ± 0.816
TrpIle: 1.244 ± 0.992
TrpLys: 1.866 ± 1.421
TrpLeu: 3.109 ± 0.705
TrpMet: 1.244 ± 0.645
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.244 ± 0.645
TrpArg: 0.0 ± 0.0
TrpSer: 1.244 ± 0.645
TrpThr: 0.622 ± 0.38
TrpVal: 1.244 ± 0.437
TrpTrp: 0.622 ± 0.38
TrpTyr: 1.244 ± 0.76
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.488 ± 0.811
TyrCys: 0.622 ± 0.38
TyrAsp: 0.622 ± 0.625
TyrGlu: 0.622 ± 0.625
TyrPhe: 1.244 ± 0.437
TyrGly: 1.244 ± 0.437
TyrHis: 1.244 ± 0.76
TyrIle: 0.622 ± 0.625
TyrLys: 1.866 ± 0.529
TyrLeu: 3.109 ± 0.259
TyrMet: 0.0 ± 0.0
TyrAsn: 1.866 ± 1.01
TyrPro: 1.866 ± 0.674
TyrGln: 0.622 ± 0.38
TyrArg: 1.866 ± 0.529
TyrSer: 3.109 ± 1.105
TyrThr: 2.488 ± 0.811
TyrVal: 3.109 ± 0.259
TyrTrp: 1.866 ± 0.625
TyrTyr: 1.866 ± 0.529
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1609 amino acids)
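The entries listed above follow a regular "name: value ± error" pattern, so they can be loaded into a lookup table with a small parser. The following is a sketch only: the regular expression and the names `ENTRY` and `parse_entries` are assumptions for illustration, and `re.search` is used so that any text preceding the dipeptide name on a line is ignored.

```python
import re

# Two three-letter residue names, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)")

def parse_entries(lines):
    """Map (first, second) residue names to (per mille value, bootstrap error)."""
    table = {}
    for line in lines:
        m = ENTRY.search(line)
        if m:  # row labels such as "Ala" do not match and are skipped
            first, second, value, err = m.groups()
            table[(first, second)] = (float(value), float(err))
    return table

# A few lines copied from the table above.
sample = ["Ala", "AlaAla: 8.706 ± 4.505", "AlaCys: 1.244 ± 0.437",
          "LeuGly: 10.572 ± 5.466"]
table = parse_entries(sample)
most_frequent = max(table, key=lambda key: table[key][0])
print(most_frequent, table[most_frequent])
```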

Note: The error has been estimated by bootstrapping (x100) at the protein level.
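One plausible reading of this note is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the spread of those estimates is reported. The exact procedure used by the database is not described here, so the use of the population standard deviation, the function names, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)

# Hypothetical toy proteome (one-letter codes), not the real sequences.
proteins = ["MKVLAAGVAA", "AAGKLLS", "MSTVVAL"]
err = bootstrap_error(proteins, "AA")
print(err)
```

With only three proteins, as in this proteome, each resample is dominated by which proteins happen to be drawn, which may explain why several errors in the table are comparable in size to the values themselves.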

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski