Amino acid dipepetide frequency for Subterranean clover mottle virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.587AlaAla: 7.587 ± 5.163
1.012AlaCys: 1.012 ± 0.768
5.564AlaAsp: 5.564 ± 1.562
3.541AlaGlu: 3.541 ± 1.459
0.506AlaPhe: 0.506 ± 0.723
4.552AlaGly: 4.552 ± 0.823
0.0AlaHis: 0.0 ± 0.0
6.07AlaIle: 6.07 ± 1.342
3.035AlaLys: 3.035 ± 1.376
8.093AlaLeu: 8.093 ± 1.729
0.506AlaMet: 0.506 ± 0.329
2.529AlaAsn: 2.529 ± 1.479
2.023AlaPro: 2.023 ± 0.919
0.0AlaGln: 0.0 ± 0.0
3.541AlaArg: 3.541 ± 0.588
5.564AlaSer: 5.564 ± 1.458
2.023AlaThr: 2.023 ± 1.213
12.645AlaVal: 12.645 ± 2.48
0.0AlaTrp: 0.0 ± 0.0
2.529AlaTyr: 2.529 ± 0.672
0.0AlaXaa: 0.0 ± 0.0
Cys
1.012CysAla: 1.012 ± 0.374
0.0CysCys: 0.0 ± 0.0
1.517CysAsp: 1.517 ± 1.547
0.0CysGlu: 0.0 ± 0.0
1.012CysPhe: 1.012 ± 0.768
1.012CysGly: 1.012 ± 0.374
0.506CysHis: 0.506 ± 0.723
2.023CysIle: 2.023 ± 1.021
2.529CysLys: 2.529 ± 1.063
1.012CysLeu: 1.012 ± 0.658
1.012CysMet: 1.012 ± 0.357
1.012CysAsn: 1.012 ± 0.768
1.012CysPro: 1.012 ± 0.374
0.506CysGln: 0.506 ± 0.492
0.506CysArg: 0.506 ± 0.813
2.023CysSer: 2.023 ± 0.593
1.012CysThr: 1.012 ± 0.768
1.012CysVal: 1.012 ± 0.768
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.058AspAla: 5.058 ± 0.732
0.506AspCys: 0.506 ± 0.813
6.07AspAsp: 6.07 ± 1.516
1.517AspGlu: 1.517 ± 0.645
4.552AspPhe: 4.552 ± 1.167
1.517AspGly: 1.517 ± 0.504
2.023AspHis: 2.023 ± 0.593
3.541AspIle: 3.541 ± 0.803
2.023AspLys: 2.023 ± 0.764
6.07AspLeu: 6.07 ± 2.292
0.0AspMet: 0.0 ± 0.0
1.012AspAsn: 1.012 ± 0.374
3.541AspPro: 3.541 ± 1.077
1.517AspGln: 1.517 ± 1.547
2.529AspArg: 2.529 ± 1.063
7.587AspSer: 7.587 ± 3.201
1.012AspThr: 1.012 ± 0.374
3.035AspVal: 3.035 ± 0.789
0.506AspTrp: 0.506 ± 0.329
2.529AspTyr: 2.529 ± 1.331
0.0AspXaa: 0.0 ± 0.0
Glu
4.047GluAla: 4.047 ± 1.528
1.012GluCys: 1.012 ± 0.374
5.564GluAsp: 5.564 ± 1.773
3.541GluGlu: 3.541 ± 3.066
0.506GluPhe: 0.506 ± 0.329
6.576GluGly: 6.576 ± 2.459
1.517GluHis: 1.517 ± 0.504
6.07GluIle: 6.07 ± 2.14
5.058GluLys: 5.058 ± 1.869
4.552GluLeu: 4.552 ± 1.426
0.506GluMet: 0.506 ± 0.669
1.012GluAsn: 1.012 ± 0.777
6.07GluPro: 6.07 ± 1.996
1.012GluGln: 1.012 ± 0.374
3.541GluArg: 3.541 ± 0.763
1.517GluSer: 1.517 ± 0.645
6.576GluThr: 6.576 ± 0.961
6.07GluVal: 6.07 ± 1.481
1.012GluTrp: 1.012 ± 0.658
1.517GluTyr: 1.517 ± 1.547
0.0GluXaa: 0.0 ± 0.0
Phe
2.023PheAla: 2.023 ± 0.883
0.506PheCys: 0.506 ± 0.329
0.506PheAsp: 0.506 ± 0.329
2.529PheGlu: 2.529 ± 0.824
0.0PhePhe: 0.0 ± 0.0
2.529PheGly: 2.529 ± 0.824
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
2.529PheLys: 2.529 ± 1.644
4.552PheLeu: 4.552 ± 1.539
0.0PheMet: 0.0 ± 0.0
3.035PheAsn: 3.035 ± 0.627
3.035PhePro: 3.035 ± 1.024
1.012PheGln: 1.012 ± 0.768
1.517PheArg: 1.517 ± 0.987
2.023PheSer: 2.023 ± 0.425
1.517PheThr: 1.517 ± 2.168
2.529PheVal: 2.529 ± 0.824
1.012PheTrp: 1.012 ± 1.626
2.023PheTyr: 2.023 ± 0.748
0.0PheXaa: 0.0 ± 0.0
Gly
4.047GlyAla: 4.047 ± 1.656
2.023GlyCys: 2.023 ± 0.748
3.035GlyAsp: 3.035 ± 1.321
5.564GlyGlu: 5.564 ± 1.905
3.035GlyPhe: 3.035 ± 0.789
5.058GlyGly: 5.058 ± 0.822
1.012GlyHis: 1.012 ± 0.658
2.023GlyIle: 2.023 ± 0.853
7.587GlyLys: 7.587 ± 1.761
3.035GlyLeu: 3.035 ± 1.007
1.012GlyMet: 1.012 ± 0.658
3.035GlyAsn: 3.035 ± 1.461
2.023GlyPro: 2.023 ± 0.593
3.035GlyGln: 3.035 ± 0.657
3.541GlyArg: 3.541 ± 1.185
6.07GlySer: 6.07 ± 0.953
5.564GlyThr: 5.564 ± 2.081
4.552GlyVal: 4.552 ± 0.876
2.023GlyTrp: 2.023 ± 0.748
6.576GlyTyr: 6.576 ± 0.666
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.023HisAsp: 2.023 ± 0.748
2.529HisGlu: 2.529 ± 1.162
0.506HisPhe: 0.506 ± 0.329
0.506HisGly: 0.506 ± 0.329
0.506HisHis: 0.506 ± 0.329
1.012HisIle: 1.012 ± 0.374
1.012HisLys: 1.012 ± 0.658
1.517HisLeu: 1.517 ± 0.81
0.0HisMet: 0.0 ± 0.0
0.506HisAsn: 0.506 ± 0.329
1.012HisPro: 1.012 ± 0.658
1.012HisGln: 1.012 ± 0.374
1.012HisArg: 1.012 ± 0.768
1.012HisSer: 1.012 ± 0.777
3.035HisThr: 3.035 ± 0.657
1.012HisVal: 1.012 ± 0.374
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.035IleAla: 3.035 ± 1.024
0.506IleCys: 0.506 ± 0.329
1.517IleAsp: 1.517 ± 0.722
2.529IleGlu: 2.529 ± 1.331
4.047IlePhe: 4.047 ± 0.744
2.023IleGly: 2.023 ± 0.919
1.517IleHis: 1.517 ± 0.533
1.517IleIle: 1.517 ± 0.504
3.541IleLys: 3.541 ± 1.524
3.541IleLeu: 3.541 ± 0.742
1.012IleMet: 1.012 ± 0.374
1.517IleAsn: 1.517 ± 0.533
2.023IlePro: 2.023 ± 1.021
1.012IleGln: 1.012 ± 1.184
4.552IleArg: 4.552 ± 1.238
7.081IleSer: 7.081 ± 1.412
2.023IleThr: 2.023 ± 0.748
2.023IleVal: 2.023 ± 0.748
1.012IleTrp: 1.012 ± 0.768
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.552LysAla: 4.552 ± 1.822
1.012LysCys: 1.012 ± 0.374
4.552LysAsp: 4.552 ± 1.539
7.081LysGlu: 7.081 ± 2.503
2.529LysPhe: 2.529 ± 0.824
5.058LysGly: 5.058 ± 1.647
0.506LysHis: 0.506 ± 0.329
1.517LysIle: 1.517 ± 0.987
4.552LysLys: 4.552 ± 1.348
7.587LysLeu: 7.587 ± 2.471
0.0LysMet: 0.0 ± 0.0
1.517LysAsn: 1.517 ± 1.328
2.529LysPro: 2.529 ± 0.824
3.541LysGln: 3.541 ± 1.438
2.529LysArg: 2.529 ± 1.216
4.047LysSer: 4.047 ± 1.244
2.529LysThr: 2.529 ± 1.063
5.564LysVal: 5.564 ± 1.905
1.517LysTrp: 1.517 ± 0.533
1.012LysTyr: 1.012 ± 0.643
0.0LysXaa: 0.0 ± 0.0
Leu
8.599LeuAla: 8.599 ± 1.928
1.517LeuCys: 1.517 ± 0.645
4.047LeuAsp: 4.047 ± 0.744
5.564LeuGlu: 5.564 ± 2.26
4.047LeuPhe: 4.047 ± 0.961
6.07LeuGly: 6.07 ± 0.851
0.0LeuHis: 0.0 ± 0.0
5.058LeuIle: 5.058 ± 0.797
3.035LeuLys: 3.035 ± 1.007
11.128LeuLeu: 11.128 ± 2.162
2.529LeuMet: 2.529 ± 1.002
3.541LeuAsn: 3.541 ± 1.729
4.552LeuPro: 4.552 ± 1.167
3.541LeuGln: 3.541 ± 1.367
6.07LeuArg: 6.07 ± 1.342
9.611LeuSer: 9.611 ± 1.152
3.541LeuThr: 3.541 ± 0.927
5.564LeuVal: 5.564 ± 1.36
2.023LeuTrp: 2.023 ± 0.764
4.552LeuTyr: 4.552 ± 0.834
0.0LeuXaa: 0.0 ± 0.0
Met
2.023MetAla: 2.023 ± 0.425
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.012MetGly: 1.012 ± 0.658
0.0MetHis: 0.0 ± 0.0
0.506MetIle: 0.506 ± 0.723
0.506MetLys: 0.506 ± 0.329
2.529MetLeu: 2.529 ± 0.824
0.0MetMet: 0.0 ± 0.0
2.023MetAsn: 2.023 ± 0.748
1.517MetPro: 1.517 ± 1.056
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
2.023MetSer: 2.023 ± 0.799
2.529MetThr: 2.529 ± 0.57
1.012MetVal: 1.012 ± 0.374
0.0MetTrp: 0.0 ± 0.0
0.506MetTyr: 0.506 ± 0.329
0.0MetXaa: 0.0 ± 0.0
Asn
2.023AsnAla: 2.023 ± 0.593
0.506AsnCys: 0.506 ± 0.492
1.517AsnAsp: 1.517 ± 0.987
1.517AsnGlu: 1.517 ± 0.858
3.035AsnPhe: 3.035 ± 1.024
3.035AsnGly: 3.035 ± 1.065
1.012AsnHis: 1.012 ± 0.374
0.506AsnIle: 0.506 ± 0.723
1.012AsnLys: 1.012 ± 0.643
5.058AsnLeu: 5.058 ± 1.411
0.0AsnMet: 0.0 ± 0.0
0.506AsnAsn: 0.506 ± 0.813
2.529AsnPro: 2.529 ± 0.709
1.012AsnGln: 1.012 ± 0.777
3.541AsnArg: 3.541 ± 0.4
2.529AsnSer: 2.529 ± 0.57
3.035AsnThr: 3.035 ± 0.661
1.517AsnVal: 1.517 ± 0.533
0.506AsnTrp: 0.506 ± 0.723
1.012AsnTyr: 1.012 ± 0.374
0.0AsnXaa: 0.0 ± 0.0
Pro
7.587ProAla: 7.587 ± 1.885
1.517ProCys: 1.517 ± 0.858
1.012ProAsp: 1.012 ± 0.658
4.047ProGlu: 4.047 ± 1.502
1.012ProPhe: 1.012 ± 0.374
4.047ProGly: 4.047 ± 0.975
2.023ProHis: 2.023 ± 0.748
1.012ProIle: 1.012 ± 0.643
2.023ProLys: 2.023 ± 0.748
4.047ProLeu: 4.047 ± 0.961
2.023ProMet: 2.023 ± 0.799
0.506ProAsn: 0.506 ± 0.329
4.552ProPro: 4.552 ± 1.723
1.517ProGln: 1.517 ± 0.504
4.047ProArg: 4.047 ± 0.845
3.541ProSer: 3.541 ± 2.001
3.035ProThr: 3.035 ± 0.627
2.023ProVal: 2.023 ± 0.593
1.517ProTrp: 1.517 ± 0.722
0.506ProTyr: 0.506 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
0.506GlnAla: 0.506 ± 0.329
0.506GlnCys: 0.506 ± 0.813
0.0GlnAsp: 0.0 ± 0.0
3.035GlnGlu: 3.035 ± 1.461
0.0GlnPhe: 0.0 ± 0.0
2.023GlnGly: 2.023 ± 0.919
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
1.517GlnLys: 1.517 ± 0.81
4.047GlnLeu: 4.047 ± 0.611
1.012GlnMet: 1.012 ± 1.445
2.529GlnAsn: 2.529 ± 1.073
2.023GlnPro: 2.023 ± 0.593
1.012GlnGln: 1.012 ± 1.445
1.012GlnArg: 1.012 ± 0.777
6.07GlnSer: 6.07 ± 1.421
1.012GlnThr: 1.012 ± 0.835
2.529GlnVal: 2.529 ± 0.57
0.506GlnTrp: 0.506 ± 0.329
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.058ArgAla: 5.058 ± 0.732
1.012ArgCys: 1.012 ± 0.768
2.529ArgAsp: 2.529 ± 1.331
5.058ArgGlu: 5.058 ± 0.793
4.552ArgPhe: 4.552 ± 1.433
4.047ArgGly: 4.047 ± 1.528
0.506ArgHis: 0.506 ± 0.329
2.529ArgIle: 2.529 ± 1.644
3.035ArgLys: 3.035 ± 1.122
3.541ArgLeu: 3.541 ± 0.763
0.0ArgMet: 0.0 ± 0.0
1.517ArgAsn: 1.517 ± 0.645
1.517ArgPro: 1.517 ± 0.645
1.012ArgGln: 1.012 ± 0.777
3.541ArgArg: 3.541 ± 0.906
4.047ArgSer: 4.047 ± 1.568
2.023ArgThr: 2.023 ± 1.703
4.552ArgVal: 4.552 ± 1.998
1.517ArgTrp: 1.517 ± 0.533
0.506ArgTyr: 0.506 ± 0.723
0.0ArgXaa: 0.0 ± 0.0
Ser
3.541SerAla: 3.541 ± 1.883
1.517SerCys: 1.517 ± 0.858
6.576SerAsp: 6.576 ± 1.441
4.552SerGlu: 4.552 ± 2.467
2.023SerPhe: 2.023 ± 0.425
11.128SerGly: 11.128 ± 1.856
1.517SerHis: 1.517 ± 0.504
2.529SerIle: 2.529 ± 1.331
8.093SerLys: 8.093 ± 1.554
10.116SerLeu: 10.116 ± 2.898
1.517SerMet: 1.517 ± 0.501
3.541SerAsn: 3.541 ± 0.876
3.541SerPro: 3.541 ± 1.451
3.035SerGln: 3.035 ± 1.024
3.035SerArg: 3.035 ± 1.321
12.645SerSer: 12.645 ± 2.228
6.07SerThr: 6.07 ± 1.953
5.564SerVal: 5.564 ± 0.459
2.023SerTrp: 2.023 ± 0.764
2.529SerTyr: 2.529 ± 1.063
0.0SerXaa: 0.0 ± 0.0
Thr
4.047ThrAla: 4.047 ± 1.313
2.023ThrCys: 2.023 ± 0.593
1.012ThrAsp: 1.012 ± 0.374
3.035ThrGlu: 3.035 ± 0.477
1.012ThrPhe: 1.012 ± 0.374
3.035ThrGly: 3.035 ± 1.065
1.517ThrHis: 1.517 ± 0.645
5.058ThrIle: 5.058 ± 0.22
4.552ThrLys: 4.552 ± 1.713
4.552ThrLeu: 4.552 ± 2.962
0.0ThrMet: 0.0 ± 0.0
1.517ThrAsn: 1.517 ± 0.722
2.529ThrPro: 2.529 ± 0.57
1.517ThrGln: 1.517 ± 1.417
2.529ThrArg: 2.529 ± 1.479
4.047ThrSer: 4.047 ± 1.495
4.552ThrThr: 4.552 ± 2.456
5.058ThrVal: 5.058 ± 2.126
0.506ThrTrp: 0.506 ± 0.723
1.012ThrTyr: 1.012 ± 1.445
0.0ThrXaa: 0.0 ± 0.0
Val
4.047ValAla: 4.047 ± 1.815
1.012ValCys: 1.012 ± 0.643
6.576ValAsp: 6.576 ± 1.476
6.576ValGlu: 6.576 ± 1.088
0.506ValPhe: 0.506 ± 0.329
7.587ValGly: 7.587 ± 1.513
1.012ValHis: 1.012 ± 0.374
4.047ValIle: 4.047 ± 0.611
4.552ValLys: 4.552 ± 0.823
5.564ValLeu: 5.564 ± 0.57
2.023ValMet: 2.023 ± 0.748
4.047ValAsn: 4.047 ± 0.825
3.541ValPro: 3.541 ± 1.077
2.023ValGln: 2.023 ± 0.748
3.035ValArg: 3.035 ± 0.661
5.058ValSer: 5.058 ± 1.249
2.023ValThr: 2.023 ± 0.593
4.552ValVal: 4.552 ± 0.876
2.529ValTrp: 2.529 ± 0.57
2.529ValTyr: 2.529 ± 0.705
0.0ValXaa: 0.0 ± 0.0
Trp
0.506TrpAla: 0.506 ± 0.329
1.517TrpCys: 1.517 ± 1.056
1.012TrpAsp: 1.012 ± 0.768
2.023TrpGlu: 2.023 ± 0.764
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
2.023TrpHis: 2.023 ± 0.748
0.0TrpIle: 0.0 ± 0.0
1.517TrpLys: 1.517 ± 0.504
0.506TrpLeu: 0.506 ± 0.723
1.012TrpMet: 1.012 ± 0.374
0.506TrpAsn: 0.506 ± 0.329
1.517TrpPro: 1.517 ± 0.858
1.012TrpGln: 1.012 ± 0.658
1.517TrpArg: 1.517 ± 0.533
4.047TrpSer: 4.047 ± 0.986
0.0TrpThr: 0.0 ± 0.0
1.012TrpVal: 1.012 ± 0.777
1.517TrpTrp: 1.517 ± 0.504
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.023TyrAla: 2.023 ± 1.213
1.012TyrCys: 1.012 ± 0.658
1.517TyrAsp: 1.517 ± 1.056
3.541TyrGlu: 3.541 ± 1.375
0.0TyrPhe: 0.0 ± 0.0
3.035TyrGly: 3.035 ± 0.627
1.012TyrHis: 1.012 ± 0.374
1.012TyrIle: 1.012 ± 0.658
2.023TyrLys: 2.023 ± 0.748
3.541TyrLeu: 3.541 ± 0.588
1.517TyrMet: 1.517 ± 0.533
0.0TyrAsn: 0.0 ± 0.0
0.506TyrPro: 0.506 ± 0.329
1.012TyrGln: 1.012 ± 0.768
0.506TyrArg: 0.506 ± 0.723
4.552TyrSer: 4.552 ± 1.169
0.0TyrThr: 0.0 ± 0.0
1.012TyrVal: 1.012 ± 0.643
1.517TyrTrp: 1.517 ± 0.645
1.012TyrTyr: 1.012 ± 0.643
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1978 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski