Amino acid dipeptide frequency for Scheffersomyces segobiensis virus L

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
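
The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to build this table: the function name `dipeptide_permille`, the one-letter amino acid codes, and the toy sequences are assumptions made for the example. The normalisation (dividing by the total number of dipeptides counted within proteins) is also an assumption, as the page does not state it.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Expected value if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

# Made-up sequences for illustration, not the viral proteome.
toy = ["MKKLA", "AKLA"]
freqs = dipeptide_permille(toy)

# Per mille -> fraction: multiply by 10**-3.
fraction_kl = freqs["KL"] * 1e-3

# Mirror the red/blue marking: compare each value with the uniform expectation.
over = sorted(p for p, f in freqs.items() if f > UNIFORM_PERMILLE)
```

With real data, values above `UNIFORM_PERMILLE` would correspond to the dipeptides marked in red and values below it to those marked in blue.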

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.23 ± 0.342
AlaCys: 0.656 ± 0.303
AlaAsp: 1.311 ± 0.606
AlaGlu: 2.623 ± 0.663
AlaPhe: 2.623 ± 0.663
AlaGly: 2.295 ± 0.232
AlaHis: 1.311 ± 0.495
AlaIle: 6.557 ± 0.287
AlaLys: 6.557 ± 0.826
AlaLeu: 5.902 ± 0.529
AlaMet: 1.967 ± 0.193
AlaAsn: 5.246 ± 1.325
AlaPro: 4.918 ± 0.332
AlaGln: 1.311 ± 0.495
AlaArg: 4.918 ± 0.332
AlaSer: 4.918 ± 0.299
AlaThr: 3.934 ± 1.269
AlaVal: 4.918 ± 0.299
AlaTrp: 1.311 ± 0.495
AlaTyr: 2.623 ± 1.213
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.311 ± 0.057
CysGlu: 2.623 ± 0.115
CysPhe: 0.656 ± 0.248
CysGly: 0.656 ± 0.248
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.656 ± 0.303
CysPro: 1.311 ± 0.057
CysGln: 0.0 ± 0.0
CysArg: 0.656 ± 0.248
CysSer: 0.656 ± 0.248
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.656 ± 0.248
CysTyr: 0.656 ± 0.303
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.279 ± 0.688
AspCys: 0.656 ± 0.303
AspAsp: 4.918 ± 0.822
AspGlu: 2.623 ± 0.44
AspPhe: 5.902 ± 0.579
AspGly: 1.311 ± 0.606
AspHis: 2.623 ± 0.115
AspIle: 2.951 ± 0.202
AspLys: 1.967 ± 0.193
AspLeu: 3.607 ± 0.77
AspMet: 0.656 ± 0.248
AspAsn: 1.311 ± 0.495
AspPro: 1.967 ± 0.359
AspGln: 1.311 ± 0.606
AspArg: 2.623 ± 0.115
AspSer: 2.623 ± 0.44
AspThr: 4.59 ± 0.093
AspVal: 5.246 ± 0.88
AspTrp: 0.656 ± 0.248
AspTyr: 4.59 ± 0.633
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.623 ± 1.101
GluCys: 0.656 ± 0.248
GluAsp: 3.934 ± 0.386
GluGlu: 4.262 ± 0.168
GluPhe: 3.934 ± 0.719
GluGly: 2.623 ± 0.663
GluHis: 0.0 ± 0.0
GluIle: 2.951 ± 0.477
GluLys: 2.623 ± 0.99
GluLeu: 6.557 ± 0.281
GluMet: 1.967 ± 1.141
GluAsn: 3.279 ± 0.14
GluPro: 1.311 ± 0.057
GluGln: 1.311 ± 0.495
GluArg: 3.279 ± 0.14
GluSer: 6.23 ± 0.277
GluThr: 3.607 ± 0.262
GluVal: 4.59 ± 1.022
GluTrp: 1.967 ± 0.743
GluTyr: 1.967 ± 0.193
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.279 ± 0.416
PheCys: 0.656 ± 0.303
PheAsp: 3.934 ± 0.719
PheGlu: 1.311 ± 0.057
PhePhe: 1.967 ± 0.359
PheGly: 4.262 ± 0.168
PheHis: 0.0 ± 0.0
PheIle: 1.967 ± 0.374
PheLys: 2.623 ± 0.115
PheLeu: 1.639 ± 0.246
PheMet: 0.984 ± 0.318
PheAsn: 6.557 ± 0.287
PhePro: 1.311 ± 0.057
PheGln: 0.0 ± 0.0
PheArg: 2.623 ± 0.44
PheSer: 2.623 ± 0.663
PheThr: 2.623 ± 0.663
PheVal: 2.623 ± 0.44
PheTrp: 0.0 ± 0.0
PheTyr: 1.967 ± 0.359
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.295 ± 0.232
GlyCys: 1.311 ± 0.495
GlyAsp: 3.607 ± 0.388
GlyGlu: 3.934 ± 0.386
GlyPhe: 1.639 ± 0.435
GlyGly: 3.934 ± 0.386
GlyHis: 1.311 ± 0.057
GlyIle: 4.59 ± 1.022
GlyLys: 3.934 ± 0.172
GlyLeu: 3.279 ± 0.14
GlyMet: 2.623 ± 0.99
GlyAsn: 0.656 ± 0.248
GlyPro: 0.656 ± 0.303
GlyGln: 1.311 ± 0.606
GlyArg: 3.279 ± 0.688
GlySer: 3.607 ± 0.77
GlyThr: 3.279 ± 0.688
GlyVal: 5.902 ± 0.529
GlyTrp: 1.967 ± 0.193
GlyTyr: 3.279 ± 0.416
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.967 ± 0.359
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.311 ± 0.495
HisHis: 0.656 ± 0.248
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 5.246 ± 0.88
HisMet: 0.656 ± 0.303
HisAsn: 1.967 ± 0.743
HisPro: 1.311 ± 0.495
HisGln: 0.0 ± 0.0
HisArg: 1.967 ± 0.743
HisSer: 1.967 ± 0.193
HisThr: 0.0 ± 0.0
HisVal: 1.967 ± 0.359
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.902 ± 0.529
IleCys: 0.0 ± 0.0
IleAsp: 4.59 ± 1.183
IleGlu: 5.574 ± 0.579
IlePhe: 3.934 ± 0.172
IleGly: 5.246 ± 0.332
IleHis: 0.656 ± 0.303
IleIle: 3.934 ± 0.386
IleLys: 4.59 ± 1.022
IleLeu: 5.574 ± 0.148
IleMet: 0.656 ± 0.303
IleAsn: 3.279 ± 0.966
IlePro: 2.295 ± 0.444
IleGln: 0.656 ± 0.303
IleArg: 3.934 ± 0.935
IleSer: 4.59 ± 1.733
IleThr: 2.623 ± 0.44
IleVal: 3.279 ± 0.14
IleTrp: 0.656 ± 0.303
IleTyr: 1.967 ± 0.743
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.246 ± 0.332
LysCys: 0.656 ± 0.303
LysAsp: 2.623 ± 0.44
LysGlu: 4.59 ± 0.093
LysPhe: 1.311 ± 0.057
LysGly: 1.311 ± 0.606
LysHis: 1.311 ± 0.057
LysIle: 2.623 ± 0.44
LysLys: 1.311 ± 0.057
LysLeu: 5.246 ± 0.229
LysMet: 0.656 ± 0.303
LysAsn: 0.656 ± 0.303
LysPro: 0.656 ± 0.303
LysGln: 1.311 ± 0.057
LysArg: 5.246 ± 0.229
LysSer: 5.246 ± 0.229
LysThr: 3.279 ± 0.14
LysVal: 2.623 ± 0.44
LysTrp: 0.0 ± 0.0
LysTyr: 5.902 ± 0.529
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.852 ± 2.078
LeuCys: 0.0 ± 0.0
LeuAsp: 3.279 ± 1.238
LeuGlu: 3.607 ± 0.262
LeuPhe: 5.246 ± 0.229
LeuGly: 4.918 ± 0.332
LeuHis: 1.967 ± 0.743
LeuIle: 7.213 ± 1.073
LeuLys: 3.934 ± 0.172
LeuLeu: 6.557 ± 0.734
LeuMet: 3.279 ± 0.688
LeuAsn: 4.59 ± 0.633
LeuPro: 5.902 ± 1.628
LeuGln: 1.639 ± 0.246
LeuArg: 5.246 ± 0.775
LeuSer: 8.197 ± 1.776
LeuThr: 7.213 ± 1.685
LeuVal: 8.525 ± 0.472
LeuTrp: 0.656 ± 0.248
LeuTyr: 3.279 ± 0.688
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.311 ± 0.495
MetCys: 0.0 ± 0.0
MetAsp: 0.656 ± 0.248
MetGlu: 1.311 ± 0.057
MetPhe: 1.967 ± 0.359
MetGly: 1.311 ± 0.057
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.656 ± 0.303
MetLeu: 1.967 ± 0.193
MetMet: 0.656 ± 0.248
MetAsn: 2.623 ± 0.44
MetPro: 1.311 ± 0.057
MetGln: 0.0 ± 0.0
MetArg: 0.656 ± 0.248
MetSer: 2.623 ± 0.44
MetThr: 3.934 ± 0.719
MetVal: 1.311 ± 0.057
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.279 ± 0.966
AsnCys: 0.656 ± 0.248
AsnAsp: 4.59 ± 0.473
AsnGlu: 2.623 ± 0.115
AsnPhe: 0.656 ± 0.303
AsnGly: 3.934 ± 0.719
AsnHis: 1.967 ± 0.193
AsnIle: 3.934 ± 0.386
AsnLys: 3.934 ± 0.386
AsnLeu: 5.902 ± 0.066
AsnMet: 1.311 ± 0.495
AsnAsn: 0.0 ± 0.0
AsnPro: 1.311 ± 0.495
AsnGln: 0.656 ± 0.248
AsnArg: 3.279 ± 0.688
AsnSer: 4.59 ± 0.093
AsnThr: 1.967 ± 0.193
AsnVal: 5.246 ± 0.775
AsnTrp: 0.656 ± 0.303
AsnTyr: 3.279 ± 0.14
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.951 ± 0.481
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 1.311 ± 0.057
ProPhe: 3.279 ± 0.14
ProGly: 1.967 ± 0.193
ProHis: 0.656 ± 0.248
ProIle: 2.295 ± 0.232
ProLys: 3.279 ± 0.966
ProLeu: 3.934 ± 1.269
ProMet: 1.311 ± 0.057
ProAsn: 4.59 ± 1.022
ProPro: 2.295 ± 0.719
ProGln: 1.967 ± 0.193
ProArg: 1.311 ± 0.606
ProSer: 0.656 ± 0.248
ProThr: 1.311 ± 0.057
ProVal: 5.574 ± 1.102
ProTrp: 0.656 ± 0.248
ProTyr: 1.311 ± 0.057
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.656 ± 0.248
GlnCys: 0.0 ± 0.0
GlnAsp: 0.656 ± 0.248
GlnGlu: 0.656 ± 0.248
GlnPhe: 0.656 ± 0.248
GlnGly: 0.656 ± 0.248
GlnHis: 0.656 ± 0.248
GlnIle: 1.967 ± 0.359
GlnLys: 1.967 ± 0.193
GlnLeu: 2.295 ± 0.719
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.984 ± 0.669
GlnGln: 0.0 ± 0.0
GlnArg: 1.311 ± 0.495
GlnSer: 0.656 ± 0.303
GlnThr: 1.311 ± 0.057
GlnVal: 2.295 ± 0.232
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.311 ± 0.495
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.59 ± 0.633
ArgCys: 0.0 ± 0.0
ArgAsp: 2.623 ± 0.44
ArgGlu: 6.557 ± 0.826
ArgPhe: 3.279 ± 0.966
ArgGly: 3.279 ± 0.416
ArgHis: 0.0 ± 0.0
ArgIle: 3.279 ± 0.14
ArgLys: 1.967 ± 0.359
ArgLeu: 6.885 ± 0.148
ArgMet: 0.656 ± 0.248
ArgAsn: 3.279 ± 0.688
ArgPro: 0.656 ± 0.303
ArgGln: 0.656 ± 0.248
ArgArg: 3.279 ± 0.688
ArgSer: 4.59 ± 1.183
ArgThr: 3.279 ± 1.238
ArgVal: 5.902 ± 1.678
ArgTrp: 0.656 ± 0.248
ArgTyr: 3.279 ± 0.14
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.557 ± 0.287
SerCys: 0.656 ± 0.303
SerAsp: 3.279 ± 0.14
SerGlu: 4.262 ± 1.067
SerPhe: 1.967 ± 0.743
SerGly: 7.213 ± 1.623
SerHis: 0.656 ± 0.303
SerIle: 5.902 ± 0.066
SerLys: 1.311 ± 0.057
SerLeu: 7.213 ± 1.073
SerMet: 1.311 ± 0.057
SerAsn: 2.623 ± 0.44
SerPro: 3.934 ± 1.269
SerGln: 1.967 ± 0.429
SerArg: 3.279 ± 1.238
SerSer: 4.262 ± 0.168
SerThr: 7.541 ± 0.223
SerVal: 6.23 ± 0.342
SerTrp: 0.656 ± 0.303
SerTyr: 3.279 ± 0.966
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.246 ± 1.875
ThrCys: 0.656 ± 0.248
ThrAsp: 2.295 ± 0.719
ThrGlu: 5.246 ± 0.332
ThrPhe: 0.0 ± 0.0
ThrGly: 2.623 ± 0.663
ThrHis: 0.656 ± 0.248
ThrIle: 6.557 ± 0.281
ThrLys: 2.623 ± 0.115
ThrLeu: 6.557 ± 0.281
ThrMet: 1.311 ± 0.057
ThrAsn: 5.246 ± 0.229
ThrPro: 2.623 ± 0.115
ThrGln: 0.656 ± 0.248
ThrArg: 1.967 ± 0.743
ThrSer: 5.574 ± 0.148
ThrThr: 3.934 ± 0.719
ThrVal: 5.902 ± 0.579
ThrTrp: 0.656 ± 0.303
ThrTyr: 1.967 ± 0.91
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.934 ± 0.935
ValCys: 1.311 ± 0.495
ValAsp: 7.869 ± 0.231
ValGlu: 3.934 ± 0.652
ValPhe: 2.295 ± 0.232
ValGly: 4.262 ± 0.168
ValHis: 3.934 ± 0.935
ValIle: 5.246 ± 0.229
ValLys: 4.59 ± 0.093
ValLeu: 8.525 ± 1.191
ValMet: 0.656 ± 0.303
ValAsn: 5.902 ± 0.066
ValPro: 3.279 ± 1.238
ValGln: 1.311 ± 0.495
ValArg: 4.59 ± 0.633
ValSer: 7.213 ± 0.081
ValThr: 4.59 ± 0.473
ValVal: 3.934 ± 1.269
ValTrp: 0.656 ± 0.248
ValTyr: 5.246 ± 0.88
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.656 ± 0.303
TrpCys: 0.656 ± 0.248
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.656 ± 0.248
TrpGly: 0.656 ± 0.248
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.656 ± 0.248
TrpLeu: 2.623 ± 0.115
TrpMet: 0.656 ± 0.303
TrpAsn: 0.656 ± 0.248
TrpPro: 0.0 ± 0.0
TrpGln: 0.656 ± 0.248
TrpArg: 1.311 ± 0.495
TrpSer: 0.656 ± 0.248
TrpThr: 0.656 ± 0.248
TrpVal: 1.967 ± 0.193
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.656 ± 0.248
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.279 ± 0.688
TyrCys: 1.311 ± 0.057
TyrAsp: 3.934 ± 0.172
TyrGlu: 2.623 ± 0.44
TyrPhe: 1.311 ± 0.057
TyrGly: 2.623 ± 0.115
TyrHis: 0.656 ± 0.248
TyrIle: 1.967 ± 0.193
TyrLys: 3.279 ± 0.966
TyrLeu: 4.59 ± 0.473
TyrMet: 0.656 ± 0.248
TyrAsn: 0.656 ± 0.248
TyrPro: 2.623 ± 0.115
TyrGln: 1.311 ± 0.606
TyrArg: 3.934 ± 0.719
TyrSer: 2.623 ± 0.663
TyrThr: 2.623 ± 0.115
TyrVal: 5.246 ± 0.88
TyrTrp: 1.311 ± 0.495
TyrTyr: 1.967 ± 0.359
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3051 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
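
A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's own code: the helper names `permille` and `bootstrap_error` are hypothetical, sequences use one-letter codes, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the note only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def permille(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample).get(pair, 0.0))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)
```

With only 3 proteins, as here, each resample can take just a handful of distinct compositions, so errors estimated this way are coarse.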

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski