Amino acid dipepetide frequency for Triatoma virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.132AlaAla: 4.132 ± 1.46
0.376AlaCys: 0.376 ± 0.197
3.381AlaAsp: 3.381 ± 2.459
1.503AlaGlu: 1.503 ± 0.184
4.132AlaPhe: 4.132 ± 0.354
3.381AlaGly: 3.381 ± 1.25
1.503AlaHis: 1.503 ± 1.026
3.005AlaIle: 3.005 ± 0.367
1.503AlaLys: 1.503 ± 0.788
6.011AlaLeu: 6.011 ± 0.475
1.878AlaMet: 1.878 ± 0.224
1.127AlaAsn: 1.127 ± 0.618
3.757AlaPro: 3.757 ± 0.448
1.878AlaGln: 1.878 ± 0.224
3.005AlaArg: 3.005 ± 0.972
2.63AlaSer: 2.63 ± 1.039
2.254AlaThr: 2.254 ± 1.236
4.884AlaVal: 4.884 ± 0.462
0.751AlaTrp: 0.751 ± 0.394
1.878AlaTyr: 1.878 ± 0.224
0.0AlaXaa: 0.0 ± 0.0
Cys
0.751CysAla: 0.751 ± 0.211
0.376CysCys: 0.376 ± 0.197
1.503CysAsp: 1.503 ± 0.788
1.878CysGlu: 1.878 ± 0.985
0.751CysPhe: 0.751 ± 0.394
1.503CysGly: 1.503 ± 0.184
0.0CysHis: 0.0 ± 0.0
1.127CysIle: 1.127 ± 0.014
0.751CysLys: 0.751 ± 0.394
1.878CysLeu: 1.878 ± 0.985
0.0CysMet: 0.0 ± 0.0
0.376CysAsn: 0.376 ± 0.197
0.376CysPro: 0.376 ± 0.197
0.376CysGln: 0.376 ± 0.197
1.127CysArg: 1.127 ± 0.591
3.381CysSer: 3.381 ± 1.169
0.751CysThr: 0.751 ± 0.394
1.878CysVal: 1.878 ± 0.381
0.0CysTrp: 0.0 ± 0.0
0.751CysTyr: 0.751 ± 0.394
0.0CysXaa: 0.0 ± 0.0
Asp
1.878AspAla: 1.878 ± 1.433
1.127AspCys: 1.127 ± 0.591
4.132AspAsp: 4.132 ± 0.354
4.884AspGlu: 4.884 ± 0.143
3.381AspPhe: 3.381 ± 1.25
3.005AspGly: 3.005 ± 0.238
0.751AspHis: 0.751 ± 0.394
4.132AspIle: 4.132 ± 1.46
1.878AspLys: 1.878 ± 0.381
7.513AspLeu: 7.513 ± 2.731
0.376AspMet: 0.376 ± 0.197
2.63AspAsn: 2.63 ± 1.039
1.878AspPro: 1.878 ± 0.381
2.63AspGln: 2.63 ± 1.644
1.127AspArg: 1.127 ± 0.591
3.381AspSer: 3.381 ± 0.041
3.757AspThr: 3.757 ± 1.366
3.757AspVal: 3.757 ± 1.053
1.503AspTrp: 1.503 ± 0.788
1.503AspTyr: 1.503 ± 0.184
0.0AspXaa: 0.0 ± 0.0
Glu
1.503GluAla: 1.503 ± 0.184
0.376GluCys: 0.376 ± 0.408
2.63GluAsp: 2.63 ± 0.775
1.503GluGlu: 1.503 ± 0.788
5.259GluPhe: 5.259 ± 2.154
1.127GluGly: 1.127 ± 0.014
0.0GluHis: 0.0 ± 0.0
4.508GluIle: 4.508 ± 1.155
4.884GluLys: 4.884 ± 0.748
4.508GluLeu: 4.508 ± 1.155
1.503GluMet: 1.503 ± 0.788
1.878GluAsn: 1.878 ± 0.224
1.503GluPro: 1.503 ± 0.184
3.005GluGln: 3.005 ± 0.842
3.757GluArg: 3.757 ± 0.156
6.386GluSer: 6.386 ± 2.14
1.878GluThr: 1.878 ± 0.381
6.762GluVal: 6.762 ± 0.686
0.376GluTrp: 0.376 ± 0.197
1.878GluTyr: 1.878 ± 0.224
0.0GluXaa: 0.0 ± 0.0
Phe
3.381PheAla: 3.381 ± 0.041
1.878PheCys: 1.878 ± 0.381
3.381PheAsp: 3.381 ± 0.564
3.005PheGlu: 3.005 ± 0.238
2.254PhePhe: 2.254 ± 0.632
2.63PheGly: 2.63 ± 0.775
1.127PheHis: 1.127 ± 0.618
2.63PheIle: 2.63 ± 0.17
4.132PheLys: 4.132 ± 0.354
4.508PheLeu: 4.508 ± 0.054
0.376PheMet: 0.376 ± 0.197
3.005PheAsn: 3.005 ± 0.367
3.005PhePro: 3.005 ± 0.238
1.878PheGln: 1.878 ± 0.381
1.878PheArg: 1.878 ± 0.829
6.762PheSer: 6.762 ± 0.081
3.005PheThr: 3.005 ± 2.051
6.011PheVal: 6.011 ± 0.129
0.751PheTrp: 0.751 ± 0.394
1.878PheTyr: 1.878 ± 0.224
0.0PheXaa: 0.0 ± 0.0
Gly
1.878GlyAla: 1.878 ± 0.224
0.751GlyCys: 0.751 ± 0.394
4.132GlyAsp: 4.132 ± 0.958
3.381GlyGlu: 3.381 ± 0.564
6.011GlyPhe: 6.011 ± 1.684
3.005GlyGly: 3.005 ± 0.238
1.503GlyHis: 1.503 ± 0.184
4.884GlyIle: 4.884 ± 0.748
4.132GlyLys: 4.132 ± 0.251
4.132GlyLeu: 4.132 ± 0.856
0.376GlyMet: 0.376 ± 0.408
1.878GlyAsn: 1.878 ± 0.829
1.503GlyPro: 1.503 ± 0.184
1.127GlyGln: 1.127 ± 0.014
1.878GlyArg: 1.878 ± 0.224
1.878GlySer: 1.878 ± 0.224
3.005GlyThr: 3.005 ± 0.238
4.884GlyVal: 4.884 ± 0.462
0.0GlyTrp: 0.0 ± 0.0
3.005GlyTyr: 3.005 ± 0.367
0.0GlyXaa: 0.0 ± 0.0
His
0.751HisAla: 0.751 ± 0.394
0.0HisCys: 0.0 ± 0.0
1.503HisAsp: 1.503 ± 0.184
0.751HisGlu: 0.751 ± 0.211
0.751HisPhe: 0.751 ± 0.211
0.376HisGly: 0.376 ± 0.197
0.0HisHis: 0.0 ± 0.0
1.878HisIle: 1.878 ± 0.381
0.751HisLys: 0.751 ± 0.394
0.0HisLeu: 0.0 ± 0.0
0.376HisMet: 0.376 ± 0.197
0.376HisAsn: 0.376 ± 0.197
1.127HisPro: 1.127 ± 1.223
0.0HisGln: 0.0 ± 0.0
0.751HisArg: 0.751 ± 0.211
2.254HisSer: 2.254 ± 0.632
0.0HisThr: 0.0 ± 0.0
2.63HisVal: 2.63 ± 0.435
0.376HisTrp: 0.376 ± 0.197
0.376HisTyr: 0.376 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
3.757IleAla: 3.757 ± 0.761
2.254IleCys: 2.254 ± 1.182
4.884IleAsp: 4.884 ± 1.352
3.381IleGlu: 3.381 ± 0.564
1.503IlePhe: 1.503 ± 0.184
4.132IleGly: 4.132 ± 1.46
1.127IleHis: 1.127 ± 0.591
2.63IleIle: 2.63 ± 1.039
3.381IleLys: 3.381 ± 0.041
6.011IleLeu: 6.011 ± 0.734
0.751IleMet: 0.751 ± 0.394
3.005IleAsn: 3.005 ± 0.972
3.757IlePro: 3.757 ± 0.448
1.127IleGln: 1.127 ± 0.014
3.757IleArg: 3.757 ± 0.448
7.513IleSer: 7.513 ± 1.522
4.884IleThr: 4.884 ± 1.066
3.757IleVal: 3.757 ± 0.448
0.376IleTrp: 0.376 ± 0.197
2.63IleTyr: 2.63 ± 0.17
0.0IleXaa: 0.0 ± 0.0
Lys
2.254LysAla: 2.254 ± 0.027
1.127LysCys: 1.127 ± 0.591
2.63LysAsp: 2.63 ± 1.379
3.757LysGlu: 3.757 ± 0.156
2.63LysPhe: 2.63 ± 0.17
1.503LysGly: 1.503 ± 0.421
0.376LysHis: 0.376 ± 0.197
4.132LysIle: 4.132 ± 0.856
2.63LysLys: 2.63 ± 1.379
3.381LysLeu: 3.381 ± 0.645
0.751LysMet: 0.751 ± 0.394
2.254LysAsn: 2.254 ± 1.182
2.63LysPro: 2.63 ± 0.17
2.254LysGln: 2.254 ± 0.027
3.005LysArg: 3.005 ± 0.972
4.884LysSer: 4.884 ± 1.957
3.757LysThr: 3.757 ± 0.761
4.884LysVal: 4.884 ± 0.748
0.376LysTrp: 0.376 ± 0.197
4.132LysTyr: 4.132 ± 0.856
0.0LysXaa: 0.0 ± 0.0
Leu
6.386LeuAla: 6.386 ± 0.326
2.63LeuCys: 2.63 ± 0.775
6.011LeuAsp: 6.011 ± 1.339
4.508LeuGlu: 4.508 ± 1.155
3.381LeuPhe: 3.381 ± 1.169
5.259LeuGly: 5.259 ± 0.265
1.503LeuHis: 1.503 ± 0.184
7.137LeuIle: 7.137 ± 1.93
6.386LeuLys: 6.386 ± 0.278
6.011LeuLeu: 6.011 ± 0.475
1.503LeuMet: 1.503 ± 0.421
4.508LeuAsn: 4.508 ± 1.155
6.386LeuPro: 6.386 ± 1.487
2.63LeuGln: 2.63 ± 0.775
3.757LeuArg: 3.757 ± 0.761
9.767LeuSer: 9.767 ± 1.495
6.386LeuThr: 6.386 ± 1.487
6.762LeuVal: 6.762 ± 1.29
2.63LeuTrp: 2.63 ± 0.435
2.63LeuTyr: 2.63 ± 0.775
0.0LeuXaa: 0.0 ± 0.0
Met
1.127MetAla: 1.127 ± 0.591
1.127MetCys: 1.127 ± 0.591
0.376MetAsp: 0.376 ± 0.408
0.376MetGlu: 0.376 ± 0.294
1.127MetPhe: 1.127 ± 0.591
1.127MetGly: 1.127 ± 0.591
0.376MetHis: 0.376 ± 0.197
0.376MetIle: 0.376 ± 0.408
1.878MetLys: 1.878 ± 0.381
2.254MetLeu: 2.254 ± 0.616
0.376MetMet: 0.376 ± 0.197
1.127MetAsn: 1.127 ± 0.014
1.127MetPro: 1.127 ± 0.014
0.376MetGln: 0.376 ± 0.408
0.751MetArg: 0.751 ± 0.394
1.503MetSer: 1.503 ± 0.421
0.751MetThr: 0.751 ± 0.394
0.0MetVal: 0.0 ± 0.0
0.376MetTrp: 0.376 ± 0.197
2.63MetTyr: 2.63 ± 0.435
0.0MetXaa: 0.0 ± 0.0
Asn
1.878AsnAla: 1.878 ± 0.829
0.376AsnCys: 0.376 ± 0.197
1.878AsnAsp: 1.878 ± 0.381
1.878AsnGlu: 1.878 ± 0.224
4.132AsnPhe: 4.132 ± 0.354
3.005AsnGly: 3.005 ± 0.972
0.751AsnHis: 0.751 ± 0.815
2.63AsnIle: 2.63 ± 0.435
1.127AsnLys: 1.127 ± 0.014
4.132AsnLeu: 4.132 ± 2.167
1.127AsnMet: 1.127 ± 0.618
1.503AsnAsn: 1.503 ± 0.421
3.757AsnPro: 3.757 ± 0.761
1.503AsnGln: 1.503 ± 0.184
2.254AsnArg: 2.254 ± 0.578
3.757AsnSer: 3.757 ± 0.448
3.005AsnThr: 3.005 ± 2.051
4.132AsnVal: 4.132 ± 0.856
0.376AsnTrp: 0.376 ± 0.197
1.878AsnTyr: 1.878 ± 0.829
0.0AsnXaa: 0.0 ± 0.0
Pro
1.503ProAla: 1.503 ± 0.421
0.751ProCys: 0.751 ± 0.394
1.878ProAsp: 1.878 ± 0.224
2.254ProGlu: 2.254 ± 1.182
4.508ProPhe: 4.508 ± 0.659
2.63ProGly: 2.63 ± 0.17
0.376ProHis: 0.376 ± 0.408
4.508ProIle: 4.508 ± 0.551
1.878ProLys: 1.878 ± 0.829
8.64ProLeu: 8.64 ± 1.514
1.127ProMet: 1.127 ± 0.014
1.878ProAsn: 1.878 ± 0.829
0.751ProPro: 0.751 ± 0.211
1.878ProGln: 1.878 ± 1.433
1.878ProArg: 1.878 ± 0.224
7.137ProSer: 7.137 ± 0.116
3.757ProThr: 3.757 ± 1.053
4.508ProVal: 4.508 ± 0.659
1.878ProTrp: 1.878 ± 0.381
2.254ProTyr: 2.254 ± 0.632
0.0ProXaa: 0.0 ± 0.0
Gln
1.878GlnAla: 1.878 ± 0.224
0.751GlnCys: 0.751 ± 0.211
1.127GlnAsp: 1.127 ± 1.223
3.005GlnGlu: 3.005 ± 0.367
1.503GlnPhe: 1.503 ± 0.421
2.254GlnGly: 2.254 ± 0.027
0.376GlnHis: 0.376 ± 0.197
2.254GlnIle: 2.254 ± 0.632
1.127GlnLys: 1.127 ± 0.591
3.381GlnLeu: 3.381 ± 0.041
1.127GlnMet: 1.127 ± 0.014
1.127GlnAsn: 1.127 ± 0.591
2.254GlnPro: 2.254 ± 1.236
1.127GlnGln: 1.127 ± 0.591
3.005GlnArg: 3.005 ± 0.367
2.63GlnSer: 2.63 ± 1.379
1.127GlnThr: 1.127 ± 0.618
1.878GlnVal: 1.878 ± 1.433
0.376GlnTrp: 0.376 ± 0.197
1.503GlnTyr: 1.503 ± 0.421
0.0GlnXaa: 0.0 ± 0.0
Arg
1.878ArgAla: 1.878 ± 0.985
0.751ArgCys: 0.751 ± 0.394
1.878ArgAsp: 1.878 ± 2.038
3.005ArgGlu: 3.005 ± 0.972
3.005ArgPhe: 3.005 ± 0.238
1.878ArgGly: 1.878 ± 0.224
0.0ArgHis: 0.0 ± 0.0
3.757ArgIle: 3.757 ± 1.366
1.878ArgLys: 1.878 ± 0.985
4.132ArgLeu: 4.132 ± 0.354
0.751ArgMet: 0.751 ± 0.394
3.757ArgAsn: 3.757 ± 0.448
4.132ArgPro: 4.132 ± 0.251
1.503ArgGln: 1.503 ± 0.788
1.503ArgArg: 1.503 ± 0.788
4.508ArgSer: 4.508 ± 0.054
1.503ArgThr: 1.503 ± 0.184
2.63ArgVal: 2.63 ± 0.435
0.751ArgTrp: 0.751 ± 0.394
2.63ArgTyr: 2.63 ± 0.775
0.0ArgXaa: 0.0 ± 0.0
Ser
6.762SerAla: 6.762 ± 3.709
1.878SerCys: 1.878 ± 0.985
5.259SerAsp: 5.259 ± 0.869
6.386SerGlu: 6.386 ± 2.14
4.884SerPhe: 4.884 ± 0.143
6.762SerGly: 6.762 ± 0.524
2.254SerHis: 2.254 ± 1.182
5.259SerIle: 5.259 ± 0.34
5.259SerLys: 5.259 ± 1.549
11.645SerLeu: 11.645 ± 0.666
1.503SerMet: 1.503 ± 0.788
2.63SerAsn: 2.63 ± 1.644
5.259SerPro: 5.259 ± 0.265
5.635SerGln: 5.635 ± 0.537
4.508SerArg: 4.508 ± 0.551
8.64SerSer: 8.64 ± 2.119
6.011SerThr: 6.011 ± 0.475
2.63SerVal: 2.63 ± 0.17
0.751SerTrp: 0.751 ± 0.211
2.254SerTyr: 2.254 ± 0.578
0.0SerXaa: 0.0 ± 0.0
Thr
2.254ThrAla: 2.254 ± 0.027
0.376ThrCys: 0.376 ± 0.197
2.63ThrAsp: 2.63 ± 0.17
4.132ThrGlu: 4.132 ± 0.461
2.254ThrPhe: 2.254 ± 1.841
4.132ThrGly: 4.132 ± 1.46
1.503ThrHis: 1.503 ± 0.184
3.757ThrIle: 3.757 ± 0.156
2.63ThrLys: 2.63 ± 0.17
5.635ThrLeu: 5.635 ± 0.897
1.878ThrMet: 1.878 ± 0.381
3.381ThrAsn: 3.381 ± 0.564
4.132ThrPro: 4.132 ± 2.669
1.127ThrGln: 1.127 ± 0.591
2.63ThrArg: 2.63 ± 0.775
6.386ThrSer: 6.386 ± 3.301
3.757ThrThr: 3.757 ± 1.053
3.381ThrVal: 3.381 ± 0.645
0.376ThrTrp: 0.376 ± 0.197
0.376ThrTyr: 0.376 ± 0.197
0.0ThrXaa: 0.0 ± 0.0
Val
4.884ValAla: 4.884 ± 1.066
1.503ValCys: 1.503 ± 0.184
3.005ValAsp: 3.005 ± 0.842
2.63ValGlu: 2.63 ± 0.435
3.757ValPhe: 3.757 ± 0.156
3.757ValGly: 3.757 ± 0.448
0.751ValHis: 0.751 ± 0.815
4.132ValIle: 4.132 ± 0.958
2.254ValLys: 2.254 ± 0.632
7.513ValLeu: 7.513 ± 0.313
2.254ValMet: 2.254 ± 0.027
3.757ValAsn: 3.757 ± 1.657
6.011ValPro: 6.011 ± 0.475
3.005ValGln: 3.005 ± 2.051
3.757ValArg: 3.757 ± 0.761
8.264ValSer: 8.264 ± 1.107
3.757ValThr: 3.757 ± 0.448
4.508ValVal: 4.508 ± 2.472
0.751ValTrp: 0.751 ± 0.211
2.63ValTyr: 2.63 ± 1.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.751TrpAla: 0.751 ± 0.211
0.376TrpCys: 0.376 ± 0.197
0.751TrpAsp: 0.751 ± 0.394
1.127TrpGlu: 1.127 ± 0.591
0.376TrpPhe: 0.376 ± 0.197
0.376TrpGly: 0.376 ± 0.197
0.376TrpHis: 0.376 ± 0.197
0.751TrpIle: 0.751 ± 0.394
1.878TrpLys: 1.878 ± 0.985
0.751TrpLeu: 0.751 ± 0.211
0.0TrpMet: 0.0 ± 0.0
2.254TrpAsn: 2.254 ± 0.578
0.376TrpPro: 0.376 ± 0.197
0.0TrpGln: 0.0 ± 0.0
0.751TrpArg: 0.751 ± 0.815
1.127TrpSer: 1.127 ± 0.591
0.751TrpThr: 0.751 ± 0.815
0.376TrpVal: 0.376 ± 0.197
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.381TyrAla: 3.381 ± 1.25
0.751TyrCys: 0.751 ± 0.394
2.254TyrAsp: 2.254 ± 0.632
1.878TyrGlu: 1.878 ± 0.224
1.878TyrPhe: 1.878 ± 0.224
1.503TyrGly: 1.503 ± 0.184
0.376TyrHis: 0.376 ± 0.408
1.127TyrIle: 1.127 ± 0.014
3.005TyrLys: 3.005 ± 0.367
3.757TyrLeu: 3.757 ± 1.366
1.503TyrMet: 1.503 ± 0.788
2.63TyrAsn: 2.63 ± 0.17
2.254TyrPro: 2.254 ± 0.578
0.751TyrGln: 0.751 ± 0.394
0.751TyrArg: 0.751 ± 0.815
3.381TyrSer: 3.381 ± 0.041
2.63TyrThr: 2.63 ± 0.17
2.63TyrVal: 2.63 ± 1.644
0.376TyrTrp: 0.376 ± 0.408
1.503TyrTyr: 1.503 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2663 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski