#SQL SERVER Online Test Questions For Freshers
Explore tagged Tumblr posts
Link
#SQL SERVER Online Test#SQL SERVER Online Test Question#SQL SERVER questions#SQL SERVER#SQL SERVER Questions For Online Test#Best SQL SERVER Online Test Questions#SQL SERVER Online Test Questions For Freshers#SQL SERVER Online Test questions for experienced#most common SQL SERVER Online Test questions#Top 20 SQL SERVER Online Test questions#Top 50 SQL SERVER Online Test questions#sql online coding test#sql interview test#sql test questions and answers#sql practice questions
0 notes
Text
300+ TOP IBM TIVOLI Interview Questions and Answers
IBM Tivoli Interview Questions for freshers experienced :-
1. The Run as Administrator option is used to bypass user account control specifically on which operating system? Windows 2008 2. Which service access point is created on the target computer during the Tivoli Common Agent installation? Agent-Server (IPv4 / CommonAgent) 3. Which statement is true about static groups? Members can be added anytime after group creation. 4. Which supported database can be used by IBM Tivoli Provisioning Manager V7.2 on a Windows server? IBM DB2 5. What are the two main catalogs managed by the image library? 1. Saved Images 2. Master Images 6. What are your daily activities? TSM Server DB and Log size monitoring, client and admin schedule monitoring and library media management. I also have to install and configure TSM BA clients whenever required and troubleshoot backup/restore issues if any. 7. What are first * and second * in the “Query Event * *”? First * is Domain name, the second * is schedule name 8. How to check admin schedules status? Query Event * t=a 9. Where/How will you see, if errors occurred in the tapes? We can check it in actlog for hardware/media & read-write errors by using the command – “q actlog se=errors” To determine read/write errors for a particular volume run the command “q volume f=d“. This output will show the number of read/write errors occurred to that particular volume. 10. What is your DB size in your environment? Approximately 60 GB. Tip: size depends on your environment
IBM Tivoli Interview Questions 11. What is your tape model? HP LTO4 tape – capacity 800 GB, speed is 120 MB/Sec. 12. Have you done the DR test? Yes, In our environment we should do DR drills for every 6 months to ensure we are ready for any disaster recovery. 13. Have you done a library configuration? Yes, I did it once. 14. What is your library model? IBM 3584 library model with 4 drives 15. Can you explain how to install and configure the tdpsql? STEPS TO INSTALL AND CONFIG TDPSQL 16. What is the use of a maximum number of mount points parameter? Maximum parameter is used to increase/decrease the client backup/restore sessions. This parameter value should not be greater than available tape drives. 17. What are the TSM server tuning parameters? DBMEMPERCENT EXPINTERVAL MAX SESSIONS MOVEBATCHSIZE and MOVESIZETHRESH RESTOREINTERVAL TCPNODELAY TCPWINDOWSIZE TXNGROUPMAX 18. What is collocation? Where will you update it? Collocation is the process of keeping the client backed up data in a minimum number of tapes. Collocation can be done by nodes, node groups and by Filespace. We will update collocation while defining or updating storage pool definitions. update stgpool colloc=node 19. What is cloptset? Where will you update it? Cloptset is used to define a name for a set of options which you can assign to clients for archive, backup, restore, and retrieve operations. We should first define a clioptset name by using “define cloptset” command and then define each client options to that cloptset by using “define clientopt” command. Then you should use this cloptset name while defining any client node. Generally, cloptset is used to assign extra client options for any particular node from the TSM Server end without logging into the client machine. 20. Have you performed TSM DB restoration? Yes, 21. How will you see the errors in the tape? To determine read/write errors for a particular volume run “q volume f=d“. This output will show the number of read/write errors occurred to that particular volume. 22. What will you do if u find errors in the tapes? First, we have to figure it out the source of the error, whether tape drive has caused the error or tape itself has some problems. If the tape drive is the culprit make sure no other tapes are mounted on that particular tape. If tape itself has problems, then audit that tape by using “audit volume fix=no” command. This will show if there are any damaged files. IF there are no damaged files move the data from that tape to another tape, if there are any damaged files then we should restore the contents of that tape from the copy storage pool. 23. Have you done a manual backup? Yes, We can do it by the BA client command line and also through the BA client GUI. 24. How will you login into the client machine do you have any credentials for you separately? Yes, I have client login credentials with limited privileges. 25. How many servers in your environment? 3 TSM Servers 26. What is the TSM library manager? TSM LIBRARY MANAGER 27. On which TSM Server you will define tape drives in a shared tape library environment? We have to define all the drives on both the Library Manager and Library clients. CONFIGURATION OF TSM LIBRARY MANAGER 28. How will you see the lanfree backup process in the server side? By running “storageagentname:q session” Querying act log for “ANR0415I” message which says that “session 101 was proxies by storage agent S1 for node N1” You can also check the backup report for total lanfree bytes transferred. 29. How will you clean the drive? Generally, tape library will have separate cleaning drives, we have to just mount that drive through Library GUI. OR From TSM Server we can run “clean drive” command, this will also mount the cleaning cartridge/tape onto the drive and starts cleaning. 30. How to label the volume? If new tapes are kept in I/O exit port, run this below command label libvol search=bulk labelsource=barcode checkin=scratch TROUBLESHOOTING TAPE LIBRARY 31. What are the parameters in the copy group? a version exists, version deleted, retention extra, retention only, serialization, destination, mode, frequency …… etc. 31. What is the use of reclamation? Reclamation makes the fragmented space on volumes usable again by moving any remaining files from one volume to another volume. This action makes the original volume available for reuse. 32. Is there any chance of moving inactive data from one volume to another using reclamation? No, reclamation will move both active and inactive data. 33. What is audit volume? Audit volume command is used to check the inconsistencies between database information and a storage pool volume. 34. What is fix=yes/no in audit volume command? Fix — Specifies how the server resolves inconsistencies between the database inventory and the specified storage pool volume. Fix=No — Tivoli Storage Manager reports, but does not delete, database records. Fix=Yes — The server fixes any inconsistencies as they are detected 35. What is migration? use of migration? Migration is used to move client files from one storage pool to the next storage pool in the storage hierarchy. The migration will make some empty space in disk pools which can be used for taking a backup of the client machine. 36. Have you done up gradation? which method you have used? Yes, we used the Same System Media Method 37. Have you done upgradation for client machine? Yes, First we have to copy all the important client configuration files, disconnect the connection with TSM Server and then uninstall/install new client version. 38. Have you done the installation of TSM server? tell me the procedure? TSM SERVER 6.3 INSTALLATION STEPS 39. Explain the log flow in the 6.3? RECOVERY LOG IN TSM VERSION 6.30 40. What are the details available in the dsmsched.log and dsmerror.log? dsmsched.log & dsmerror.log will contain the client schedule status details and any schedule error details. 41. Will the lanfree data go to the disk pool? No, Lanfree data will go directly to tape/tape pools. If we have a disk library in a SAN, then we can send lanfree to disks but we need another supporting software. 42. How to see the DB size? Query db f=d 43. How to check the paths? query path f=d 44. How to update path? update path source name destination name srctype=server desttype=drive/library device=devicename online=yes 45. What is the difference between search=yes and search=bulk in LABEL command? YES – Specifies that the server labels only volumes that are stored in the library BULK – Specifies that the server searches the library entry/exit ports for usable volumes to label 46. In tdpsql what does tdpsql.cfg file contains? It contains all the tdpsql performance monitoring parameters like Buffers, Buffers size, SQL buffers, SQL buffer size, data stripes, and log file, etc. 47. How to install tdpsql in the client machine? INSTALL AND CONFIGURATION OF IBM TDPSQL 48. Any idea on TSM Offsite Reclamation? OFFSITE RECLAMATION PROCESS 49. What is the use of storage pool backup (copy stgpools)? If there are any read/write errors on the primary pool volumes, we can restore them from copy pool volumes. 50. How to do Lanfree configuration? STEPS TO INSTALL AND CONFIGURE TSM STORAGE AGENT 51. Explain about policy management? TSM POLICY MANAGEMENT BASICS & POLICY STRUCTURE 52. What are the parameters in the copy group? a version exists, version deleted, retention extra, retention only, serialization, destination, mode, frequency …… etc 53. What is Serialization? It specifies how Tivoli Storage Manager processes files or directories when they are modified during backup processing. IBM Tivoli Questions and Answers Pdf Download Read the full article
0 notes
Text
50% off #Fundamentals of Business Intelligence & Data Analytics World – $10
Learn how SQL Server, SSIS, SSAS, SSRS, DB2, Teradata, Informatica, Data Stage, Tibco, Cognos etc fit in BI architecture
Beginner Level, – Video: 1 hour Other: 0 mins, 12 lectures
Average rating 4.8/5 (4.8)
Course requirements:
Students need to know absolutely nothing about BI or Analytics as the course assumes that the students are completely non-technical
Course description:
This is an introductory course for freshers / beginners / non-technical folks who want to take a sneak peek into the working of a Business Intelligence project and understand different career options and associated responsibilities for the same. This course is about how a BI project is executed, what are the different layers of a BI architecture, and which career track one can choose to work on based on your areas of interest in a BI project. This course is not meant to be a exhaustive reference of each and every tool and technology available in the Business Intelligence field.
This is a fully diagram guided course and focused on BI architecture majorly. This course also lists some industry leading tools and technologies from different vendors that fit into different areas of a BI project.
The course has three major sections – BI Project Lifecycle, BI Architecture and BI Career Tracks. The course does not come with any practice tests or do-it-yourself kind of exercises. But if you have a career or technical question related to this course, I would be able to help the students with the same.
Full details Learn about typical BI Project Lifecycle Learn about end-to-end BI Architecture Learn about different career tracks in a BI Architecture Learn about different tools / technologies available in the industry that are used to specific layers of BI Architecture
Full details Students who are non-technical and curious about how to start a career in Business Intelligence and Analytics should take this course Students who are taking their starting steps out of college in the field of Business Intelligence and Analytics should take this course Students who are from a different technical domain and wants to hear a 14-year experienced architect’s advise to make a career in BI and Analytics should take this course
Full details
Reviews:
“Very Nice course” (Nilesh Bhandarwar)
“This course was great for gaining a broad higher level understanding of BI Architecture. Thank you.” (Carmen Rodriguez)
“Its quick and concise, but you get the big picture and that’s what counts!” (Nayyar Khan)
About Instructor:
Siddharth Mehta
My name is Siddharth Mehta. I work in the role of a Senior Technology Architect with a multinational IT firm. I have career experience of more than 14 years in the IT industry working with multinational IT companies and Fortune 500 clients. Industry Experience -|- International experience of working across geographies (US, UK, Singapore) for multi-national clients in Banking, Logistics, Government, Media Entertainment, Products, Life Sciences and other domains -|- I usually work as a lead architect for a portfolio of applications in web, mobile, BI, Analytics, Data warehousing, Reporting, Collaboration, CMS, NoSQL and other categories. -|- In my present role, I remain responsible for Arch Design, Tech Stack selection, Infra design, 3rd party products evaluation and procurement, and Performance engineering. My primary specialty is Microsoft SQL Server and Microsoft Business Intelligence. I usually work with MS BI integration with cross technologies like Elasticsearch / Lucene, MongoDB, Sharepoint 2013 and 2010, jQuery based framework like Highcharts and GoJS, jQueryMobile, Bootstrap, iOS xCode framework, and many others. Authoring Achievements -|- I have authored more than 200 articles and tutorials collectively for online publications like mssqltips, sqlserverperformance, sqlservercental, techrepublic and others. -|- I have authored / co-authored as well as reviewed books for print-media publication houses like Manning publication, Packt publication. -|- I have also authored a whitepaper for MSDN Library titled: Bulk report generation using SSIS and SSRS 2008 R2. Certifications -|- TOGAF 9 Part I & II certified architect -|- CITA-F certified architect -|- HCAHD (Hortonworks Hadoop certified) -|- MCSD.Net, and MCTS in SQL Server maintenance as well as Developing BI apps using SQL Server -|- Certified Sr. Technology Architect as well as Digital Architect Academics Master of Science with major in Computer Applications and Information Technology
Instructor Other Courses:
SQL Server Master Data Services for Master Data Management SQL Server Analysis Services – SSAS, Data Mining & Analytics …………………………………………………………… Siddharth Mehta coupons Development course coupon Udemy Development course coupon Databases course coupon Udemy Databases course coupon Fundamentals of Business Intelligence & Data Analytics World Fundamentals of Business Intelligence & Data Analytics World course coupon Fundamentals of Business Intelligence & Data Analytics World coupon coupons
The post 50% off #Fundamentals of Business Intelligence & Data Analytics World – $10 appeared first on Udemy Cupón.
from http://www.xpresslearn.com/udemy/coupon/50-off-fundamentals-of-business-intelligence-data-analytics-world-10/
0 notes
Text
50% off #Fundamentals of Business Intelligence & Data Analytics World – $10
Learn how SQL Server, SSIS, SSAS, SSRS, DB2, Teradata, Informatica, Data Stage, Tibco, Cognos etc fit in BI architecture
Beginner Level, – Video: 1 hour Other: 0 mins, 12 lectures
Average rating 4.8/5 (4.8)
Course requirements:
Students need to know absolutely nothing about BI or Analytics as the course assumes that the students are completely non-technical
Course description:
This is an introductory course for freshers / beginners / non-technical folks who want to take a sneak peek into the working of a Business Intelligence project and understand different career options and associated responsibilities for the same. This course is about how a BI project is executed, what are the different layers of a BI architecture, and which career track one can choose to work on based on your areas of interest in a BI project. This course is not meant to be a exhaustive reference of each and every tool and technology available in the Business Intelligence field.
This is a fully diagram guided course and focused on BI architecture majorly. This course also lists some industry leading tools and technologies from different vendors that fit into different areas of a BI project.
The course has three major sections – BI Project Lifecycle, BI Architecture and BI Career Tracks. The course does not come with any practice tests or do-it-yourself kind of exercises. But if you have a career or technical question related to this course, I would be able to help the students with the same.
Full details Learn about typical BI Project Lifecycle Learn about end-to-end BI Architecture Learn about different career tracks in a BI Architecture Learn about different tools / technologies available in the industry that are used to specific layers of BI Architecture
Full details Students who are non-technical and curious about how to start a career in Business Intelligence and Analytics should take this course Students who are taking their starting steps out of college in the field of Business Intelligence and Analytics should take this course Students who are from a different technical domain and wants to hear a 14-year experienced architect’s advise to make a career in BI and Analytics should take this course
Full details
Reviews:
“Very Nice course” (Nilesh Bhandarwar)
“This course was great for gaining a broad higher level understanding of BI Architecture. Thank you.” (Carmen Rodriguez)
“Its quick and concise, but you get the big picture and that’s what counts!” (Nayyar Khan)
About Instructor:
Siddharth Mehta
My name is Siddharth Mehta. I work in the role of a Senior Technology Architect with a multinational IT firm. I have career experience of more than 14 years in the IT industry working with multinational IT companies and Fortune 500 clients. Industry Experience -|- International experience of working across geographies (US, UK, Singapore) for multi-national clients in Banking, Logistics, Government, Media Entertainment, Products, Life Sciences and other domains -|- I usually work as a lead architect for a portfolio of applications in web, mobile, BI, Analytics, Data warehousing, Reporting, Collaboration, CMS, NoSQL and other categories. -|- In my present role, I remain responsible for Arch Design, Tech Stack selection, Infra design, 3rd party products evaluation and procurement, and Performance engineering. My primary specialty is Microsoft SQL Server and Microsoft Business Intelligence. I usually work with MS BI integration with cross technologies like Elasticsearch / Lucene, MongoDB, Sharepoint 2013 and 2010, jQuery based framework like Highcharts and GoJS, jQueryMobile, Bootstrap, iOS xCode framework, and many others. Authoring Achievements -|- I have authored more than 200 articles and tutorials collectively for online publications like mssqltips, sqlserverperformance, sqlservercental, techrepublic and others. -|- I have authored / co-authored as well as reviewed books for print-media publication houses like Manning publication, Packt publication. -|- I have also authored a whitepaper for MSDN Library titled: Bulk report generation using SSIS and SSRS 2008 R2. Certifications -|- TOGAF 9 Part I & II certified architect -|- CITA-F certified architect -|- HCAHD (Hortonworks Hadoop certified) -|- MCSD.Net, and MCTS in SQL Server maintenance as well as Developing BI apps using SQL Server -|- Certified Sr. Technology Architect as well as Digital Architect Academics Master of Science with major in Computer Applications and Information Technology
Instructor Other Courses:
SQL Server Master Data Services for Master Data Management SQL Server Analysis Services – SSAS, Data Mining & Analytics …………………………………………………………… Siddharth Mehta coupons Development course coupon Udemy Development course coupon Databases course coupon Udemy Databases course coupon Fundamentals of Business Intelligence & Data Analytics World Fundamentals of Business Intelligence & Data Analytics World course coupon Fundamentals of Business Intelligence & Data Analytics World coupon coupons
The post 50% off #Fundamentals of Business Intelligence & Data Analytics World – $10 appeared first on Udemy Cupón.
from Udemy Cupón http://www.xpresslearn.com/udemy/coupon/50-off-fundamentals-of-business-intelligence-data-analytics-world-10/
from https://xpresslearn.wordpress.com/2017/02/04/50-off-fundamentals-of-business-intelligence-data-analytics-world-10/
0 notes
Text
50% off #Fundamentals of Business Intelligence & Data Analytics World – $10
Learn how SQL Server, SSIS, SSAS, SSRS, DB2, Teradata, Informatica, Data Stage, Tibco, Cognos etc fit in BI architecture
Beginner Level, – 1 hour, 12 lectures
Average rating 4.4/5 (4.4 (28 ratings) Instead of using a simple lifetime average, Udemy calculates a course’s star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.)
Course requirements:
Students need to know absolutely nothing about BI or Analytics as the course assumes that the students are completely non-technical
Course description:
This is an introductory course for freshers / beginners / non-technical folks who want to take a sneak peek into the working of a Business Intelligence project and understand different career options and associated responsibilities for the same. This course is about how a BI project is executed, what are the different layers of a BI architecture, and which career track one can choose to work on based on your areas of interest in a BI project. This course is not meant to be a exhaustive reference of each and every tool and technology available in the Business Intelligence field.
This is a fully diagram guided course and focused on BI architecture majorly. This course also lists some industry leading tools and technologies from different vendors that fit into different areas of a BI project.
The course has three major sections – BI Project Lifecycle, BI Architecture and BI Career Tracks. The course does not come with any practice tests or do-it-yourself kind of exercises. But if you have a career or technical question related to this course, I would be able to help the students with the same.
Full details Learn about typical BI Project Lifecycle Learn about end-to-end BI Architecture Learn about different career tracks in a BI Architecture Learn about different tools / technologies available in the industry that are used to specific layers of BI Architecture
Full details Students who are non-technical and curious about how to start a career in Business Intelligence and Analytics should take this course Students who are taking their starting steps out of college in the field of Business Intelligence and Analytics should take this course Students who are from a different technical domain and wants to hear a 14-year experienced architect’s advise to make a career in BI and Analytics should take this course
Full details
Reviews:
“The course is just too basic.” (Satya Kolachina)
“Author has been very clear about the scope of this course. It was very useful to me to know the BI architecture, different roles and skills required for the same.” (Ramesh Bupathy)
“A knowledgable teacher. I like the big picture, simple illustrations and the speaking style. A small point is that I found myself bumping the speed up to 1.25x and liking that a bit better but that is a preference.” (Michael Scaman)
About Instructor:
Siddharth Mehta
My name is Siddharth Mehta. I work in the role of a Senior Technology Architect with a multinational IT firm. I have career experience of more than 14 years in the IT industry working with multinational IT companies and Fortune 500 clients. Industry Experience -|- International experience of working across geographies (US, UK, Singapore) for multi-national clients in Banking, Logistics, Government, Media Entertainment, Products, Life Sciences and other domains -|- I usually work as a lead architect for a portfolio of applications in web, mobile, BI, Analytics, Data warehousing, Reporting, Collaboration, CMS, NoSQL and other categories. -|- In my present role, I remain responsible for Arch Design, Tech Stack selection, Infra design, 3rd party products evaluation and procurement, and Performance engineering. My primary specialty is Microsoft SQL Server and Microsoft Business Intelligence. I usually work with MS BI integration with cross technologies like Elasticsearch / Lucene, MongoDB, Sharepoint 2013 and 2010, jQuery based framework like Highcharts and GoJS, jQueryMobile, Bootstrap, iOS xCode framework, and many others. Authoring Achievements -|- I have authored more than 200 articles and tutorials collectively for online publications like mssqltips, sqlserverperformance, sqlservercental, techrepublic and others. -|- I have authored / co-authored as well as reviewed books for print-media publication houses like Manning publication, Packt publication. -|- I have also authored a whitepaper for MSDN Library titled: Bulk report generation using SSIS and SSRS 2008 R2. Certifications -|- TOGAF 9 Part I & II certified architect -|- CITA-F certified architect -|- HCAHD (Hortonworks Hadoop certified) -|- MCSD.Net, and MCTS in SQL Server maintenance as well as Developing BI apps using SQL Server -|- Certified Sr. Technology Architect as well as Digital Architect Academics Master of Science with major in Computer Applications and Information Technology
Instructor Other Courses:
SQL Server Master Data Services for Master Data Management Siddharth Mehta, Published Author, Microsoft MVP Alumni, Sr. Tech Arch (5) $10 $100 SQL Server Analysis Services – SSAS, Data Mining & Analytics …………………………………………………………… Siddharth Mehta coupons Development course coupon Udemy Development course coupon Databases course coupon Udemy Databases course coupon Fundamentals of Business Intelligence & Data Analytics World Fundamentals of Business Intelligence & Data Analytics World course coupon Fundamentals of Business Intelligence & Data Analytics World coupon coupons
The post 50% off #Fundamentals of Business Intelligence & Data Analytics World – $10 appeared first on Udemy Cupón/ Udemy Coupon/.
from Udemy Cupón/ Udemy Coupon/ http://coursetag.com/udemy/coupon/50-off-fundamentals-of-business-intelligence-data-analytics-world-10/ from Course Tag https://coursetagcom.tumblr.com/post/156798412228
0 notes
Text
300+ TOP COGNOS Interview Questions and Answers
COGNOS Interview Questions for freshers experienced :-
1. What Is Model And Say About Process How To Create Model And How To Test Model? Model in PowerPlay Transformer is combination of five windows- Dimension Map,Data Source, Measures,Power cube,Sign On. Models specify the data sources and then defining the dimensions, levels,and measures.They may also contain dimension views, user classes, user class views, and other authentication-related information. The information for a model is stored in a model file, with the extension .pyi (binary, means platform depentent) or .mdl (text,platform independent). 2. How To Create Powerplay Reports? To create a powerplay report You have to create a IQD file from impromptu by saving IMR report as IQD. Use Frame Work manager model and externalise the query subject you want to use inpowerplay transformer and powerplay. When you have the IQD file published to a location such as UNC. Go to powerplay transformer model and select new. 3. What Are Products Of Cognos? Cognos 6.6 7.0,7.3(PowerPlay, Impromptu)--ReportNet1.0,1.1mr1, 1.1mr22---ReportNet 8.0(latest)IWR is use by Impromptu to publish reports, PPES is used by PP, Cognos Connection is used by repornet. there are many other tools but these are the main. 4. What Are Versions Of Cognos From Starting Release To Latest In The Market? EP7 series is the oldest cognos series. cognos report net is the next series. cognos 8.1. cognos 8.3 is the latest version. but most of the companies use cognos 8.1. 5. What Is Cognos Visualizer And Cognos Scripting? Visualizer is a representation of data cubes in a dashboard format. We can drill through to the ground level of a hierarchy as like in power play report but cannot add or remove fields dynamically. Cognos script editor : We can write cognos macros or programs in this tool and can fine tune or process some execution. 6. What Are The Limitations Of Cognos Reportnet? What Are The Enhancements In Reportnet ? ADVANTAGES: PURELY WEB BASED REPORTING TOOL WE CAN CREATE MULTILINGUAL REPORTS IT CAN SUPPORT ANY TYPE OF OPERATING SYSTEM & DATA BASE WE CAN CREATE SINGLE MODEL IN FRAMEWORK MANAGER COMING FROM MULTIPLE DATA SOURCES DISADVANTAGES: WE CAN'T DRILL DOWN AND DRILL UP IT CAN'T SUPPORT MULTI DIMENSIONAL ANALYSIS 7. What Is The Difference Between Group And Association? Group: used to eliminate duplicate values in the report. Association: used with grouped item that displays data only when the values are changed. 8. How Do Add Dynamic Titles In Pp? PowerplayFrom format menu-->title, header & footer-->title-->insert report filename. 9. What Are The Ways To Import Data Into Catalong? One through Database option other one though hotfile option. 10. How Can I Test Reports In Cagnos? In cognos report net by the validate report option a report can be tested. If there will be any error, it will specify the error, unless it will give message -'report specification is valid'. After creating the report we will connect to the oracle and we will write the sql query and we will compare this 2 reports .
COGNOS Interview Questions 11. How Can I Schedule Reports In Cognos? By using Cognos Schedular, one can schedule the running reports in Impromptu to execute and save it in desired format. By using Cognos MACRO script language the reports can be executed and distributed to recipients by using mail applications. Compiled Cognos MACROs can be scheduled using Cognos Schedular. 12. How Can Create Users And Permissions In Cognos? impromptu menu go to Catlog->User profiles -> userclass tab -> click on Creator ->you can give there folder access,table access,filters,Governor etc. you can create users by using USERS PROFILE option..and you can give permissions using ROLE IN C8 and GOVERNERS in C7. 13. What You Mean By Drill Across And What Is The Difference Between Drill Through, Drill Across? Drill across means when you are moving from one report to another report like country in one report and States in another Report whereas if the report is using same report from moving from country to state. In other when you are moving details level if you are using the different report it is drill across and if you are using same report it is drill through. 14. What Is Loop In Frameworkmanager? Loop: loop is a closed path (relation) that exists among 3 (or) more tables. For example, if we have '3' tables T1, T2, T3 then, a loop exists among these tables only when we create joins in the following fashion: Loop: T1 ---> T2 ---> T3 ---> T1 To resolve the above problem, we have to create a shortcut (or) Alias to the Table T1. No Loop: T1 ---> T2 ---> T3 ---> Alias (or) Shortcut of T1 15. How Create Measures And Demensions? You can create measure once you import all the data into the data source.You can create measures and dimensions by draging the required source from datasource into dimension map and measure tab.(need to find scope of measures for all the dimensions) 16. What Is Meant By Alternate Drill Down With Ex? Alternate drill down path means there will be two path to opt. For example- We will have time dimension now the client requirement is they want two type of year -one is calender year(Jan-Dec) and another is fiscal year(Apr-Mar). 17. What Is The Difference Between Enterprise Data Warehouse And Data Warehouse? Big Organizations have a lot of diversified sources of data.There might be a dataware house exclusively for transport and others for data related to the project the company runs.In such case the complete enterprise's(company's) data warehouse is a big combination of all and is known the Enterprise data Warehouse(EDW). 18. How You Drill From Powerplay To Impromptu? Explain All Steps? go to transformer, build the cube right click on the cube and open properties go to drill through tab and select your report from that browse option 19. Give Me Any Example Of Semi And Non Additive Measures? Semi-Additive: Semi-additive facts are facts that can be summed up for some of the dimensions in the fact table, but not the others. Non-Additive: Non-additive facts are facts that cannot be summed up for any of the dimensions present in the fact table. Current Balance and Profit Margin are the facts.Current Balance is a semi-additive fact, as it makes sense to add them up for all accounts , but it does not make sense to add them up through time (adding up all current balances for a given account for each day of the month does not give us any useful information). Profit Margin is a non-additive fact, for it does not make sense to add them up for the account level or the day level. 20. What Is Drill Down And Slicing And Dicing Whats The Difference Between Them? Drilling lets you quickly move from one level of detail to another to explore different aspects of your business. Drilling down moves you down in the hierarchy. drilling up moves you up in the hierarchy. Slicing and Dicing While you can drill to look at information at any level, you can slice and dice to select the exact information you want in your report. 21. What Is The Difference Between A Cascading Report And Drillthru Report?why Do We Go For Drill Thru Report? Cascading report works based on the condition but drill thru work based on the data item what we select as a drill thru options. 22. What Junk Dimension Contains? it contains the miscellaneous attributes such as flags , which does belong to the main dimension table. for eg; In a sales order fact table ,some informations like, web-order , whether this is a online order or not new_cust-Is this person a new customer , verification- whether the order has been verified. 23. What Is Catalog And Types Of Catalogs In Cagnos And Which Is Better? A catalog is a file containing the information (Database tables) that Impromptu users need to create reports. personal distributed shared secured shared catalog is better. 24. What Is Meant By Junk Dimension? The junk dimension is simply a structure that provides a convenient place to store the junk attributes. 25. Suppose You Run A Report With Empty Data, How Will You Inform The End User That It Has No Data While Running The Report In Report Studio? Is It Possible, If So How? Add a list footer and a text messsage inside "No Data". create a boolean conditional variable "Test" with code "rownum()>0". Select the complete list footer and set Style Variable, if Yes, Box Type of the List footer should be set to "none". 26. What Is The Difference B/w Macros And Prompt? Macro-A macro is a set of instructions that can run applications. Example : A macro can open your catalog,select a report(say for instance) convert that to another format and export it to any specified location,provided the code (Program)is such. Prompt-A prompt specifies the manner in which data in the reports are to be displayed. A Prompt can be defined at the catalog level either or during report generation. 27. What Is Cognos Powerhouse And What Is It Used For? Cognos Powerhouse is High-Productivity Application Development Solutions equips you with high-productivity development environments for creating your data-driven business solutions faster,whether for Web-based, client/server, or traditional terminal-based access. PowerHouse has gained a worldwide reputation for productivity, reliability, performance, and flexibility. 28. How Can We Create A Dynamic Column Name In Cognos? Select Properties, and then click the Headers/Footers tab. 29. What Is Transformer? Transformer is used for building the Cubes (Multidimensional Structure used for OLAP processing). 30. What Is The Difference Between Filter And Condition? A Filter is a restriction that reduces the amount of data displayed in the report. The report filters are non-reusable and specific to the report. Filters are before aggregation: non-aggregated records are filtered(details record) after aggregation: summary records are filtered 31. What Is The Security Module Used In Cognos? The security module used in cognos is cognos access manager. 32. What Is Report Burn And Where It Occurs? Report can be divided into different parts and sending these different parts of report to different users. It occurs in cognos connection. 33. What Is Difference Between Drill Through And Drill Down? drill through allows you to navigate from summary to detailed information. drill down also similar kind of thing,for example will do drill down on year if we drill down on this the next level will come means year contains quaters, it contains months, months contains weeks, week contains days. so,we can view all the levels through drill down. 34. How Can I Change Reports To Another Package? Open that report and save as that reportSelect public folder and then select package in which package you want to save then save it. 35. How To Create Users And Permissions In Cognos? Users and Permissions can be given in Access manager - Administration. Individual users can be created using their names or their Ids (if any) in Access manager and then be given permissions. This tool is loaded when you install Cognos 7.x in your machine. In content manager--we have one component called cognos process manager, using process manager we can integrate with third party tools like LDAP or NTLM. we cannot create users in reportnet directly, in real time we can use LDAP. 36. What Is The Difference Between Macros And Prompt? Macro is set of instructions to run report. Prompt is just like a dialog box which will ask user to select/enter the information what he needs. 37. What Is Query Subject? A query subject is the basic building block in Framework Manager. A query subject is a set of query items that have an inherent relationship. 38. How Can I Convert A List Report/cross Tab Report In Cognos Ep Series Into A Bar Chart Or Pie Chart Etc.? This can be done in Impromptu administrator. 39. How Many Numbers Of Cubes Can We Create On A Single Model? How Can We Navigate Between Those Cubes? Using a single Model, we can create as many number of cubes u want. By using the dimension views...etc. Regarding the navigation, when we save cubes, these act as separator multidimensional databases. There wont be any relations. Navigate means opening the cubes separately. 40. What Is The Difference Between Group And Association? In cognos 'group' is used to suppress the duplicate values. And 'associate' is used to suppress the values if it is one-one relationship. Group - It Eliminates the Duplicate values from the report break the columns. Its having a one to many relationship. Association-It Eliminates the Duplicate values from the report and its having one-one relationship. Both Group & Association eliminate duplicates in a column.But we cannot use Association alone.It should have one to one relationship with grouped column. Using Association is a performance enhancement technique. 41. What Is Cardinality? Cardinality is nothing but relation between tables like One to One, One to Many and Many to Many. Cardinality is the nature of the relationship. 1-1,1-many relationships Cardinality is the nature of the relation between two query items. 42. How To Test Reports In Cognos? Go to power play transformer, on toolbar select RUN menu. In RUN menu there is one option that is TEST BUILD you can test ur report there. You may also test the outcome of a report by writing and executing SQL Queries and comparing the outcome with the report outcome. BY USING SYSTEM TESTING, UNIT TESTING, PERFORMANCE TESTING 43. How You Create Security To Reports? Basically there are two types of securities for any object Levels of Securities: Database level Security Row level security Even we can provide LDAP security too. By setting up the Governors in frame work manager also we can give the security to the data accessing through the report. 44. How To Configure The Cognos Configurations To Work In The Windows 2000 Server Machine? Go to Cognos Configuration and see the settings for cognos in the explorer page. There you can set for win2000 server. 45. What Is Drill Down And Slicing And Dicing What Are The Difference Between Them? Drill down means it will explains the summary level information to lowest level information. Slicing is nothing but cutting edge of the cube. Dicing is nothing but viewing the cube in all possible directions. Drill down is the way to get the more detailed data. Slicing and dicing is to get the data according to where clause. 46. What Are The Enhancements In Cognos Reportnet ? Enhancements in Cognos reportnet is Cognos 1.0, 1.1, MR1.1, MR1.2 Cognos 1.0,1.1, MR1.1, MR1.2 AND cognos 8, 8.1.1MR, 8.1.2MR, now new version is 8.2 Enhancement in cognos in the list report:- Apply list column title styles Apply list column body styles Apply table styles. 47. Explain The Different Stages In Creating A Report In Cognos Report Net? Open cognos connection in that select Report studio or query studio it will displays there you have packages(ex: gosales and goretailr (defult)) click on that it display Report studio in that select Object pane select required query subject then click run it displays your report on report viewer screen. 48. What Is Report Item? Report item is nothing but a query item when it is drag and drop into the work area. Example in Go sales and retailers package-> Product (is a query subject) and-> Product line is(a query item)->when PL is dropped into the work area then its a report item. 49. How To Perform Single Sign In Cognos By Using Url? In cognos configuration under authentication allow anonymous access should be false.In cgi-bin properties (under iis) the enable anonymous access should be false. 50. How You Migrate Reports To Cognos 8 From Previous Versions? using deplyement tool we can migrate the reports. 51. What Are The Filters In Framework Manager And Report Studio? Filters in framework manager are Standalone filters Embedded filters Report studio Filters are Detail filters Summary filters 52. What Is Iqd? What Is Contained In Iqd? IQD stands for Impromptu query definition. It is report save extension with .iqd it is use for creating a cube in power play transformer. IQD is Impromptu Query Definition. It contains the SQL query to display the data from the database and SQL query is automatically created based on the columns selected from database. 53. What Are The Special Features In Cognos Reportnet? CRN is web based tool. So it will very useful to view the reports very easily. So that they preferred CRN 54. How You Create Iqd In Reportnet Framework? Open framework manager click on any query subject and go Open the properties dialogue box and you can observe the EXTERNALIZE METHOD you change the iqd. 55. How To Pass Multiple Values From Pick List Prompt To Sub Report Filter? #parameter1#. Give this in ur filter where u pass multiple values 56. What Is Stitch Query In Reportnet? Framework Manager generates a separate query for each 'FACT' table and joins the result set this is called Stitched query. 57. What Is Snapshot ? A Snapshot is a Permanent Local Copy Of the Report.Snapshot is Static Data Source it is saved with .imr File it is suitable for Disconnected Network. 58. What Is Cube Size? 2.0 GB (It depends on your project requirements). 59. What Is Log In Cognos? While creating reports or creating models logs have all information till session closed. 60. What Is Associated Grouping? And How It Works In Cognos Impromptu? You can associate one or more data items with a grouped data item. An associated data item should have a one-to-one relationship with the grouped data item. For example, Order No. is a grouped data item. For each order number, there is an Order Date. Order No. and Order Date have a one-to-one relationship. 61. What Are The Limitations Of Cognos Reportnet? In CRN we can't view multi dimensionally, we cant' see a report in excel, we can't format a report in CRN Report net does not support drill through and also bursting of reports is no possible in report net. Also it does not support dimensional analysis. 62. What Is Loop In Framework Manager ? Loop is an very dangerous exception in framework manager we can resolve the loop create alias table. loop Display wrong results in the reportnet. A Loop is a Closed Path Circuit. Avoid loops: using shortcuts.We have ambiguous relationships types:Hierarchical relationships Recursive relationships Multi-valid relationships. To avoid these relationships using shortcuts. 63. What Are The Different Ways Of Adding Data In Transformer? In transformer you import metadata from architect or catalog to create a cube. We just import metadata. we don’t add data to it. 64. What Are Slowly Changing Dimensions? Why We Are Used Scd? Slowly Changing Dimensions are those whose data are not fixed. SCD types: SCD type 1: Whose data is not fixed. Historical data are not saved. Data keeps on changing.-> Current Data SCD type 2: Whose data is fixed and also save historical data. SCD type 3: Like SCD type 2 only but save historical data in another table. In one table data will updated and in another table historical data will be there. 65. What Are The Names Of The Reports That You Prepared? List report, Cross tab report, pie charts etc. Master detail, Drill through, CascadingList, Cross tab, Chart, Map report are the basic report of Cognos. 66. What Is The Importance Of Dimension In The Cognos.? Without Dimension and Fact cannot make a relation between tables it could not be use for joins and retrieve the data as a form of reports in cognos. Dimension is a major subject area through which we can analysis our business. 67. What Is Exact Catalog Size? There is no limit for catalog size. It may be 3MB or 4MB. 68. Give Me Some Examples For Lifecycle Reporting, I Mean Which Lifecycle We Will Use For Reporting? There is no specific reporting lifecycle. We can generate the reports from data warehouse/data marts or we can generate the reports directly from OLTP systems.What I mean generating reports from OLTP systems is loading data into one system and generating the reports. But this is not recommended. This will depends on the business. 1) Generating reports from the OLAP interface system retrieving the data from the data warehouse to generating forms and reports. 2) They can use Business intelligence project life cycle. 69. How Many Levels Can Be Used In Drill Through Reports? Two levels: summary and detail. 70. Where You Can Save The Report Net Documentation In Our Local System? Whenever we Install Cognos Server we get it documentation alone otherwise we can store it in Deployment Folder of Cognos. 71. What Is Model And How To Create Model And How To Test Model? Create a Model In framework Manager and then publish it, make sure that Model meets the requirements, don't overload the model with unwanted query items. Model is nothing but a project. And the project will be created by framework manager in which we can extract the metadata from the various operational sources... we will test the model by using the TOAD. Model contains meta data which is related to business reporting Information. We import this data from the data base. Using this metadata we are creating Packages, Publishing packages, This metadata contains query subjects and namespaces. 72. How To Create Cubes In Transformer ? Cube can be created by using different data sources. Data sources: iqd, tex file Cube creation by using iqd data sources. These iqd sources are coming from impromptu. 73. How You Burst The Reports? If You Are Bursted Reports Is Not Reached To Destination How You Identify?when You Import Data Into Catalog You Have Complex Columns Names. How Do You Change The Name Of Those Columns? First you have to click on column header which you have to change. Then in property pane you will found name option. Then change column name. 74. How To Create Measures And Dimensions? By using the Framework Manager we can create measures and dimensions according to our business needs. 75. What Is The Difference Between List Report And Crosstab Report? List report show data in rows and columns versus a cross tab report show data in grids with dimension in rows and columns and measures in cells or in the intersection points. List Report and Cross tab report both contains rows and columns but difference is list report is a detail report for analysis whereas cross tab report is the intersection of data or summarized data for analysis. 76. What Are The Migration Tools Available In The Market With Respect To Cognos Such As Impromptu Reports Are Migrated To Cognos Reportnet? Reportnet1.1 migration tool is available to migrate the impromptu report to reportnet. 77. Can Reportnet Supports Cubes? No 78. How You Provide Security To Reports? How You Provide Security To Packages ? For framework manager packages, before you head away with creating a new package it asks whether or not you want to have security set up for the package in use. Even if you haven't defined any security at this stage, you can always goto the actions >> package >> Edit package settings and there you can define the security settings for the package. For reports, you can either give report-level or folder- level security by going to the properties of that and choosing suitable roles/user groups that are supposed to be entitled to view/modify/create reports. COGNOS Questions and Answers Pdf Download Read the full article
0 notes
Text
300+ TOP CASSANDRA Interview Questions and Answers
CASSANDRA Interview Questions for freshers experienced :-
1. What is Cassandra? Cassandra is an open source data storage system developed at Facebook for inbox search and designed for storing and managing large amounts of data across commodity servers. It can server as both Real time data store system for online applications Also as a read intensive database for business intelligence system OR Apache Cassandra is an open source, distributed and decentralized/distributed storage system (database), for managing very large amounts of structured data spread out across the world. It provides highly available service with no single point of failure.It was developed at Facebook for inbox search and it was open-sourced by Facebook in July 2008. 2. What was the design goal of Cassandra? The design goal of Cassandra is to handle big data workloads across multiple nodes without any single point of failure. 3. What is NoSQLDatabase? NoSQL database (sometimes called as Not Only SQL) is a database that provides a mechanism to store and retrieve data other than the tabular relations used in relational databases. These databases are schema-free, support easy replication, have simple API, eventually consistent, and can handle huge amounts of data. 4. Cassandra is written in which language? Java 5. How many types of NoSQL databases? Document Stores (MongoDB, Couchbase) Key-Value Stores (Redis, Volgemort) Column Stores (Cassandra) Graph Stores (Neo4j, Giraph) 6. What do you understand by composite type? Composite Type is a cool feature of Hector and Cassandra.It allow to define a key or a column name with a concatenation of data of different type.With CassanraUnit, you can use CompositeType in 2 places : row key column name 7. What is mandatory while creating a table in Cassandra? While creating a table primary key is mandatory, it is made up of one or more columns of a table. 8. What is the relationship between Apache Hadoop, HBase, Hive and Cassandra? Apache Hadoop, File Storage, Grid Compute processing via Map Reduce. Apache Hive, SQL like interface ontop of hadoop. Apache Hbase, Column Family Storage built like BigTable Apache Cassandra, Column Family Storage build like BigTable with Dynamo topology and consistency. 9. List out some key features of Apache Cassandra? It is scalable, fault-tolerant, and consistent. It is a column-oriented database. Its distribution design is based on Amazon’s Dynamo and its data model on Google’s Bigtable. Created at Facebook, it differs sharply from relational database management systems. Cassandra implements a Dynamo-style replication model with no single point of failure, but adds a more powerful “column family” data model. Cassandra is being used by some of the biggest companies such as Facebook, Twitter, Cisco, Rackspace, ebay, Twitter, Netflix, and more. 10. What do you understand by Data Replication in Cassandra? Database replication is the frequent electronic copying data from a database in one computer or server to a database in another so that all users share the same level of information. Cassandra stores replicas on multiple nodes to ensure reliability and fault tolerance. A replication strategy determines the nodes where replicas are placed. The total number of replicas across the cluster is referred to as the replication factor. A replication factor of 1 means that there is only one copy of each row on one node. A replication factor of 2 means two copies of each row, where each copy is on a different node. All replicas are equally important; there is no primary or master replica. As a general rule, the replication factor should not exceed the number of nodes in the cluster. However, you can increase the replication factor and then add the desired number of nodes later.
CASSANDRA Interview Questions 11. What do you understand by Node in Cassandra? Node is the place where data is stored. 12. What do you understand by Data center in Cassandra? Data center is a collection of related nodes. 13. What do you understand by Cluster in Cassandra? Cluster is a component that contains one or more data centers. 14. What do you understand by Commit log in Cassandra? Commit log is a crash-recovery mechanism in Cassandra. Every write operation is written to the commit log. 15. What do you understand by Mem-table in Cassandra? Mem-table is a memory-resident data structure. After commit log, the data will be written to the mem-table. Sometimes, for a single-column family, there will be multiple mem-tables. 16. What do you understand by SSTabl in Cassandra? SSTable is a disk file to which the data is flushed from the mem-table when its contents reach a threshold value. 17. What do you understand by Bloom filter in Cassandra? Bloom filter are nothing but quick, nondeterministic, algorithms for testing whether an element is a member of a set. It is a special kind of cache. Bloom filters are accessed after every query. 18. What do you understand by CQL? User can access Cassandra through its nodes using Cassandra Query Language (CQL. CQL treats the database (Keyspace) as a container of tables. Programmers use cqlsh: a prompt to work with CQL or separate application language drivers. 19. What do you understand by Column Family? Column family is a container for an ordered collection of rows. Each row, in turn, is an ordered collection of columns. 20. What is the use of "void close()" method? This method is used to close the current session instance. 21. What is the use of "ResultSet execute(Statement statement)" method? This method is used to execute a query. It requires a statement object. 22. Which command is used to start the cqlsh prompt? Cqlsh 23. What is the use of "cqlsh --version" command? This command will provides the version of the cqlsh you are using. 24. What are the collection data types provided by CQL? List : A list is a collection of one or more ordered elements. Map : A map is a collection of key-value pairs. Set : A set is a collection of one or more elements. 25. What is Cassandra database used for? Apache Cassandra is a second-generation distributed database originally open-sourced by Facebook. Its write-optimized shared-nothing architecture results in excellent performance and scalability. The Cassandra storage cluster and S3 archival layer are designed to expand horizontally to any arbitrary size with linear cost.Cassandra’s memory footprint is more dependent on the number of column families than on the size of the data set. Cassandra scales pretty well horizontally for storage and IO, but not for memory footprint, which is tied to your schema and your cache settings regardless of the size of your cluster. some of the important link about casandara is available-here. 26. What is the syntax to create keyspace in Cassandra? Syntax for creating keyspace in Cassandra is CREATE KEYSPACE WITH 27. What is a keyspace in Cassandra? In Cassandra, a keyspace is a namespace that determines data replication on nodes. A cluster consist of one keyspace per node. 28. What is cqlsh? cqlsh is a Python-based command-line client for cassandra. 29. Does Cassandra works on Windows? Yes, Cassandra works pretty well on windows. Right now we have linux and windows compatible versions available. 30. What do you understand by Consistency in Cassandra? Consistency means to synchronize and how up-to-date a row of Cassandra data is on all of its replicas. 31. Explain Zero Consistency? In this write operations will be handled in the background, asynchronously. It is the fastest way to write data, and the one that is used to offer the least confidence that operations will succeed. 32. What do you understand by Thrift? Thrift is the name of the RPC client used to communicate with the Cassandra server. 33. What do you understand by Kundera? Kundera is an object-relational mapping (ORM) implementation for Cassandra written using Java annotations. 34. JMX stands for? Java Management Extension 35. How does Cassandra write? Cassandra performs the write function by applying two commits-first it writes to a commit log on disk and then commits to an in-memory structured known as memtable. Once the two commits are successful, the write is achieved. Writes are written in the table structure as SSTable (sorted string table). Cassandra offers speedier write performance. 36. When to use Cassandra? Being a part of NoSQL family Cassandra offers solution for problem where your requirement is to have very heavy write system and you want to have quite responsive reporting system on top of that stored data. Consider use case of Web analytic where log data is stored for each request and you want to built analytical platform around it to count hits by hour, by browser, by IP, etc in real time manner. 37. When should you not use Cassandra? OR When to use RDBMS instead of Cassandra? Cassandra is based on NoSQL database and does not provide ACID and relational data property. If you have strong requirement of ACID property (for example Financial data), Cassandra would not be a fit in that case. Obviously, you can make work out of it, however you will end up writing lots of application code to handle ACID property and will loose on time to market badly. Also managing that kind of system with Cassandra would be complex and tedious for you. 38. What are secondary indexes? Secondary indexes are indexes built over column values. In other words, let’s say you have a user table, which contains a user’s email. The primary index would be the user ID, so if you wanted to access a particular user’s email, you could look them up by their ID. However, to solve the inverse query given an email, fetch the user ID requires a secondary index. 39. When to use secondary indexes? You want to query on a column that isn't the primary key and isn't part of a composite key. The column you want to be querying on has few unique values (what I mean by this is, say you have a column Town, that is a good choice for secondary indexing because lots of people will be form the same town, date of birth however will not be such a good choice). 40. When to avoid secondary indexes? Try not using secondary indexes on columns contain a high count of unique values and that will produce few results. 41. I have a row or key cache hit rate of 0.XX123456789 reported by JMX. Is that XX% or 0.XX% ? XX% 42. What happens to existing data in my cluster when I add new nodes? When a new nodes joins a cluster, it will automatically contact the other nodes in the cluster and copy the right data to itself. 43. What are "Seed Nodes" in Cassandra? A seed node in Cassandra is a node that is contacted by other nodes when they first start up and join the cluster. A cluster can have multiple seed nodes. Seed node helps the process of bootstrapping for a new node joining a cluster. Its recommended to use the 2 seed node per data center. 44. What are "Coordinator Nodes" in Cassandra? Coordinator Nodes: Its a node which receive the request from client and send the request to the actual node depending upon the token. So all the nodes acts as coordinator node,because every node can receive a request and proxy that request. 45. What are the befefits of NoSQL over relational database? NoSQL overcome the weaknesses that the relational data model does not address well, which are as follows: Huge volume of sructured, semi-structured, and unstructured data Flexible data model(schema) that is easy to change Scalability and performance for web-scale applications Lower cost Impedance mismatch between the relational data model and object-oriented programming Built-in replication Support for agile software development 46. What ports does Cassandra use? By default, Cassandra uses 7000 for cluster communication, 9160 for clients (Thrift), and 8080 for JMX. These are all editable in the configuration file or bin/cassandra.in.sh (for JVM options. All ports are TCP. 47. What do you understand by High availability? A high availability system is the one that is ready to serve any request at any time. High avaliability is usually achieved by adding redundancies. So, if one part fails, the other part of the system can serve the request. To a client, it seems as if everything worked fine. 48. How Cassandra provide High availability feature? Cassandra is a robust software. Nodes joining and leaving are automatically taken care of. With proper settings, Cassandra can be made failure resistant. That means that if some of the servers fail, the data loss will be zero. So, you can just deploy Cassandra over cheap commodity hardware or a cloud environment, where hardware or infrastructure failures may occur. 49. Who uses Cassandra? Cassandra is in wide use around the world, and usage is growing all the time. Companies like Netflix, eBay, Twitter, Reddit, and Ooyala all use Cassandra to power pieces of their architecture, and it is critical to the day-to-da operations of those organizations. to date, the largest publicly known Cassandra cluster by machine count has over 300TB of data spanning 400 machines. Because of Cassandra's ability to handle high-volume data, it works well for a myriad of applications. This means that it's well suited to handling projects from the high-speed world of advertising technology in real time to the high-volume world of big-data analytics and everything in between. It is important to know your use case before moving forward to ensure things like proper deployment and good schema design. 50. When to use secondary indexes? You want to query on a column that isn't the primary key and isn't part of a composite key. The column you want to be querying on has few unique values (what I mean by this is, say you have a column Town, that is a good choice for secondary indexing because lots of people will be form the same town, date of birth however will not be such a good choice). 51. When to avoid secondary indexes? Try not using secondary indexes on columns contain a high count of unique values and that will produce few results. 52. What do you understand by Snitches? A snitch determines which data centers and racks nodes belong to. They inform Cassandra about the network topology so that requests are routed efficiently and allows Cassandra to distribute replicas by grouping machines into data centers and racks. Specifically, the replication strategy places the replicas based on the information provided by the new snitch. All nodes must return to the same rack and data center. Cassandra does its best not to have more than one replica on the same rack. 53. What is Hector? Hector is an open source project written in Java using the MIT license. It was one of the early Cassandra clients and is used in production at Outbrain. It wraps Thrift and offers JMX, connection pooling, and failover. 54. What do you understand by NoSQL CAP theorem? Consistency: means that data is the same across the cluster, so you can read or write to/from any node and get the same data. Availability: means the ability to access the cluster even if a node in the cluster goes down. Partition: Tolerance means that the cluster continues to function even if there is a "partition" (communications break) between two nodes (both nodes are up, but can't communicate). In order to get both availability and partition tolerance, you have to give up consistency. Consider if you have two nodes, X and Y, in a master-master setup. Now, there is a break between network comms in X and Y, so they can't synch updates. At this point you can either: A) Allow the nodes to get out of sync (giving up consistency), or B) Consider the cluster to be "down" (giving up availability) All the combinations available are: CA - data is consistent between all nodes - as long as all nodes are online - and you can read/write from any node and be sure that the data is the same, but if you ever develop a partition between nodes, the data will be out of sync (and won't re-sync once the partition is resolved). CP - data is consistent between all nodes, and maintains partition tolerance (preventing data desync) by becoming unavailable when a node goes down. AP - nodes remain online even if they can't communicate with each other and will resync data once the partition is resolved, but you aren't guaranteed that all nodes will have the same data (either during or after the partition) 55. What is Keyspace in Cassandra? Before doing any work with the tables in Cassandra, we have to create a container for them, otherwise known as a keyspace. One of the main uses for keyspaces is defining a replication mechanism for a group of tables. Example: CREATE KEYSPACE used_cars WITH replication = { 'class': 'SimpleStrategy', 'replication_factor' : 1}; 56. Explain Cassandra data model? The Cassandra data model has 4 main concepts which are cluster, keyspace, column, column family. Clusters contain many nodes(machines) and can contain multiple keyspaces. A keyspace is a namespace to group multiple column families, typically one per application. A column contains a name, value and timestamp. A column family contains multiple columns referenced by a row keys. 57. Can you add or remove Column Families in a working Cluster? Yes, but keeping in mind the following processes. Do not forget to clear the commitlog with ‘nodetool drain’ Turn off Cassandra to check that there is no data left in commitlog Delete the sstable files for the removed CFs 58. What is Replication Factor in Cassandra? ReplicationFactor is the measure of number of data copies existing. It is important to increase the replication factor to log into the cluster. 59. Can we change Replication Factor on a live cluster? Yes, but it will require running repair to alter the replica count of existing data. 60. How to iterate all rows in ColumnFamily? Using get_range_slices. You can start iteration with the empty string and after each iteration, the last key read serves as the start key for next iteration. CASSANDRA Questions and Answers pdf Download Read the full article
0 notes
Text
300+ TOP OBIEE Interview Questions and Answers
OBIEE Interview Questions for freshers experienced :-
1.What Is Obiee? Obiee stands for Oracle Business Intelligence Enterprise Edition (OBIEE). It is a business intelligence system for the enterprise that delivers abilities for reporting, online analytical processing (OLAP), ad hoc query and analysis dashboards, and scorecards.Now a days OBIEE reporting is used for mobility and mobility reporting. 2.Explain OBIEE Architecture? Presentation server : The presentation server is responsible to convert the request comes from end user to logical SQL and sends it to Oracle BI Server. Oracle BI server : The main task of Oracle BI server is converting the logical sql to Physical sql so that database will understand the sql. Oracle BI server converts the logical sql to physical sql so that database syntactically identifies the query and send it to the Database. Database : Database fetches the required result. 3. How will you do sort in Reports in OBIEE. Click modify and then click sort (order by icon) on the relevant column in the criteria pane. 4. How will you do different types of narrative Reports in OBIEE? By clicking modify request and Narrative View and by giving @1 for the first column result and @2 for the 2nd column and son on and we can also give a heading for No Results by clicking the Narrative view. 5. How will you create Interactive Dashboards ? By clicking Administration and Manage dashboards and by adding column selector also by using view selector etc.and also by using prompts. 6. What is write-back in obiee ? In Reports of Answer you can give a column as updatebale and then view the reports,this option is called write back. 7. How will you execute Direct SQL in OBIEE? By clicking Direct Database Request below the subject area in Answers. 8. How will you create report from two subject areas From the Criteria Pane of the Report Created from First Subject Area come to the bottom of the page and click combine request. but the options are limited for combining like union etc.. 9. How will you Port changes for dashboards,reports,rpd from development to production? For the RPD use the Merge option in Admin Tool and for dashboards and reports use Content Accelerator Framework. 10. What are the different types of variables in OBIEE? explain There are two types of variables in obiee: a.Repository variable. This variable is for the whole repository. b.session variable . session variable can be system variable and non system variable. system variable uses NQ_SESSION. —- (system reserved variable). examples of non system variables are user defined filters etc. 11. How will you enable or disable caching in the system level and table level? In the NQSConfig.ini file use ENABLE under CACHE Section for System Level for tables .If you want to enable the cache at table level , open the repository in offline mode(not the current working repository). this should be different from the current repository and click enable or disable the cache . right click the table and properties and click the cache or deselect the cache. then merge the repository with the current working one. 12. How will you go about adding additional column to the repository in the presentation layer? Check whether the table is already existing if so add in physical layer ,then BMM and then Presentation layer, then reload server metadata ,then it will be visible to all users. 13. How will effect the changes for a report , if for certain users only the column heading in the report should be changed? Using session variables for that user. 14. What is a table alias in obiee? and where and how will you create it. Table alias can be created by right clicking the table in the physical layer then click alias, Table alias is mostly used for creating self joins. 15. Have you created Hierarchy in OBIEE. if so where and how? Yes we can create hierarchy in BMM Layer of OBIEE in dimensions for the dimension tables. this can be done by right clicking the dimension table and click create dimension and then we should manually define the hierarchy and its levels. 16. What is Level Based Metrics.How will you create it? Leval-base matrics means, having a measure pinned at a certain level of the dimension. Monthly Total Sales or Quarterly Sales are the examples. To create a level based measure, create a new logical column based on the original measure (like Sales in the example above). Drag and drop the new logical column to the appropriate level in the Dimension hierarchy (in the above example you will drag and drop it to Month in Time Dimension. 17. What are the different layers of OBIEE Repository? Physical Layer, Business Model and Mapping Layer, and Presentation Layer. 18. What is Authentication? How many types of authentication. Authentication is the process by which a system verifies, through the use of a user Operaing system autentication External table authentication Database authentication LDAP authentication 19. What are the different types of security you have worked in OBIEE? Object Level and data level. 20. What is a bridge table ? If you want to connect two tables where there is no relation ship you can use a thrid bridge table for connecting them which will have common columns in both tables, this is used in BMM Layer. 21. You come Morning to the office. and all the users are complaining about the obiee reporting system is very slow(dasboards). what is your approach to resolve the issue? Mainly check NQServer.log and in Admin Tool -> click Session Manager. check if there is any bottleneck and accordingly resolve. 22. What are the different log files in OBIEE? Following are the different log files inOBIEE NQServer.log , NQQuery.log, NQSAdminTool.log etc. There are mainly 25 Configuration files available in obiee. 23. What are the Key Configuration Files in OBIEE? NQSConfig.ini, NQSCluster.ini, odbc.ini, instanceconfig.xml 24. How will change port of obiee answers? By changing its port in instanceconfig.xml 25. What is ClusterServices in OBIEE ? Why its used? To Scale up the Performance by clustering and distributing the services across multiple Servers ClusterServices are used. 26. Can you change the location of the OBIEE Repository? No, Not in Standalone OBIEE Install. you can give shared location in NQSCluster.ini if the OBIEE is clustered. 27. How many BI Services Node can be clustered together? 16 28. Can you run multiple rpd’s in a Single OBIEE Instance? No . you should create another instance in the same server , then its possible. 29. How will you implement security in obiee,so that subjects areas accessed by one group are not accessed by another group. Using Security Manger in OBIEE Admin Tool, by creating two user groups and allowing only part of the subject area for one group and other part to the other user group. 30. What is SCD type 1 and 2. SCD is slow changing dimensions. in type 1 we replace the changed dimension with old dimension. in type 2 we use surrogate keys and keep both the records(rows). Advantage of SCD2 is we keep history of the old dimensiont. 31. What is Star Schema and SnowFlake Schema? Star is one fact and many dimensions where as Snowflake design is one fact and many dim’s and dim’s can have additonal dim relations. 32. How to startup OBIEE in Linux? ./run-sa.sh start #starts obi server ./run-saw.sh start #start saw server ./run-sch.sh start #start scheduler 33. How you generally Approach to ur Analytics Project? Any project should start from defining the scope of the project and the approach should be not to deviate from the scope. Then the project should be functionally divided into smaller modules generally done by project managers along with technical and functional leads. The functional leads then decide on majorly three things: According to the defined scope of the project they start gathering requirements while interacting with the clients. They had a discussion with the technical leads and try to reach a solution. Technical leads decides what schemas to create and what requirements are going to fulfill by that schema. Technical leads discuss all this with the developers and try to close requirements. Simultaneously testing and deployment is planned in a phased manner. 34. How Will You Enable Or Disable Caching In The System Level And Table Level? In the NQSConfig.ini file use ENABLE under CACHE Section for System Level For tables, if we want to enable the cache at table level , open the repository in offline mode This should be different from the current repository and click enable or disable the cache 35. What are the Challenges You Faced while making of Reports? Making of an report has never been a difficult task. But problem comes when users are reluctant to adopt a new system. Many developers have experienced that if they are not able to create the report in exactly the way they used to see, they will keep asking for the changes. Developers approach should be to first show them what they want to see and then add more information in the report. 36. What you will do when your Report is not Fetching Right Data? This is the biggest problem in report creation and verification. There could be two reasons for report not fetching the right data. Mostly clients do not have correct data in their database and on top of that to correct the results they make some changes at the report level to bring the desired result which you may not e aware of while creating the reports. Clients try to match the data with their existing reports and you never get the correct results. you try to discover the things and at later stage come to know of all these problems and you are held responsible for this delay. Hence always consult the SPOC(Single Point of Contact) and try to understand the logic they have used to generate their reports. If the database values are correct, there there could be a problem with the joins and relations in the schema. You need to discover that analyzing and digging deep into the matter. 37. How analytics Process Your Request When you Create your Requests? Oracle BI server converts the logical SQL submitted by the client into optimised physical SQL which is then sent to the backend database. Also in between it performs various tasks like converting the user operations like user selections to form a logical SQL, checking and verifying credentials, breaking the request into threads(as Oracle BI is a multi threaded server), processes the requests, manages the cached results, again converting the results received from the database into user presentable form etc. 38. From where u Get the Logical Query of your Request? The logical SQL generated by the server can be viewed in BI Answers. 39. What are the Major Challenges You Faced While Creating the RPD? Every now and then there are problems with the database connections but the problem while creating the repository RPD files comes with complex schemas made on OLTP systems consisting of lot of joins and checking the results. Th type of join made need to be checked. By default it is inner join but sometimes the requirement demands other types of joins. There are lot of problems with the date formats also. 40. What are Global Filter and how thery differ From Column Filter? Column filter- Simply a filter applied on a column which we can use to restrict our column values while pulling the data or in charts to see the related content. Global filter- This filter will have impact on across the application. 41. How to hide Certain Columns From a User? We can hide a column in Application access level security Do not add the column in the report Do not add the column in the presentation layer. 42. How to Enable Drills in a Given Column Data? To enable Drill down for a column, it should be included in the hierarchy in OBIEE. Hyperion IR has a drill anywhere feature where dont have to define and can drill to any available column. 43. How the users Created Differs From RPD/Answers/Dashboards Level? RPD users can do administrator tasks like adding new data source, create hirarchies, change column names where as Answers users may create new charts, edit those charts and Dashboard users may only view and analyse the dashboard or can edit dashboard by adding/removing charts objects. 44. How Online/Offline mode impact Development and Delpoyment? Online Mode- You can make changes in the RPD file and push in changes which will be immediately visible to the users who are already connected. This feature we may use in production environment. Offline mode- can be useful in test or development environment. 45. What is Guided Navigation? Guided navigation is a feature of Oracle BI Interactive Dashboards that enables the content and layout of Oracle BI Interactive Dashboard to change dynamically based on changes in the information being analyzed. Specifically, sections in a dashboard page can be set up and only appear when there is “interesting” information in the data. 46. What are the different types of variables in OBIEE 11g? explain There are two types of variables in OBIEE 11g. Repository variable : This variable is used for the whole repository. Session variable : session variable are of two types: system variable and non system variable. System variable uses NQ_SESSION. Examples of non system variables are user defined filters. 47. What is a table alias in OBIEE 11g? Where and how will you create it? Table alias is used for creating self joins. Table alias can be created by right clicking the table in the physical layer then click alias. 48. What is Authentication? How many types of authentication. Authentication is the procedure by which a system confirms, through the use of a user Operaing system authentication. xternal table authentication. Database authentication. LDAP authentication 49. What are different types of joins that are possible in OBIEE RPD? Complex Join – it uses multiple conditions, such as A.ROW_WID = B.ROW_WID AND / OR A.A_WID = B.B_WID Natural Join 50. What are the Key Configuration Files in OBIEE? The key configuration files in OBIEE 11g are NQSConfig.ini, NQSCluster.ini, odbc.ini, instanceconfig.xml 51. Explain the Architecture of OBIEE and what each components do? OBIEE Consists of Presentation Services(Web),Oracle BI (Analytics Engine) Server The client (Answers) constructs sql and passes it to the Analytic Engine and then the Oracle BI (Analytic Engine) parses the physical sql to the Datasources and retrieve the data back to the Engine and presents to the presentation Services . 52. How to get sql from obiee for reports? There are many ways to get the sql Modify the request and click Advanced in that you get xml code and also the actual sql. In the catalog Manager click Tools –>Create Report .In the Create Report Window –> Click Request SQL and save the sql to the physical path in your PC. Enable Loglevel to 2 in the OBIEE Admin Tool from Mange-> Security and enable the log level to 2 by clicking properties for the user, then go to the NQQuery.log in BI_HOME/OracleBI/Server/Logs.You will find the SQL for that User. By clcking Administration->Manage sessions-> view sql. OBIEE Questions and Answers Pdf Download Read the full article
0 notes
Text
300+ TOP CASSANDRA Interview Questions and Answers
CASSANDRA Interview Questions for freshers experienced :-
1. What is Cassandra? Cassandra is an open source data storage system developed at Facebook for inbox search and designed for storing and managing large amounts of data across commodity servers. It can server as both Real time data store system for online applications Also as a read intensive database for business intelligence system OR Apache Cassandra is an open source, distributed and decentralized/distributed storage system (database), for managing very large amounts of structured data spread out across the world. It provides highly available service with no single point of failure.It was developed at Facebook for inbox search and it was open-sourced by Facebook in July 2008. 2. What was the design goal of Cassandra? The design goal of Cassandra is to handle big data workloads across multiple nodes without any single point of failure. 3. What is NoSQLDatabase? NoSQL database (sometimes called as Not Only SQL) is a database that provides a mechanism to store and retrieve data other than the tabular relations used in relational databases. These databases are schema-free, support easy replication, have simple API, eventually consistent, and can handle huge amounts of data. 4. Cassandra is written in which language? Java 5. How many types of NoSQL databases? Document Stores (MongoDB, Couchbase) Key-Value Stores (Redis, Volgemort) Column Stores (Cassandra) Graph Stores (Neo4j, Giraph) 6. What do you understand by composite type? Composite Type is a cool feature of Hector and Cassandra.It allow to define a key or a column name with a concatenation of data of different type.With CassanraUnit, you can use CompositeType in 2 places : row key column name 7. What is mandatory while creating a table in Cassandra? While creating a table primary key is mandatory, it is made up of one or more columns of a table. 8. What is the relationship between Apache Hadoop, HBase, Hive and Cassandra? Apache Hadoop, File Storage, Grid Compute processing via Map Reduce. Apache Hive, SQL like interface ontop of hadoop. Apache Hbase, Column Family Storage built like BigTable Apache Cassandra, Column Family Storage build like BigTable with Dynamo topology and consistency. 9. List out some key features of Apache Cassandra? It is scalable, fault-tolerant, and consistent. It is a column-oriented database. Its distribution design is based on Amazon’s Dynamo and its data model on Google’s Bigtable. Created at Facebook, it differs sharply from relational database management systems. Cassandra implements a Dynamo-style replication model with no single point of failure, but adds a more powerful “column family” data model. Cassandra is being used by some of the biggest companies such as Facebook, Twitter, Cisco, Rackspace, ebay, Twitter, Netflix, and more. 10. What do you understand by Data Replication in Cassandra? Database replication is the frequent electronic copying data from a database in one computer or server to a database in another so that all users share the same level of information. Cassandra stores replicas on multiple nodes to ensure reliability and fault tolerance. A replication strategy determines the nodes where replicas are placed. The total number of replicas across the cluster is referred to as the replication factor. A replication factor of 1 means that there is only one copy of each row on one node. A replication factor of 2 means two copies of each row, where each copy is on a different node. All replicas are equally important; there is no primary or master replica. As a general rule, the replication factor should not exceed the number of nodes in the cluster. However, you can increase the replication factor and then add the desired number of nodes later.
CASSANDRA Interview Questions 11. What do you understand by Node in Cassandra? Node is the place where data is stored. 12. What do you understand by Data center in Cassandra? Data center is a collection of related nodes. 13. What do you understand by Cluster in Cassandra? Cluster is a component that contains one or more data centers. 14. What do you understand by Commit log in Cassandra? Commit log is a crash-recovery mechanism in Cassandra. Every write operation is written to the commit log. 15. What do you understand by Mem-table in Cassandra? Mem-table is a memory-resident data structure. After commit log, the data will be written to the mem-table. Sometimes, for a single-column family, there will be multiple mem-tables. 16. What do you understand by SSTabl in Cassandra? SSTable is a disk file to which the data is flushed from the mem-table when its contents reach a threshold value. 17. What do you understand by Bloom filter in Cassandra? Bloom filter are nothing but quick, nondeterministic, algorithms for testing whether an element is a member of a set. It is a special kind of cache. Bloom filters are accessed after every query. 18. What do you understand by CQL? User can access Cassandra through its nodes using Cassandra Query Language (CQL. CQL treats the database (Keyspace) as a container of tables. Programmers use cqlsh: a prompt to work with CQL or separate application language drivers. 19. What do you understand by Column Family? Column family is a container for an ordered collection of rows. Each row, in turn, is an ordered collection of columns. 20. What is the use of "void close()" method? This method is used to close the current session instance. 21. What is the use of "ResultSet execute(Statement statement)" method? This method is used to execute a query. It requires a statement object. 22. Which command is used to start the cqlsh prompt? Cqlsh 23. What is the use of "cqlsh --version" command? This command will provides the version of the cqlsh you are using. 24. What are the collection data types provided by CQL? List : A list is a collection of one or more ordered elements. Map : A map is a collection of key-value pairs. Set : A set is a collection of one or more elements. 25. What is Cassandra database used for? Apache Cassandra is a second-generation distributed database originally open-sourced by Facebook. Its write-optimized shared-nothing architecture results in excellent performance and scalability. The Cassandra storage cluster and S3 archival layer are designed to expand horizontally to any arbitrary size with linear cost.Cassandra’s memory footprint is more dependent on the number of column families than on the size of the data set. Cassandra scales pretty well horizontally for storage and IO, but not for memory footprint, which is tied to your schema and your cache settings regardless of the size of your cluster. some of the important link about casandara is available-here. 26. What is the syntax to create keyspace in Cassandra? Syntax for creating keyspace in Cassandra is CREATE KEYSPACE WITH 27. What is a keyspace in Cassandra? In Cassandra, a keyspace is a namespace that determines data replication on nodes. A cluster consist of one keyspace per node. 28. What is cqlsh? cqlsh is a Python-based command-line client for cassandra. 29. Does Cassandra works on Windows? Yes, Cassandra works pretty well on windows. Right now we have linux and windows compatible versions available. 30. What do you understand by Consistency in Cassandra? Consistency means to synchronize and how up-to-date a row of Cassandra data is on all of its replicas. 31. Explain Zero Consistency? In this write operations will be handled in the background, asynchronously. It is the fastest way to write data, and the one that is used to offer the least confidence that operations will succeed. 32. What do you understand by Thrift? Thrift is the name of the RPC client used to communicate with the Cassandra server. 33. What do you understand by Kundera? Kundera is an object-relational mapping (ORM) implementation for Cassandra written using Java annotations. 34. JMX stands for? Java Management Extension 35. How does Cassandra write? Cassandra performs the write function by applying two commits-first it writes to a commit log on disk and then commits to an in-memory structured known as memtable. Once the two commits are successful, the write is achieved. Writes are written in the table structure as SSTable (sorted string table). Cassandra offers speedier write performance. 36. When to use Cassandra? Being a part of NoSQL family Cassandra offers solution for problem where your requirement is to have very heavy write system and you want to have quite responsive reporting system on top of that stored data. Consider use case of Web analytic where log data is stored for each request and you want to built analytical platform around it to count hits by hour, by browser, by IP, etc in real time manner. 37. When should you not use Cassandra? OR When to use RDBMS instead of Cassandra? Cassandra is based on NoSQL database and does not provide ACID and relational data property. If you have strong requirement of ACID property (for example Financial data), Cassandra would not be a fit in that case. Obviously, you can make work out of it, however you will end up writing lots of application code to handle ACID property and will loose on time to market badly. Also managing that kind of system with Cassandra would be complex and tedious for you. 38. What are secondary indexes? Secondary indexes are indexes built over column values. In other words, let’s say you have a user table, which contains a user’s email. The primary index would be the user ID, so if you wanted to access a particular user’s email, you could look them up by their ID. However, to solve the inverse query given an email, fetch the user ID requires a secondary index. 39. When to use secondary indexes? You want to query on a column that isn't the primary key and isn't part of a composite key. The column you want to be querying on has few unique values (what I mean by this is, say you have a column Town, that is a good choice for secondary indexing because lots of people will be form the same town, date of birth however will not be such a good choice). 40. When to avoid secondary indexes? Try not using secondary indexes on columns contain a high count of unique values and that will produce few results. 41. I have a row or key cache hit rate of 0.XX123456789 reported by JMX. Is that XX% or 0.XX% ? XX% 42. What happens to existing data in my cluster when I add new nodes? When a new nodes joins a cluster, it will automatically contact the other nodes in the cluster and copy the right data to itself. 43. What are "Seed Nodes" in Cassandra? A seed node in Cassandra is a node that is contacted by other nodes when they first start up and join the cluster. A cluster can have multiple seed nodes. Seed node helps the process of bootstrapping for a new node joining a cluster. Its recommended to use the 2 seed node per data center. 44. What are "Coordinator Nodes" in Cassandra? Coordinator Nodes: Its a node which receive the request from client and send the request to the actual node depending upon the token. So all the nodes acts as coordinator node,because every node can receive a request and proxy that request. 45. What are the befefits of NoSQL over relational database? NoSQL overcome the weaknesses that the relational data model does not address well, which are as follows: Huge volume of sructured, semi-structured, and unstructured data Flexible data model(schema) that is easy to change Scalability and performance for web-scale applications Lower cost Impedance mismatch between the relational data model and object-oriented programming Built-in replication Support for agile software development 46. What ports does Cassandra use? By default, Cassandra uses 7000 for cluster communication, 9160 for clients (Thrift), and 8080 for JMX. These are all editable in the configuration file or bin/cassandra.in.sh (for JVM options. All ports are TCP. 47. What do you understand by High availability? A high availability system is the one that is ready to serve any request at any time. High avaliability is usually achieved by adding redundancies. So, if one part fails, the other part of the system can serve the request. To a client, it seems as if everything worked fine. 48. How Cassandra provide High availability feature? Cassandra is a robust software. Nodes joining and leaving are automatically taken care of. With proper settings, Cassandra can be made failure resistant. That means that if some of the servers fail, the data loss will be zero. So, you can just deploy Cassandra over cheap commodity hardware or a cloud environment, where hardware or infrastructure failures may occur. 49. Who uses Cassandra? Cassandra is in wide use around the world, and usage is growing all the time. Companies like Netflix, eBay, Twitter, Reddit, and Ooyala all use Cassandra to power pieces of their architecture, and it is critical to the day-to-da operations of those organizations. to date, the largest publicly known Cassandra cluster by machine count has over 300TB of data spanning 400 machines. Because of Cassandra's ability to handle high-volume data, it works well for a myriad of applications. This means that it's well suited to handling projects from the high-speed world of advertising technology in real time to the high-volume world of big-data analytics and everything in between. It is important to know your use case before moving forward to ensure things like proper deployment and good schema design. 50. When to use secondary indexes? You want to query on a column that isn't the primary key and isn't part of a composite key. The column you want to be querying on has few unique values (what I mean by this is, say you have a column Town, that is a good choice for secondary indexing because lots of people will be form the same town, date of birth however will not be such a good choice). 51. When to avoid secondary indexes? Try not using secondary indexes on columns contain a high count of unique values and that will produce few results. 52. What do you understand by Snitches? A snitch determines which data centers and racks nodes belong to. They inform Cassandra about the network topology so that requests are routed efficiently and allows Cassandra to distribute replicas by grouping machines into data centers and racks. Specifically, the replication strategy places the replicas based on the information provided by the new snitch. All nodes must return to the same rack and data center. Cassandra does its best not to have more than one replica on the same rack. 53. What is Hector? Hector is an open source project written in Java using the MIT license. It was one of the early Cassandra clients and is used in production at Outbrain. It wraps Thrift and offers JMX, connection pooling, and failover. 54. What do you understand by NoSQL CAP theorem? Consistency: means that data is the same across the cluster, so you can read or write to/from any node and get the same data. Availability: means the ability to access the cluster even if a node in the cluster goes down. Partition: Tolerance means that the cluster continues to function even if there is a "partition" (communications break) between two nodes (both nodes are up, but can't communicate). In order to get both availability and partition tolerance, you have to give up consistency. Consider if you have two nodes, X and Y, in a master-master setup. Now, there is a break between network comms in X and Y, so they can't synch updates. At this point you can either: A) Allow the nodes to get out of sync (giving up consistency), or B) Consider the cluster to be "down" (giving up availability) All the combinations available are: CA - data is consistent between all nodes - as long as all nodes are online - and you can read/write from any node and be sure that the data is the same, but if you ever develop a partition between nodes, the data will be out of sync (and won't re-sync once the partition is resolved). CP - data is consistent between all nodes, and maintains partition tolerance (preventing data desync) by becoming unavailable when a node goes down. AP - nodes remain online even if they can't communicate with each other and will resync data once the partition is resolved, but you aren't guaranteed that all nodes will have the same data (either during or after the partition) 55. What is Keyspace in Cassandra? Before doing any work with the tables in Cassandra, we have to create a container for them, otherwise known as a keyspace. One of the main uses for keyspaces is defining a replication mechanism for a group of tables. Example: CREATE KEYSPACE used_cars WITH replication = { 'class': 'SimpleStrategy', 'replication_factor' : 1}; 56. Explain Cassandra data model? The Cassandra data model has 4 main concepts which are cluster, keyspace, column, column family. Clusters contain many nodes(machines) and can contain multiple keyspaces. A keyspace is a namespace to group multiple column families, typically one per application. A column contains a name, value and timestamp. A column family contains multiple columns referenced by a row keys. 57. Can you add or remove Column Families in a working Cluster? Yes, but keeping in mind the following processes. Do not forget to clear the commitlog with ‘nodetool drain’ Turn off Cassandra to check that there is no data left in commitlog Delete the sstable files for the removed CFs 58. What is Replication Factor in Cassandra? ReplicationFactor is the measure of number of data copies existing. It is important to increase the replication factor to log into the cluster. 59. Can we change Replication Factor on a live cluster? Yes, but it will require running repair to alter the replica count of existing data. 60. How to iterate all rows in ColumnFamily? Using get_range_slices. You can start iteration with the empty string and after each iteration, the last key read serves as the start key for next iteration. CASSANDRA Questions and Answers pdf Download Read the full article
0 notes